Nils Jungclaus, Helge Ritter, Gerhard Sagerer
University of Bielefeld
We present an approach to a system architecture that integrates speech and image recognition to follow human instructions in a simplified assembly scenario. The main architectural aspects concern data flow, control flow, and the handling of the different time requirements of modules operating at various levels within a hybrid system. In particular, we propose a generic memory module to coordinate bottom-up, data-driven and top-down, expectation-driven evaluation, as well as focus information. The practical realization is based on a flexible communication tool (DACS), which provides a carefully chosen set of communication primitives that can be used in parallel and that make it easy to integrate existing system modules.