There are two overall goals of Computer Vision. Many of the methods and techniques applied in artificial vision systems are motivated by biological vision systems. Those parts of human and animal brains that are occupied with the processing of visual information are probably the parts best understood today. Vision serves as the example par excellence for biological cognition. Thus, the first motivation to occupy oneself with Computer Vision, is to gain deeper insight into the nature of cognition and thus into an integral part of consciousness. This can be achieved by proposing computational models of visual information processing, implementing them on a computer system, running simulations, comparing the capabilities of the artificial system with those of biological systems, and adapting the theory of processing where appropriate or inspiring new investigations of the biological systems.
The second goal of Computer Vision is the analysis of information extracted from images with the purpose of understanding their content. As explained under Interactive Systems, one of the demands made on interactive systems is their ability to perceive and understand their environment. Thus, Computer Vision provides the visual modality of automatic scene understanding. Typical technologies comprise the visual acquisition, recognition, and tracking of objects in videos and their localization in the environment.
Major challenges of Computer Vision consist in
To master the challenges of interactive systems such as big data and real-time processing, learning and recognition should not be regarded as two separated, sequential processes. Rather, learning to recognize or classify objects is supposed to be incremental and should persist while a system is in the field already.
We have a strong background in classical Computer Vision topics with experiences in low level image processing, image segmentation, object recognition, and person tracking. In the context of the DFG-funded project "Dynamic Learning for Geometric and Graphic Object Acquisition" (2005-2012) we have developed a system that is able to handle a huge input data space by an intelligent selection of only those data relevant for a specific task. Furthermore, we proposed a method for object recognition by interaction, which involves a robotic arm rotating objects to views advantageous for recognition.
More recent activities include the fusion of techniques from Computer Vision, Computer Graphics, and Machine Learning to not only utilize 2D information for object classification but 3D information as well. Here we also deal with the question of appropriate object descriptors and their dynamic, context-dependent selection.