Local image descriptors are computed in areas around salient points in images are essential for many algorithms in computer vision.
Object recognition VOC 2007Object recognition VOC 2007Object recognition VOC 2007
Techniques such as the bags-of-words approach are originally inspired by text retrieval. These have been extended to “2D” techniques on images and build the state-of-the-art in image matching. These approaches are now successfully carried out in both the spatial and the temporal domains for action recognition, video understanding and video matching
spatio-temporal featuresspatio-temporal featuresspatio-temporal features
Video understanding gains great attention in current computer vision research. With growing on-line data sources of videos, big digital private video archives and the need for storage and retrieval of surveillance videos, automated video understanding becomes necessary.