Spatio-temporal curvature and the visual coding of motion

Erhardt Barth

Institute for Signal Processing
Medical University of Luebeck
Ratzeburger Allee 160, 23538 Luebeck, Germany
barth@inb.uni-luebeck.de

 

Abstract:

As opposed to dealing with the geometry of objects in the 3D world, this paper considers the geometry of the visual input itself, i.e. the geometry of the spatio-temporal hypersurface defined by image intensity as a function of two spatial coordinates and time. The Riemann curvature tensor of this hypersurface can be used to estimate speed and direction of motion in four different ways. While this four motion vectors are equal for pure translations, they differ otherwise, e.g. for discontinuous motion. The differences can be used to build confidence measures. Applications demonstrate that the approach can improve the computation of motion by avoiding the aperture problem, discontinuous motions, and occlusions. Moreover, the approach allows to predict global motion percepts and properties of MT neurons and it is argued that important aspects of early and middle level visual coding may be understood as resulting from basic geometric processing of the spatio-temporal visual input.
Keywords: vision models, motion, flow-field, nonlinear features, curvature tensor

Full paper in PDF format

Taxi movie

Click on the image to view the movie.