Motion Flashcards
Why is motion information usefull?
For action recognition and object tracking
What are the basic assumptions behind optical flow?
Brightness constancy and small displasement
What do we know about I(x+u, y+v, t+1) given displacement (u,v) and the brigthness constancy assumption?
It is the same as I(x,y,t)
What is the optical flow constraint?
dI/du * u + DI/dv * v + dI/Dt = 0
What assumption does the the Lucas Kanade method make?
Flow, or displacement, is constant in a local neighbourhood
What equation do we have to solve in the Lucas Kanade method
Ax = b -> x = (A^TA)^-1 A^Tb
where A = [Ix(p1) iy(p1); Ix(p2) Iy(p2)….]
b = [-It(p1); I(t(p2); ……]
What is the condition number of a 2-d matrix?
abs(lambda_max) / abs(lambda_min)
How can the condition number be interpreted?
Large conditon number makes the matrix inverse sensitive to noise and small changes in values.
Describe the implementation of Lucas Kanade method.
1) Calculate Ix, Iy and It.
2) For a small window calculate:
AA^T = sum_p [ Ix^2, IxIy
IxIy, Iy^2]
A^tb = sum_p [Ix It, Iy * It]
3) Calculate the optical flow:
[u,v] = (A^TA)^-1 A^Tx
What is the idea behind the Horn- Schunk method?
Define a global energy function:
Integral_(x,y) ((Ix*u) + (Iy * v) + I_t)^2 +
alpha(abs(gradient(u))^2 + abs(gradient(u))^2) dxdy
And minimize the energy with respect to u(x,y), v(x,y).
What can we do if the small displacement condition does not hold?
Use spatial pyramids of downsampled images. Estimate motion from the coarsest (smallest) image first and iteratively to larger versions.
How does the Middlebury dataset create GT for optical flow?
It use textured UV paint and UV cameras.
How does the Kitty dataset create GT for optical flow?
It uses a lidar.
What metric can we use for flow evauation with ground truth?
Average end point error:
sum_x,y sqrt((u-u_gt)^2 + (v_v_gt)^2)
Averaged over all images
What metric can we use for flow evauation without GT?
We can skip one frame, calculate optical flow between frame 0 and 2 and interpolate frame 1. We then use the true frame 1 for GT.