Multiple View Geometry in Computer Vision Chapter 9 Solutions

Epipolar Geometry and the Fundamental Matrix

A 7 minute read, posted on 11 Jun 2020
Last modified on 11 Jun 2020

Tags computer vision, problem solution

Here’s a quick index to all the problems in this chapter.

i ii iii iv vi viii ix

The main index can be found here.

I. Fixating cameras. Suppose two cameras fixate on a point in space such that their principal axes intersect at that point. Show that if the image coordinates are normalized so that the coordinate origin coincides with the principal point then the F33 element of the fundamental matrix is zero.

We know that two corresponding points x and x must obey the relationship xTFx=0. In this case it is given that x=x=(0,0,1)T. Hence the constraint on F becomes (001)F(001)=0 which implies that F33=0.

II. Mirror images. Suppose that a camera views an object and its reflection in a plane mirror. Show that this situation is equivalent to two views of the object, and that the fundamental matrix is skew-symmetric. Compare the fundamental matrix for this configuration with that of: (a) a pure translation, and (b) a pure planar motion. Show that the fundamental matrix is auto-epipolar (as is (a)).

Let a point on the object be X. Then a point on the reflection will be RfX, where Rf represents the 3D reflection. Under the camera P the images of the two points will be PX and PRfX. This situation is equivalent to two cameras P and PRf both viewing X.

A reflection Rf about an arbitrary plane can be decomposed as Rf=He(Λ00T1)He1, where He is a Euclidean transformation (Rt01) and Λ=diag(1,1,1). The proof of this statement is given at the end of this solution.

Without loss of generality, we can take P=K[I0]. Then the second camera will be PRf=K[I0]HeΛHe1. We can write this in terms of R and t as PRf=K[RΛRTRΛRTt+t]=K[RΛRTRΓRTt], where Γ=IΛ=(200000000).

The corresponding canonical cameras1 will be [I0] and K[RΛRTK1RΓRTt] and from Result 9.9 (pg 254), the fundamental matrix relating these two cameras will be F=[KRΓRTt]×KRΛRTK1.

To check if this matrix is skew-symmetric, we must check if it satisfies xTFx=0 for all x. xTFx=xT[KRΓRTt]×KRΛRTK1x

Using result A4.3(p582), we get xTFx=xTKT[RΓRTt]×RΛRTK1x =(K1x)T[RΓRTt]×RΛRT(K1x)

Replacing K1x by x, we can rewrite this as xT[RΓRTt]×RΛRTx

So now we just have to check if [RΓRTt]×RΛRT is skew symmetric. Using result A4.3(p582) again, we get R[ΓRTt]×ΛRT This matrix is skew symmetric as [ΓRTt]× is a matrix of the form (00000x0x0)

This also means F is auto-epipolar.

Just like in the case of translation where the epipole was the vanishing point of the direction of translation, in this case, the epipole is the vanishing point of the direction of reflection i.e. the normal to the plane of reflection or the direction of the symmetry line joining a point and its reflection (X and X in the figure below).

The coordinate system for bilateral symmetry
Joseph L. Mundy and Andrew Zisserman. Repeated structures: Image correspondence constraints and 3D structure recovery.

Proof of Rf=HeΛHe1

A reflection Rf about an arbitrary plane can be decomposed as a translation of the origin to the plane followed by a reflection followed by a translation of the origin back to the original position Rf=TRoT1.

Ro has the form Ro=(A00T1) with the 3×3 matrix A being a householder matrix. A can be further decomposed using spectral decomposition to give A=RΛRT with Λ=diag(1,1,1) and R an orthogonal matrix.

So we can rewrite Rf as

Rf=T(R00T1)(Λ00T1)(RT00T1)T1

Rf=(It)(R00T1)(Λ00T1)(R00T1)1(It)1

Rf=(Rt0T1)(Λ00T1)(Rt0T1)1

Rf=HeΛHe1

where He represents the euclidean transformation given by (Rt0T1) and Λ=diag(1,1,1,1).

III. Show that if the vanishing line of a plane contains the epipole then the plane is parallel to the baseline.

The epipole is the vanishing point of the baseline direction. Parallel planes in 3-space intersect Π in a common line and the image of this line is the vanishing line of the plane. So if the vanishing point of the baseline lies on this vanishing line then the baseline must lie in a plane parallel to the given plane.

IV. Show that the polar of xa intersects the Steiner conic Fs at the epipoles (figure 9.10a). Hint, start from Fe=Fse+Fae=0. Since e lies on the conic Fs, then l1=Fse is the tangent line at e, and l2=Fae=[xa]×e=xa×e is a line through xa and e.

Fe=Fse+Fae=0 Fse=Fae

This means Fse and Fae are the same lines further implying that that the line through xa and e is tangent to Fs at e. Applying a similar logic to e, it is clear that the lines through xa tangent to Fs intersect Fs at e and e. Hence e and e lie on the polar of xa.

VI. Planar motion. It is shown by [Maybank-93] that if the rotation axis direction is orthogonal or parallel to the translation direction then the symmetric part of the essential matrix has rank 2. We assume here that K=K. Then from (9.12), F=KTEK1, and so Fs=(F+FT)/2=KT(E+ET)K1/2=KTEsK1 It follows from det(Fs)=det(K1)2det(Es) that the symmetric part of F is also singular. Does this result hold if KK?

If KK, then the equation becomes Fs=(F+FT)/2=KTEK1+KTETK1

This equation can not be further reduced to a simpler form due to asymmetric components and hence we can not say anything about the rank of Fs.

I think this question is incorrectly placed in this chapter as we haven’t really learnt about reconstructions yet. I might get back to this one after learning more about reconstruction in subsequent chapters.

IX. Following a similar development to section 9.2.2, derive the form of the fundamental matrix for two linear pushbroom cameras. Details of this matrix are given in [Gupta-97] where it is shown that affine reconstruction is possible from a pair of images.

As I’m not interested in pushbroom cameras at the moment, I’m going to skip this one.

References


  1. Mundy, Joseph L., and Andrew Zisserman. Repeated structures: Image correspondence constraints and 3D structure recovery. Joint European-US Workshop on Applications of Invariance in Computer Vision. Springer, Berlin, Heidelberg, 1993. ↩︎