Here’s a quick index to all the problems in this chapter. I have skipped the text for notes and only included the text for the exercises.
The main index can be found here.
Note that there is a typographical error in Note (i). should be instead.
III. Scaling unbounded point sets. In the case of points at or near infinity in a plane, it is neither reasonable nor feasible to normalize coordinates using the isotropic (or non-isotropic) scaling schemes presented in this chapter, since the centroid and scale are infinite or near infinite. A method that seems to give good results is to normalize the set of points such that
Note that the coordinates and appearing here are the homogeneous coordinates, and the conditions no longer imply that the centroid is at the origin. Investigate methods of achieving this normalization, and evaluate its properties.
An equivalent problem, that of normalizing line coordinates when some lines pass through or near the origin, is discussed in detail in the paper “A new normalized method on line-based homography estimation” 1. I recommend reading the paper to get a complete understanding of how the normalization reduces the condition number of the DLT matrix, making it less sensitive to noise. I will summarize below the transformations that lead to the required normalizations, and their properties. The goal is to spread out the points around the origin instead of having clusters of points far apart.
The following matrix ensures that . In effect, it makes the relative distribution of the point coordinates more homogeneous about the -axis in the 3D space.
The following matrix , applied after , ensures that . It makes the ratio of distance of a point from the plane to its distance from the axis equal to , thus make the distribution of the point coordinates in space more homogeneous.
Finally ensures that , thus making the transformed point coordinates lie on the unit sphere centered at the origin.
Note that simply satisfying the constraint without requiring the other two does not help much as demonstrated by Fig. 2a in the paper which shows that after such a normalization three points are clustered near (0, 0, 1) and one point is far away near (0.5, 0.8, 0), in other words, the matrix is still ill-conditioned.
In effect, the normalizing transformations reduce the condition number of the matrix by homogeneously distributing the point coordinates around the origin in 3D space.
IV. Transformation invariance of DLT. We consider computation of a 2D homography by minimizing algebraic error subject to various constraints. Prove the following cases:
(a) If is minimized subject to the constraint , then the result is invariant under change of scale but not translation of coordinates.
(b) If instead the constraint is then the result is similarity invariant.
(c) Affine case: The same is true for the constraint .
For each case, we need to prove that the given transformation class preserves the constraint under the transformation .
Note that in each case, we are only concerned with the last row of .
Let be a similarity transformation. Any affine transformation leaves the last component of a homogeneous vector unchanged. Hence the last row of will be left unchanged by multiplication with , i.e .
Thus the last row of only depends on the result of right multiplication with , i.e. .
(a) If represented a change of scale transform then so would . Such a matrix would have the last column as and hence would leave unchanged. Hence if , then .
If represented a translation, then so would . Such a matrix would have the last column as making . Hence being equal to 1 would not imply that .
Thus, if is minimized subject to the constraint , then the result is invariant under change of scale but not translation of coordinates.
(b) If represented a similarity transform then the first two columns of would be and . This means and . Furthermore,
As implies , the DLT algorithm will be invariant under this constraint.
(c) From the discussion in (b), we can see that if then . The last column of will be of the form if is a similarity transform leading to . As the constraints imply , the DLT algorithm will be invariant under these constraints.
V. Expressions for image coordinate derivative. For the map , derive the following expressions (where are the inhomogeneous coordinates of the image point):
(a) Derivative wrt where is the -th row of .
(b) Derivative wrt with as defined in (4.2-p89).
(a) It is clear that
Hence,
Now,
Applying the chain rule
Substituting the values of the partial derivatives we calculated earlier
Substituting and
Finally, substituting
By applying a similar process we observe that
Hence, we have established that
(b) In a similar fashion, applying the chain rule, we get
It is clear that
Substituting this into the expansion above
Substituting and
Finally, substituting
Performing a similar process for , we get
Hence, we have established that
VI. Sampson error with non-isotropic error distributions. The derivation of Sampson error in section 4.2.6(p98) assumed that points were measured with circular error distributions. In the case where the point is measured with covariance matrix it is appropriate instead to minimize the Mahalanobis norm . Show that in this case the formulae corresponding to (4.11-p99) and (4.12-p99) are and Note that if the measurements in the two images are independent, then the covariance matrix will be block-diagonal with two diagonal blocks corresponding to the two images.
In this case, we need to find the extrema of .
Derivative of with respect to is as is symmetric (due to being symmetric). Hence taking the derivative of the whole expression with respect to and equating to zero will give us
Taking the derivative with respect to and equating to zero will give us back the original constraint, , as before. Substituting for leads to
and
VIII. Minimizing geometric error for affine transformations. Given a set of correspondences , find an affine transformation that minimizes geometric error (4.8-p95). We will step through the derivation of a linear algorithm based on Sampson’s approximation which is exact in this case. The complete method is summarized in algorithm 4.7.
(a) Show that the optimum affine transformation takes the centroid of the to the centroid of the , so by translating the points to have their centroid at the origin, the translation part of the transformation is determined.
Let’s express the affine transformation as a composition of transformation and scaling (in that order) as , where is the scaling transform and is the translation.
Then will have the form
with .
The residual in this case will be
Assuming that , the inhomogeneous coordinates are simply and the residual can be written as
The Jacobian for this residual will then be
where is the identity matrix.
The Sampson error for the correspondence can then be calculated as
From this it is clear that minimizing the geometric error is the same as minimizing the Sampson error which is the same as minimizing the algebraic error for a translation.
Expanding this error and dropping the constant, we get the error for the correspondence to be
The sum of the errors for all such correspondences will be
To find the optimal values of and we take the derivatives w.r.t. and and set them to zero. By doing this we get
Hence,
where and are the centroids of the points in each image respectively.
Thus the translation portion of the optimal affine transformation takes the centroid of points in the first image to the centroid of the points in the second.
References
- Zeng Hui, Xiaoming Deng, and Zhanyi Hu. A new normalized method on line-based homography estimation. Pattern Recognition Letters 29.9 (2008): 1236-1244. [return]