Introduction to Projective Geometry Solutions 1.2

The Elements of Perspective

A 30 minute read, posted on 3 Jul 2019
Last modified on 30 Jul 2019

Tags computer vision, projective geometry, problem solution

Please read this introduction first before looking through the solutions. Here’s a quick index to all the problems in this section.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25

To aid in solving problems in this section I recommend reviewing conic sections1 and their equations in the standard form and the General Cartesian form.

For those who don’t have the book at hand, here are the details of Example 1.

  • The viewing point, $C$, is the point $(0, -3, 2)$.
  • The picture plane is the plane $y = 0$.
  • The plane containing the scene is the plane $z = 0$.
  • $\Gamma$ is the circle $X^2 + (Y - 2)^2 = 1$.
  • The image of $\Gamma$ is the ellipse $X’^2 + 6Z’^2 - 9Z’ + 3 = 0$.
  • The $X$, $Y$ system is the $xy$ plane and the $X’$, $Z’$ system is the $xz$ plane.

1. In Example 1, does the center of the ellipse which is the image of $\Gamma$ coincide with the image of the center of $\Gamma$?

As you can see from the construction of the perspective transformation below, the image $A$ of the center of the circle $B$ just misses the center of the ellipse $N$.

You can also verify this using the equations of projection derived in Example 1.

$$ X’_A = \dfrac{3 * 0}{2 + 3} = 0 $$ $$ Z’_A = \dfrac{2 * 2}{2 + 3} = 0.8 $$

Thus A has coordinates $(0, 0, 0.8)$.

The ellipse’s center can be found by converting the equation in the General Cartesian form to the standard form. So

$$X’^2 + 6Z’^2 - 9Z’ + 3 = 0$$ can be rewritten as $$\dfrac{X’^2}{0.375} + \dfrac{(Z’ - 0.75)^2}{0.0625} = 1$$ to get the center of the ellipse $N$ as $(0, 0, 0.75)$. Thus it is clear that $A$ and $N$ do not coincide.

Note: I have double checked this answer and I think the answer in the book should be $Z’ = \dfrac{3}{4}$ instead of $Z’ = \dfrac{3}{2}$.

2. In Example 1, what is the image of the family of parallel lines defined in the object plane by the equation $X = 3Y + k$? By the equation $Y = mX + k$?

To find the image of the family of parallel lines defined by $X = 3Y + k$, we substitute $X$ and $Y$ with their expressions in terms of $X’$ and $Z’$ as derived in Example 1, in the equation for the family of lines. This gives us

$$ \dfrac{2X’}{2-Z’} = 3 * \dfrac{3Z’}{2-Z’} + k$$ $$\implies$$ $$ 2X’ = (9 - k)Z’ + 2k.$$

For all values of $k$ this line passes through $X’ = 9$ and $Z’ = 2$ which is a point on the vanishing line $Z’ = 2$, as required by Theorem 2 (Page 7).

The image of a family of parallel lines under perspective transformation in general is a family of lines passing through a point. In this case, the point is A $(9, 0, 2)$. If you’re viewing this in GeoGebra, hit the play button next to the slider variable $k$ to view the lines being drawn one-by-one.

Similarly, the image of the family of parallel lines defined in the object plane by $Y = mX + k$ is given by

$$(3 + k)Z’ = 2mX’ + 2k.$$

As this family of lines must pass through the vanishing line in the picture plane, it must pass through $Z’ = 2$. Solving for $X’$ we get $X’ = \dfrac{3}{2m}$. So the image of this family of parallel lines must pass through $(\dfrac{3}{2m}, 0, 2)$, $m \ne 0$.

In the special case that $m = 0$, the family of lines $y = k$ is parallel to the x-axis. The images of these lines, $z = \dfrac{2k}{3 + k}$, are also parallel to the x-axis. The x-axis is common to both the picture and object plane. So lines in the object plane that are parallel to such a common line, have images that are also parallel; their images do not converge to the vanishing line in the picture plane.

3. In Example 1, what is the image of the parabola whose equation in the object plane is $Y = X^2$? What is the image of the parabola whose equation is $Y = 1 - X^2$?

Rewriting $X$ and $Y$ in terms of their coordinates on the picture plane we get the image of both the parabolas as follows.

For parabola $Y = X^2$, we have

$$\dfrac{3Z’}{2 - Z’} = (\dfrac{2X’}{2 - Z’})^2$$ $$\implies$$ $$(3Z’)(2 - Z’) = 4X’^2$$ $$\implies$$ $$6Z’ - 3Z’^2 = 4X’^2$$ $$\implies$$ $$3 - 3(Z’ - 1)^2 = 4X’^2$$ $$\implies$$ $$(Z’ - 1)^2 + \dfrac{4}{3}X^2 = 1.$$

This is the equation of an ellipse in the standard form with center at $X’ = 0$, $Z’ = 1$ and axis lengths 2 and $\sqrt 3$.

For parabola $Y = 1 - X^2$, we have

$$\dfrac{3Z’}{2 - Z’} = 1 - (\dfrac{2X’}{2 - Z’})^2$$ $$\implies$$ $$1 - \dfrac{3Z’}{2 - Z’} = (\dfrac{2X’}{2 - Z’})^2$$ $$\implies$$ $$(2 - 4Z’)(2 - Z’) = 4X’^2$$ $$\implies$$ $$4 + 4Z’^2 - 10Z’ = 4X’^2$$ $$\implies$$ $$4(Z’ - \dfrac{5}{2})^2 - \dfrac{9}{4} = 4X’^2$$ $$\implies$$ $$(\dfrac{Z’ - \dfrac{5}{4}}{\dfrac{3}{4}})^2 - (\dfrac{X’}{\dfrac{3}{4}})^2 = 1$$

This is the equation of a hyperbola with center at $X’ = 0$, $Z’ = \dfrac{5}{4}$ and asymptotes $Z’ = \pm X’ + \dfrac{5}{4}$.

Isn’t it surprising to see a parabola looking like an ellipse and even a hyperbola depending on where you view it from? This made me acknowledge the reality of different perspectives and that my reality is no less right (or wrong) than another person’s take on a situation.

In a later section we will learn the specific conditions under which a parabola’s image will become an ellipse, a parabola and a hyperbola respectively.

4. Rework Example 1 if $C$ is the point $(1, -3, 2)$.

The equation for the projecting line PC, passing through a general point $P (X, Y, 0)$ on the object plane and the viewing point $C (1, -3, 2)$, is

$$ \dfrac{x - 1}{X - 1} = \dfrac{y + 3}{Y + 3} = \dfrac{z - 2}{0 - 2}.$$

To find the coordinates $(X’, 0, Z’)$ of the point $P’$ in which this line pierces the picture plane, we need to substitute $y = 0$ and solve these equations for $x$ and $z$, getting

$$x \equiv X’ = \dfrac{3X + Y}{Y + 3}$$ $$z \equiv Z’ = \dfrac{2Y}{Y + 3}.$$

Thus the image of the given point $(2, 1, 0)$ is $(\dfrac{3*2 + 1}{1 + 3}, 0, \dfrac{2*1}{1 + 3})$ or $(\dfrac{7}{4}, 0, \dfrac{1}{2})$

To find the image of the family of parallel lines defined in the object plane by the equation $y = x + k$, we first express $X$ and $Y$ in terms of $X’$ and $Z’$, getting

$$X = \dfrac{2X’ - Z’}{2 - Z’}$$ $$Y = \dfrac{3Z’}{2 - Z’}.$$

Then we substitute this in the equation of the line $Y = X + k$ to get

$$\dfrac{3Z’}{2 - Z’} = \dfrac{2X’ - Z’}{2 - Z’} + k$$ $$\implies$$ $$(4 + k)Z’ = 2X’ + 2k.$$

Thus the image of the given family of parallel lines is the family of lines passing through the point $(4, 0, 2)$.

To find the image of the circle, we again substitute the expression of object coordinates in terms of image coordinates in the equation of the circle to get

$$(\dfrac{2X’ - Z’}{2 - Z’})^2 + (\dfrac{3Z’}{2 - Z’} - 2)^2 = 1$$ $$\implies$$ $$4X’^2 + 25Z’^2 -4X’Z’ - 36Z’ + 12 = 0,$$

which is the equation of an ellipse in the General Cartesian form.

5. Rework Example 1 if the picture plane is the plane $y = 1$.

Taking the equation of the projecting line PC from the book, we get

$$ \dfrac{x - 0}{X - 0} = \dfrac{y + 3}{Y + 3} = \dfrac{z - 2}{0 - 2}.$$

To find the coordinates $(X’, 0, Z’)$ of the point $P’$ in which this line pierces the picture plane, we need to substitute $y = 1$ and solve these equations for $x$ and $z$, getting

$$x \equiv X’ = \dfrac{4X}{Y + 3}$$ $$z \equiv Z’ = \dfrac{2Y - 2}{Y + 3}.$$

Thus the image of the given point $(2, 1, 0)$ is $(\dfrac{4*2}{1 + 3}, 1, \dfrac{2*1 - 2}{1 + 3})$ or $(2, 1, 0)$! This point lies on the intersection of the picture plane and the object plane, and hence is invariant under the given perspective transformation.

To find the image of the family of parallel lines defined in the object plane by the equation $y = x + k$, we first express $X$ and $Y$ in terms of $X’$ and $Z’$, getting

$$X = \dfrac{2X’}{2 - Z’}$$ $$Y = \dfrac{3Z’ + 2}{2 - Z’}.$$

Then we substitute this in the equation of the line $Y = X + k$ to get

$$\dfrac{3Z’ + 2}{2 - Z’} = \dfrac{2X’}{2 - Z’} + k$$ $$\implies$$ $$(3 + k)Z’ = 2X’ + 2k - 2.$$

Thus the image of the given family of parallel lines is the family of lines passing through the point $(4, 1, 2)$.

To find the image of the circle, we again substitute the expression of object coordinates in terms of image coordinates in the equation of the circle to get

$$(\dfrac{2X’}{2 - Z’})^2 + (\dfrac{3Z’ + 2}{2 - Z’} - 2)^2 = 1$$ $$\implies$$ $$X’^2 + 6Z’^2 - 4Z’ = 0,$$

which is the equation of an ellipse in the General Cartesian form.

6. Rework Example 1 if $C$ is the point $(0, -2, 1)$ and the picture plane is the plane $y = 2$.

The equation for the projecting line PC, passing through a general point $P (X, Y, 0)$ on the object plane and the viewing point $C (0, -2, 1)$, is

$$ \dfrac{x - 0}{X - 0} = \dfrac{y + 2}{Y + 2} = \dfrac{z - 1}{0 - 1}.$$

To find the coordinates $(X’, 0, Z’)$ of the point $P’$ in which this line pierces the picture plane, we need to substitute $y = 2$ and solve these equations for $x$ and $z$, getting

$$x \equiv X’ = \dfrac{4X}{Y + 2}$$ $$z \equiv Z’ = \dfrac{Y - 2}{Y + 2}.$$

Thus the image of the given point $(2, 1, 0)$ is $(\dfrac{4*2}{1 + 2}, 2, \dfrac{1 - 2}{1 + 2})$ or $(\dfrac{8}{3}, 2, \dfrac{-1}{3})$

To find the image of the family of parallel lines defined in the object plane by the equation $y = x + k$, we first express $X$ and $Y$ in terms of $X’$ and $Z’$, getting

$$X = \dfrac{X’}{1 - Z’}$$ $$Y = \dfrac{2Z’ + 2}{1 - Z’}.$$

Then we substitute this in the equation of the line $Y = X + k$ to get

$$\dfrac{2Z’ + 2}{1 - Z’} = \dfrac{X’}{1 - Z’} + k$$ $$\implies$$ $$(2 + k)Z’ = X’ + k - 2.$$

Thus the image of the given family of parallel lines is the family of lines passing through the point $(4, 2, 1)$.

To find the image of the circle, we again substitute the expression of object coordinates in terms of image coordinates in the equation of the circle to get

$$(\dfrac{X’}{1 - Z’})^2 + (\dfrac{2Z’ + 2}{1 - Z’} - 2)^2 = 1$$ $$\implies$$ $$X’^2 + 15Z’^2 + 2Z’ - 1 = 0,$$

which is the equation of an ellipse in the General Cartesian form.

7. In a perspective transformation, are there any points which coincide with their images?

As we discovered in problem 5, the points on the line of intersection of the image and object planes coincide with their images. The line through the viewing point intersects both the planes at the same point everywhere on this line. There is another special point that is invariant but as it is explained in detail in the next section, I will not go through it here.

8. In a perspective transformation, is every point in the picture plane the image of some point in the object plane? Does every point in the object plane have an image in the picture plane?

Consider the line of intersection of the picture plane with the plane parallel to the object plane passing through the viewing point. The projecting line for a point on this line will never intersect with the object plane. Hence the points on this line are not the images of any points in the object plane.

In the figure below, points A, B, D, F & G are not the images of any points in the object plane.

Similarly, consider the line of intersection of the object plane with the plane parallel to the picture plane passing through the viewing point. The projecting line for a point on this line will never intersect with the picture plane. Hence the points on this line will not have an image in the picture plane.

In the figure below, points A, B, D, E, F do not have any images in the picture plane.

9. In a perspective transformation, is a line always represented by a line? Is a non-singular conic always represented by a non-singular conic?

The projecting lines from the viewing point to a line on the object plane extended to infinity on both sides together span an entire plane. The intersection of two planes is a line. So the image of a line under a perspective transformation will always be a line.

The only exception to this case is the line that has no image as we discovered in the solution to exercise 8.

We can look at the perspective projection of conic sections in the following way. The projecting lines from the viewing point to the conic on the object plane form a cone. The intersection of a plane and a cone is by definition a conic section! So the image on the picture plane must be a conic section too.

10. In a perspective transformation, are there any parallel lines which are not represented in the picture plane by lines meeting on the vanishing line?

Lines on the object plane that are parallel to the line of intersection of the object and picture plane are transformed into parallel lines that do not meet on the vanishing line.

F’G’ and B’D’, the images of FG and BD respectively which are parallel to the x-axis, remain parallel to it on the picture plane.

11. In a perspective transformation, is a circle ever represented by a circle?

The logic that is used in the solution here is to make the coefficients of the square terms, $(X’)^2$ and $(Z’)^2$, equal. This is because we know that any quadratic equation in two variables that has the same coefficients for the square terms and does not have the $xy$ term, is the equation of a circle in the General Cartesian form.

A general circle is represented by a quadratic equation in two variables as follows

$$x^2 + y^2 + ax + by + c = 0.$$

The image of this circle under the perspective transformation described in Example 1 will be

$$(\dfrac{2X’}{2 - Z’})^2 + (\dfrac{3Z’}{2 - Z’})^2 + a(\dfrac{2X’}{2 - Z’}) + b(\dfrac{3Z’}{2 - Z’}) + c = 0.$$

This expands to

$$4(X’)^2 + (9 - 3b + c)(Z’)^2 - 2aX’Z’ + 4aX’ + (6b - 4)Z’ + 4c = 0$$

Now in order to make the coefficients of the square terms equal, we must have

$$4 = 9 - 3b + c$$ $$\implies$$ $$c = 3b - 5$$

And for the $X’Z’$ term to be zero, $a$ must be zero.

So the circle

$$x^2 + y^2 + by + 3b - 5 = 0$$

is transformed to the circle

$$(X’)^2 + (Z’)^2 + (\dfrac{3}{2}b - 1)Z’ + 3b - 5 = 0$$

by the perspective transformation described in Example 1.

$$$$

12. In a perspective transformation, is a circle always represented by an ellipse? If not, what other possibilities are there?

Generalizing the solution to the previous question, the image of a circle under the perspective transformation described in Example 1 will be

  • an ellipse if 9 - 3b + c > 0
  • a parabola if 9 - 3b + c = 0
  • a hyperbola if 9 - 3b + c < 0

using the discriminant rule for classifying conics.

So, the image of a circle under a perspective transformation can be an ellipse, a parabola or a hyperbola.

13. What are the possibilities for the image of a parabola under perspective transformation? What are the possibilities for the image of a hyperbola?

We have already proven in problem 9 that the image of a non-singular conic will always be a non-singular conic. So we just need to delineate which conics can be transformed into which other conics.

Consider a general parabola

$$ax^2 + bxy + cy^2 + dx + ey + f = 0,$$

with $b^2 = 4ac$. The image of this parabola under the perspective transformation described in Example 1 will be

$$a(\dfrac{2X’}{2 - Z’})^2 + b(\dfrac{2X’}{2 - Z’})(\dfrac{3Z’}{2 - Z’}) + c(\dfrac{3Z’}{2 - Z’})^2 + d(\dfrac{2X’}{2 - Z’}) + e(\dfrac{3Z’}{2 - Z’}) + f = 0.$$

Expanding this we get

$$4a(X’)^2 - (6b - 2d)X’Z’ + (9c - 3e + f)(Z’)^2 + 4dX’ + (6e - 4f)Z’ + 4f = 0.$$

Using the discriminant rule, this will be

  • an ellipse if $(6b - 2d)^2 - 16a(9c - 3e + f) < 0$.
  • a parabola if $(6b - 2d)^2 - 16a(9c - 3e + f) = 0$.
  • a hyperbola if $(6b - 2d)^2 - 16a(9c - 3e + f) > 0$.

So the image of a parabola under perspective transformation has the possibility of being any conic.

The same kind of calculations can be performed for a hyperbola to prove that the image of a hyperbola under perspective transformation has the possibility of being any conic.

14. In a perspective transformation, is a segment always represented by a segment?

By segment I’m assuming the author meant a line segment (as opposed to a segment of a circle).

A line segment is uniquely determined by two points, the start and the end of the segment. If neither of these points lie on the line that has no image (see solution to problem 8), then both of them have an image. In this case, the image of a segment will be a segment.

If either endpoint of the segment lies on the line that has no image then the image of the segment will be a half-open line segment (a line segment that includes only one endpoint). The endpoint that does have an image is the included endpoint and the segment extends to infinity on the other side.

If the segment lies on the line that has no image, then the segment will also not have an image.

15. In a perspective transformation, if $P$ is a point between $Q$ and $R$, is it true that the image of $P$ is between the images of $Q$ and $R$?

If the point $P$ lies on the line that has no image (see the solution to problem 8) then $P$ will have no image. In this case, the image of $P$ can’t possibly lie between $Q$ and $R$.

I’ll admit that the proof of the general case had me stumped for a while. After all, it looked so obvious! I had to go back to the fundamentals and look up the theorems related to betweenness of points and lines to get to the solution.

To prove the general case, we can use this theorem of geometry2 that says if A, B, and C are three noncollinear points and if D is a point on the line BC then the point D is between the points B and C if and only if the ray is between the rays AB and AC.

Using this theorem on the perspective transformation of the 3 collinear points $Q * P * R$3, we see that the projecting line CP is in between the projecting lines CQ and CR because P is given to be between Q and R. We’ve already established that the image of a line under a perspective transformation is a line and so P’, Q’ and R’ lie on a line. Using the aforementioned theorem one more time, we see that the image P’ of P is in between the images Q’ and R’ because CP’ is in between CQ’ and CR’, which is what we set out to prove.

16. In a perspective transformation, if $P:(X, Y)$ is a general point in the object plane $z = 0$, and if the viewing point is an arbitrary point $C:(a, b, c)$, find the coordinates $(X’, Z’)$ of the image of $P$ on the picture plane $y = 0$. What are the equations expressing $X$ and $Y$ in terms of $X’$ and $Z’$?

The equation of the projecting line $PC$ through $P (X, Y, 0)$ and $C (a, b, c)$ will be

$$\dfrac{x - a}{X - a} = \dfrac{y - b}{Y - b} = \dfrac{z - c}{0 - c}.$$

Intersection of the line $PC$ with the picture plane $y = 0$ will give us the coordinates for $P’$ as

$$x \equiv X’ = \dfrac{aY - bX}{Y - b}$$ $$z \equiv Z’ = \dfrac{cY}{Y - b}.$$

Expressing the object coordinates in terms of the image coordinates we get

$$X = \dfrac{aZ’ - cX’}{Z’ - c}$$ $$Y = \dfrac{bZ’}{Z’ - c}.$$

We can do a quick check by substituting the values given in Example 1 in these equations and see that we do get the same result for $X’$, $Z’$, $X$ and $Y$ as in the book.

17. In a perspective transformation, are the lengths of all segments altered in the same ratio?

For such questions if we just find one example that contradicts the statement then we’re done.

I will take 3 arbitrary collinear points $A$, $B$ and $D$ and find their images $A’$, $B’$ and $D’$ under the perspective transformation described in Example 1.

As you can see $\dfrac{AB}{A’B’} \neq \dfrac{BD}{B’D’}.$

18. In a perspective transformation, is the angle between two lines the same as the angle between the images of the lines?

I’ll employ similar logic here as in the previous solution. We will find one example that contradicts the statement.

Here, I’ve taken three non-collinear points and found their images under the perspective transformation defined in Example 1. Comparing the two angles, we see that

$$\angle AFE \neq \angle A’F’E’$$

19. In a perspective transformation, if a line is tangent to a circle, is the image of the line tangent to the image of the circle?

We know that a perspective transformation maps points to points (the projecting line that intersects both the object and picture plane meets each plane in exactly one point). This implies that the point of tangency must be mapped to a single point on the picture plane. Now as the point of tangency lies on both the tangent and the circle, its image must also lie on the images of the tangent and the circle.

We can also establish that no two points on one plane are mapped to the same point in the other plane. If this were not true then two projecting lines, that already meet at the viewing point, will meet again at a point in one of the planes, which is not possible as two lines can not intersect in more than one point.

This means that the images of the tangent and the circle can not share any other point other than the point of tangency as the tangent and the circle by definition do not share a point other than the point of tangency in the object plane.

Any line that only shares only one point with a circle is a tangent to the circle (this is not true for a parabola and hence this statement can not be generalized for all conics). Thus, we can rest assured that the image of a tangent to a circle remains tangent to the image of that circle.

In the image below, the points of tangency on the object plane are $A$ and $B$, and the points of tangency on the image plane are their images $A’$ and $B’$.

20. In Example 1, verify that for the four collinear points, $P_1:(0, 1, 0)$, $P_2:(-1, 2, 0)$, $P_3:(-3, 4, 0)$, $P_4:(-7, 8, 0)$ and the images $P_1’$, $P_2’$, $P_3’$, $P_4’$, the following relation holds.

$$ \dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \dfrac{(P_1’P_3’)(P_2’P_4’)}{(P_1’P_4’)(P_2’P_3’)}$$

I’m going to cheat here and use numpy.linalg.norm because, well, I’m really bad at arithmetic and even if I was good, a computer would always be better at it than me.

$$\dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \dfrac{4.24 * 8.49}{9.90 * 2.83} = 1.29$$

The coordinates of the images are $P’_1 = (0 , 0 , 0.5)$, $P’_2 = (-0.6, 0, 0.8)$, $P’_3 = (-1.29, 0, 1.14)$ and $P’_4 = (-1.91, 0, 1.45)$ $$\dfrac{(P_1’P_3’)(P_2’P_4’)}{(P_1’P_4’)(P_2’P_3’)} = \dfrac{1.44 * 1.46}{2.13 * 0.77} = 1.29$$

21. If $P_i:(X_i, Y_i), i = 1, 2, 3, 4$, are four collinear points in the object plane $z = 0$, and if $P’_i:(X’_i, Y’_i)$ are the images of these points on the image plane $y = 0$ under a perspective transformation with center $C:(a, b, c)$, show that

$$ \dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \dfrac{(P_1’P_3’)(P_2’P_4’)}{(P_1’P_4’)(P_2’P_3’)}$$

Let’s consider the four collinear points $P_1$, $P_2$, $P_3$ and $P_4$ to lie on the general line

$$y = mx + c.$$

So the four sets of coordinates will be $(x_1, mx_1 + c, 0)$, $(x_2, mx_2 + c, 0)$, $(x_3, mx_3 + c, 0)$, and $(x_4, mx_4 + c, 0)$. Substituting this in the given ratio, we get

$$\dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \sqrt{\dfrac{((mx_1 - mx_3)^2 + (x_1 - x_3)^2)((mx_2 - mx_4)^2 + (x_2 - x_4)^2)}{((mx_2 - mx_3)^2 + (x_2 - x_3)^2)((mx_1 - mx_4)^2 + (x_1 - x_4)^2)}}$$ $$\implies$$ $$\dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \sqrt{\dfrac{(m^2 + 1)(x_1 - x_3)^2(m^2 + 1)(x_2 - x_4)^2}{(m^2 + 1)(x_2 - x_3)^2(m^2 + 1)(x_1 - x_4)^2}}$$ $$\implies$$ $$\dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \sqrt{\dfrac{(x_1 - x_3)^2(x_2 - x_4)^2}{(x_2 - x_3)^2(x_1 - x_4)^2}}$$ $$\implies$$ $$\dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \dfrac{(x_1 - x_3)(x_2 - x_4)}{(x_2 - x_3)(x_1 - x_4)}$$

We have already established that the image of a line under perspective projection is a line. So the images of the 4 collinear points will also be collinear. We can represent these collinear points as lying on the general line

$$y = m’x + c’.$$

Using the same logic as above, we get

$$\dfrac{(P’_1P’_3)(P’_2P’_4)}{(P’_1P’_4)(P’_2P’_3)} = \dfrac{(x’_1 - x’_3)(x’_2 - x’_4)}{(x’_2 - x’_3)(x’_1 - x’_4)}$$

Expressing the image coordinates in terms of the object coordinates as derived in the solution to exercise 16, we get

$$\dfrac{(P’_1P’_3)(P’_2P’_4)}{(P’_1P’_4)(P’_2P’_3)} = \dfrac{(\dfrac{(am’x_1 + ac’ - bx_1)}{(m’x_1 + c’ - b)} - \dfrac{(am’x_3 + ac’ - bx_3)}{(m’x_3 + c’ - b)})(\dfrac{(am’x_2 + ac’ - bx_2)}{(m’x_2 + c’ - b)} - \dfrac{(am’x_4 + ac’ - bx_4)}{(m’x_4 + c’ - b)})}{(\dfrac{(am’x_2 + ac’ - bx_2)}{(m’x_2 + c’ - b)} - \dfrac{(am’x_3 + ac’ - bx_3)}{(m’x_3 + c’ - b)})(\dfrac{(am’x_1 + ac’ - bx_1)}{(m’x_1 + c’ - b)} - \dfrac{(am’x_4 + ac’ - bx_4)}{(m’x_4 + c’ - b)})}$$

$$\implies$$

$$\dfrac{(P’_1P’_3)(P’_2P’_4)}{(P’_1P’_4)(P’_2P’_3)} = \dfrac{b(am’ + c’ - b)(x_3 - x_1)b(am’ + c’ - b)(x_4 - x_2)}{b(am’ + c’ - b)(x_3 - x_2)b(am’ + c’ - b)(x_4 - x_1)}$$

$$\implies$$

$$\dfrac{(P’_1P’_3)(P’_2P’_4)}{(P’_1P’_4)(P’_2P’_3)} = \dfrac{(x_3 - x_1)(x_4 - x_2)}{(x_3 - x_2)(x_4 - x_1)}$$

Hence the two ratios are equal.

22. Given a triangle in the plane $z = 0$, is it possible to find a viewing point $C$, from which the triangle will appear on the plane $y = 0$ as a right triangle?

You certainly are a geometry pro if you got this one straightaway. I managed to solve this only after reading the next two sections.

The key to solving this exercise is to remember that points on the line common to object and image planes are invariant under perspective projection (see the solution to exercise 7).

I’ll explain the logic behind the construction first and then summarize the steps at the end. The construction proceeds by identifying constraints on the points and lines of the image triangle one-by-one to reach the end goal.

Consider the triangle $\bigtriangleup ABC$ on the object plane. Let’s say we want to transform $\angle ACB$ to a right angle. It is helpful for the purpose of this solution to think of the sides of the triangle as lines rather than line segments. In other words, we want the images $A’C’$ and $B’C’$ of the two lines $AC$ and $BC$ must form a right angle.

Let’s think about the image of a line under perspective transformation in general. The image of a line $p$ must pass through the intersection of $p$ and the line common to the object and image plane (the x-axis in this case), provided they intersect. This means for any line, we know at least one point that lies on the image of the line.

So, the first constraint we discovered is that one point each, on the images of the two sides of the triangle, are fixed. We must now find the direction of the lines through these points so that they meet at a right angle.

One way I know of to construct a right angle between two given points is to construct a circle with the two points lying on opposite ends of the diameter. We also know that the angle inscribed in a semi-circle is always going to be a right angle. So I’ll find the intersection of $BC$ and $AC$ with the x-axis as $D$ and $E$ and draw a circle with the diameter being $DE$. Any point, $C’$ on this the circle will form a right angle with $D$ and $E$.

It is important to remember that $D$ and $E$ are not images of $B$ and $C$. They are points on the image lines $B’C’$ and $A’C’$. What we have ensured thus far is that the images of the lines $AC$ and $BC$ will meet at right angles if the image of $C$ lies on the circle.

If we find a way to constrain the image of $C$ to fall on this circle, then we’re done.

How do we make the image of $C$ lie on the circle? Connect $C$ and a point $C’$ (its intended image) on the circle, with a line. Now any point on this line, when taken as the viewing point, will result in the image of $C$ being the chosen $C’$ on the circle. As this construction will work equally well for any point on the circle, we get a double cone of possibilities! Any point on the double cone with apex at C and the circle as the directrix can be chosen as the viewing point to get a point on the circle to be the image of $C$.

As the image of C falls on the circle, the angle between $A’C’$ and $B’C’$ is guaranteed to be a right angle.

Here is the summary of the algorithm for this construction.

  1. Find the intersection of any two sides of the triangle with the line common to the object and picture planes. If one of the sides is parallel to this line then use the other side. More than one side of a triangle can’t be parallel to the same line.
  2. Draw a segment with these intersections as endpoints.
  3. Draw a circle with the segment constructed in #2 as its diameter.
  4. Draw a double cone with the vertex common to both sides of the triangle as the apex of the cone and the circle constructed in #3 as its directrix. (w.r.t to the image: that’s the best I could do to draw an oblique cone in GeoGebra.)
  5. Choose any point on this double cone to be the viewing point.


Steps 1 & 2
Step 3



Step 4
Step 5

The image of the triangle under this construction will be a right triangle.

23. Answer Exercise 22 if it is required that the image of the given triangle be isosceles.

This is another interesting one. The solution is in the same vein as the answer to the previous question. The only additional piece you will need is to know how to find the locus of points at which two given points subtend a particular angle.

If the angle is a right angle, we can construct a circle with the two points on the diameter as in the previous exercise. What if the angle is $30^\circ$? What about $72.5^\circ$? I still think it’s a circle/arc, but what are the parameters of this circle? Let’s try to figure it out.

We know that the angle subtended at the center by any two points on a circle is twice the angle subtended at any other point on the circle by the same two points. So if we settle on a particular angle, say $\theta$, then all we need to do is find the radius of the circle for which the given pair of points subtend an angle of $2\theta$ at the center.

Given the length of a chord and the angle it subtends at the center, we can find the radius of the circle that contains this chord using trigonometry as shown below. Here $AC$ is the radius. Note that $CE = ED$ because the radius that is perpendicular to a chord also bisects the chord.

Thus, the locus of the points at which two given points, $C$ and $D$, subtend an angle $\theta$ is the major arc $CFD$ of the circle that subtends an angle of $2\theta$ at the center. If the angle is $90^\circ$ then the chord becomes the diameter (as it subtends $180^\circ$ at the center) and the arc becomes the whole circle!

So, the steps of the algorithm to achieve a construction in which an arbitrary triangle $\bigtriangleup ABC$ is transformed to an isosceles triangle under perspective projection are as follows.

  1. Find the points of intersection of $AB$, $BC$, and $CA$ with the line common to the object and picture plane (x-axis in this case). Let’s call them $C_1$, $A_1$, $B_1$ respectively.
  2. Choose the value of the two equal angles of the final isosceles triangle. Let’s call the chosen angle $\theta$.
  3. Construct two circles, one in which $C_1$ and $A_1$ subtend $2\theta$ at the center, and another in which $B_1$ and $C_1$ subtend $2\theta$ at the center using steps explained above.
  4. Draw two double cones, one with apex at $B$ and directrix as the circle containing $A_1C_1$ and the other with apex at $A$ and directrix as the circle containing $B_1C_1$. From each cone remove the portion that is attached to the minor arc of each directrix. Let’s call the modified cones $m_1$ and $m_2$.
  5. Chose any point on the intersection of $m_1$ and $m_2$ to be the viewing point.

The image of the triangle under this construction will be an isosceles triangle with two angles measuring $\theta$.

Note: If you don’t remove the portion attached to the minor arc of the cone, you might get one angle to be $180 - \theta$ instead of $\theta$ as that’s the angle the chord subtends in the minor arc.

24. Answer Exercise 22 if it is required that the image of the given triangle be equilateral.

Just chose $\theta$ to be $60^\circ$ and follow the steps outlined in the solution to the previous exercise.

25. Discuss the perspective transformation when the object plane is not perpendicular to the picture plane.

Let’s summarize what we’ve learned about perspective transformation when the object and picture planes are perpendicular. This way we can evaluate the case when they are not perpendicular in relation to when they are.

Under a perspective transformation, when the object and picture plane are perpendicular, we can observe the following facts.

  1. There’s a one-to-one mapping between points on the object plane and the points on the picture plane except for points on the two lines discussed in the solution to problem 8. The same goes for lines. Lines map to lines except for the two lines discussed in the solution to problem 8.
  2. Parallel lines on the object plane (that are not parallel to the line common to the object and picture plane) are transformed into lines that meet at a point in the picture plane.
  3. Points on the line common to the object and picture plane coincide with their images.
  4. Non-degenerate conics on the object plane always map to non-degenerate conics on the picture plane.
  5. If $Q * P * R$ then $Q’ * P’ * R’$ unless the point $P$ does not have an image at all.
  6. $ \dfrac{(P_1P_3)(P_2P_4)}{(P_1P_4)(P_2P_3)} = \dfrac{(P_1’P_3’)(P_2’P_4’)}{(P_1’P_4’)(P_2’P_3’)}$.
  7. Angles, ratios, shape and distance are not preserved.
  8. A line segment is mapped to a line segment if and only if both endpoints are not on the line that has no image.
  9. Tangency to an ellipse is preserved if the tangent has an image.

We can verify that all these properties hold true even when the object and picture plane are not perpendicular to each other by following the same process as we did when the planes were perpendicular. Below I’ve highlighted some of the facts that I consider important.

Points to points and lines to lines
Conics to conics
Images of parallel lines meet at a point

References


  1. Wikipedia. Conic Section. https://en.wikipedia.org/wiki/Conic_section. [return]
  2. Theorem 3.3.10. Gerard A. Venema. (2006), Foundations of Geometry, Pearson. [return]
  3. $A * B * C$ means the point $B$ is in between points $A$ and $C$. [return]
comments powered by Disqus