Imagine a camera mounted perfectly level on a tripod. The lens axis (the line through the middle of the lens) is perfectly horizontal. At one end (the camera end) it passes through dead centre of the image and at the other end it passes through the front of a building, at a point which is at exactly the same height as the camera. It is dead centre in the field of view. In our imaginary case we see that with the camera level, we are cutting off most of the building's roof.
Because the film/sensor plane is perfectly vertical, there is no convergence of the buildings vertical lines.
Now, if I raise the tripod column 10 mm all that happens is that I see 10 mm more of the building's roof.
If, instead of raising the camera, I simply raise the lens while keeping the back stationary the lens axis will remain horizontal, but the line between the centre of the film/sensor and the "centre" of the lens will rise up above the vertical. It is this line that defines the field of view. Therefore the camera is now looking up, and 10 mm of lens rise translates into a much greater rise in the field of view.
Suppose you have a 50 mm lens, and you raise it 10 mm. On a building 50 m (about 150 ft) away the field of view will rise 50 000/ 50 x 10 = 10 000 mm, or 10 m (about 30 ft). The film/sensor is still vertical, so building verticals will remain straight and vertical in the image.
How does that sound?
Best,
Helen