I understand why Ansel Adams and others used the zone system to determine the the number of f stops the film would require to encompass the exposer latitude of the object being photographed.
However, if the subject of the photograph is what you want to show, why wouldn't you just take the exposure reading from subject?
Because the context in which you find the subject may be very important and has to be included in a final exposure determination. Here's a snapshot I took in the park. Tower Grove Park in St. Louis is modeled on an English Victorian park and includes the fountain and fake ruins you see here -- arguably the subject of the photo.
In the illustration below you see first the JPEG the camera created in response to the exposure I set for this scene. To take the photo I applied a +.7 EC to the camera's meter reading. The subject is underexposed -- by a lot -- and yet the brightest highlights in the clouds are blown.
In the second image below I had the camera software recreate the JPEG but with an additional +2.3 EC for a total of +3 EC over the camera meter reading. In the second image the subject is well exposed, but the sky is totally nuked.
So how do you meter a subject and scene like this to determine the exposure. You started off with a comment about the Zone System which would be applicable to film photography. The Zone System was created to address problems similar to the one here. The foundation of the Zone System is the fact that film's response to exposure and development is disproportional and can be manipulated to a degree via development.
Film has an exposure/development sweet spot. When the scene subject and lighting contrast is a good match then a normal film exposure and development will produce a best possible tone response. Unfortunately scene subject and lighting contrast isn't always cooperative and that's where Adams and Archer stepped in with the Zone System as a means to manipulate the film tone response to be a better fit to the outlier subject/lighting contrast conditions. It is as such a manipulation applied to address a problem and comes with some degree of compromise such that the best possible tone response delivered by the film's sweet spot exposure and processing will take a hit.
If keeping the clouds and sky is important to this photo then the Zone System photographer starts taking discreet measurements to determine the overall tonal contrast of the scene. From that calculation the Zone System photographer would compute the needed reduction in film development time to reduce the contrast response of the film and so better match the high contrast of the scene. In very low contrast scenes increased development times become indicated. How to meter the subject is derived from the calculations of the overall scene contrast and where in the film's tone response the photographer wishes specific detail in the scene to be recorded. At this point in the process the brightness and contrast of the specific subject (eg. human face) factors in but is not the single determinant as the brightest highlights and shadows are also factors.
The photo here was taken with a digital camera and as such the Zone System does not apply. Digital sensors record data linearly and so that foundation fact that makes the Zone System possible (film's response to exposure and development being disproportional) isn't operative for digital.
Digital cameras all create SOOC JPEGs that are designed to emulate the behavior of film and it's possible to work with those camera JPEGs and set exposure based on those JPEGs: "pretend" you're exposing film. In this case exposing for the specific subject would be a possible and very likely best option approach since there's not a lot of latitude in a SOOC JPEG for further adjustment. If shooting camera JPEGs, set an exposure that's ideal for what you've identified as the subject of the photo. If trying to do that you encounter very high or very low subject/lighting contrast conditions the Zone System isn't built into the camera picture controls and you have to make do with what the camera provides. In the case of the above photo my Nikon provides a function Nikon calls ADL (Active D-Lighting) to address very high contrast lighting. In the scene presented here ADL would not be sufficient. No camera JPEG controls would make it possible to take the photo I took. And to take the photo I took my exposure setting did not consider the subject.
So how did I set exposure with a digital camera? CAVEAT: got to save and process the raw files and it's really simple. Look at the JPEG my camera created for that scene. That is a perfect exposure for that scene because I set the exposure to capture as much data as the camera sensor was physically capable of recording. I set that same exposure for every photo I take unless circumstances force me to expose less in which case I expose as much as possible. Methodology: I place the brightest diffuse highlight (clouds in this scene) at the sensor's saturation threshold and -- click. To determine that I use the camera's highlight alert system (some tuning required). I do that without consideration for the subject. No Zone System, no fuss or confusion, just the same exposure for all photos = expose the sensor to it's recording capacity. With a modern digital camera that's going to cover the overwhelming majority of what I want to photograph. The rare case where the subject/lighting contrast exceeds what a modern camera can handle, I walk away.