2D images are ambiguous because of shading, occlusion, viewpoint, and variations in lighting (brightness and darkness). Edges can be unclear, so we need to make inferences: detecting features, grouping similar parts into objects, parsing the scene so that figures can be separated from the background, and identifying the objects.
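The four inference steps can be illustrated with a toy example. This is only a minimal sketch under strong assumptions, not a model of how vision actually works: the image is a small grid of brightness values, "features" are sharp brightness changes, grouping is done by flood-filling pixels of similar brightness, any region touching the image border is assumed to be background, and identification is reduced to a crude shape label. All function names and thresholds (`detect_edges`, `group_regions`, `separate_figure`, `describe`, `thresh`, `tol`) are hypothetical and chosen for illustration.

```python
from collections import deque

def detect_edges(img, thresh=50):
    """Feature detection: flag pixels that differ sharply from the right or lower neighbor."""
    h, w = len(img), len(img[0])
    edges = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx = abs(img[y][x] - img[y][x + 1]) if x + 1 < w else 0
            dy = abs(img[y][x] - img[y + 1][x]) if y + 1 < h else 0
            edges[y][x] = max(dx, dy) > thresh
    return edges

def group_regions(img, tol=30):
    """Grouping: flood-fill 4-connected pixels of similar brightness into labeled regions."""
    h, w = len(img), len(img[0])
    labels = [[-1] * w for _ in range(h)]
    next_label = 0
    for sy in range(h):
        for sx in range(w):
            if labels[sy][sx] != -1:
                continue  # already assigned to a region
            labels[sy][sx] = next_label
            queue = deque([(sy, sx)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < h and 0 <= nx < w and labels[ny][nx] == -1
                            and abs(img[ny][nx] - img[y][x]) <= tol):
                        labels[ny][nx] = next_label
                        queue.append((ny, nx))
            next_label += 1
    return labels, next_label

def separate_figure(labels, n_regions):
    """Figure-ground parsing (assumption): regions touching the border are background."""
    h, w = len(labels), len(labels[0])
    background = {labels[y][x] for y in range(h) for x in range(w)
                  if y in (0, h - 1) or x in (0, w - 1)}
    return [r for r in range(n_regions) if r not in background]

def describe(labels, region):
    """Identification (very crude): label a region by the shape of its bounding box."""
    pts = [(y, x) for y, row in enumerate(labels) for x, v in enumerate(row) if v == region]
    height = max(p[0] for p in pts) - min(p[0] for p in pts) + 1
    width = max(p[1] for p in pts) - min(p[1] for p in pts) + 1
    return "square-like" if height == width else "elongated"

# Toy 6x6 image: dark background (20) with a bright 2x2 patch (200) in the middle.
IMG = [[20] * 6 for _ in range(6)]
for y in (2, 3):
    for x in (2, 3):
        IMG[y][x] = 200

edges = detect_edges(IMG)
labels, n_regions = group_regions(IMG)
figures = separate_figure(labels, n_regions)
names = [describe(labels, r) for r in figures]
```

On a real photograph each of these steps is far less clean: shading breaks up regions of "similar" brightness and occlusion splits one object into several groups, which is exactly why the result has to be treated as an inference rather than a direct reading of the image.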