Understanding and Modeling Explicit and Implicit Representations of the Visual World