Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Paper • 2510.24514 • Published Oct 28 • 21
Lost in Embeddings: Information Loss in Vision-Language Models Paper • 2509.11986 • Published Sep 15 • 28
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Paper • 2501.07542 • Published Jan 13 • 3