The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published Dec 22, 2025 • 65
Unified Multimodal Model Collection A curated list for Multimodal Model Generation papers. • 18 items • Updated Nov 27, 2025 • 4
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16, 2025 • 67
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9, 2025 • 126
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Paper • 2406.18516 • Published Jun 26, 2024 • 4
Reconstructing 4D Spatial Intelligence: A Survey Paper • 2507.21045 • Published Jul 28, 2025 • 38