SketchVLM Collection Datasets for SketchVLM: Vision-Language Models Can Annotate Images to Explain Thoughts and Guide Users (https://sketchvlm.github.io/) • 4 items • Updated 4 days ago
SketchVLM Collection Datasets for SketchVLM: Vision-Language Models Can Annotate Images to Explain Thoughts and Guide Users (https://sketchvlm.github.io/) • 4 items • Updated 4 days ago
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 10 days ago • 31
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 10 days ago • 31
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published Sep 26, 2025 • 160