Whisper Speaker Diarization 🗣 • Running • 315 • Separate and label different speakers in audio recordings
NuMarkdown 8b Thinking 👁 • Running on L40S • 62 • Reasoning model specialized for OCR/Markdown generation
VLM Object Understanding 🦀 • Running on Zero • Featured • 113 • Explore object detection, visual grounding, keypoint Detecti
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 36.1k • 1.61k
PaddleOCR-VL Online Demo 📈 • Running • Featured • 230 • Convert documents and images into structured text and Markdown