Models (19)

Active filters: Sa2VA

ByteDance/Sa2VA-Qwen3-VL-4B

Image-Text-to-Text • 5B • Updated Oct 21, 2025 • 3.84k • 15

Dense-World/Sa2VA-4B

Image-Text-to-Text • 4B • Updated Jan 7, 2025 • 3

Dense-World/Sa2VA_InternVL2.5_4b

Image-Text-to-Text • 4B • Updated Jan 7, 2025 • 31 • 1

Dense-World/Sa2VA_InternVL2.5_8b

Image-Text-to-Text • 8B • Updated Jan 7, 2025 • 71

Dense-World/Sa2VA_InternVL2.5_26b

Image-Text-to-Text • 26B • Updated Jan 7, 2025 • 28

ByteDance/Sa2VA-4B

Image-Text-to-Text • Updated Sep 8, 2025 • 1.98k • 96

ByteDance/Sa2VA-8B

Image-Text-to-Text • 8B • Updated Sep 8, 2025 • 2.57k • 65

ByteDance/Sa2VA-1B

Image-Text-to-Text • 1B • Updated Sep 8, 2025 • 575 • 29

ByteDance/Sa2VA-26B

Image-Text-to-Text • 26B • Updated Sep 8, 2025 • 63 • 31

kumuji/Sa2VA-i-4B

Image Segmentation • 4B • Updated Nov 25, 2025 • 14

kumuji/Sa2VA-i-1B

Image Segmentation • 1B • Updated Nov 25, 2025 • 28

kumuji/Sa2VA-i-8B

Image Segmentation • 8B • Updated Nov 25, 2025 • 111

kumuji/Sa2VA-i-26B

Image Segmentation • 26B • Updated Nov 25, 2025 • 4

ByteDance/Sa2VA-InternVL3-2B

Image-Text-to-Text • 2B • Updated Oct 16, 2025 • 193 • 1

ByteDance/Sa2VA-InternVL3-8B

Image-Text-to-Text • 8B • Updated Oct 16, 2025 • 122 • 4

ByteDance/Sa2VA-InternVL3-14B

Image-Text-to-Text • 15B • Updated Oct 16, 2025 • 15 • 9

ByteDance/Sa2VA-Qwen2_5-VL-3B

Image-Text-to-Text • 4B • Updated Oct 16, 2025 • 52 • 2

ByteDance/Sa2VA-Qwen2_5-VL-7B

Image-Text-to-Text • 9B • Updated Oct 16, 2025 • 86 • 4

ByteDance/Sa2VA-Qwen3-VL-2B

Image-Text-to-Text • 3B • Updated Nov 27, 2025 • 630 • 16