Indic-dataset High-quality dataset for LLM to VLM damerajee/VQA-hi Viewer • Updated May 17, 2024 • 50k • 22 damerajee/Hindi-LLaVA-CC3M-Pretrain-595K Viewer • Updated May 2, 2024 • 595k • 22 damerajee/long_context_hindi Viewer • Updated May 6, 2024 • 807k • 52 damerajee/Instruct-hindi Viewer • Updated May 6, 2024 • 249k • 19 • 1
Awesome Indic-LLM sarvamai/OpenHathi-7B-Hi-v0.1-Base Text Generation • 7B • Updated Dec 22, 2023 • 1.77k • 115 soketlabs/pragna-1b Text Generation • 1B • Updated May 27, 2024 • 84 • 16 Cognitive-Lab/Ambari-7B-base-v0.1 Text Generation • Updated Jan 8, 2024 • 20 • 8 CohereLabs/aya-23-8B Text Generation • 8B • Updated Sep 11, 2025 • 13.3k • 427
Indic-dataset High-quality dataset for LLM to VLM damerajee/VQA-hi Viewer • Updated May 17, 2024 • 50k • 22 damerajee/Hindi-LLaVA-CC3M-Pretrain-595K Viewer • Updated May 2, 2024 • 595k • 22 damerajee/long_context_hindi Viewer • Updated May 6, 2024 • 807k • 52 damerajee/Instruct-hindi Viewer • Updated May 6, 2024 • 249k • 19 • 1
Awesome Indic-LLM sarvamai/OpenHathi-7B-Hi-v0.1-Base Text Generation • 7B • Updated Dec 22, 2023 • 1.77k • 115 soketlabs/pragna-1b Text Generation • 1B • Updated May 27, 2024 • 84 • 16 Cognitive-Lab/Ambari-7B-base-v0.1 Text Generation • Updated Jan 8, 2024 • 20 • 8 CohereLabs/aya-23-8B Text Generation • 8B • Updated Sep 11, 2025 • 13.3k • 427