Shubhashis Roy Dipta
dipta007
AI & ML interests
Multimodal Understanding, Reasoning, Generation
Recent Activity
updated a model 7 days ago
dipta007/VCInspector-7B
updated a model 7 days ago
dipta007/VCInspector-3B
updated a dataset 7 days ago
dipta007/ActivityNet-FG-It
Organizations
GanitLLM (ACL 2026 Findings)
- GanitLLM: Difficulty-Aware Bengali Mathematical Reasoning through Curriculum-GRPO
  Paper • 2601.06767 • Published
- dipta007/Ganit
  Viewer • Updated • 32.3k • 69
- dipta007/GanitLLM-4B_SFT_CGRPO
  Text Generation • 196k • Updated • 1.77k
- dipta007/GanitLLM-4B_SFT_GRPO
  Text Generation • 196k • Updated • 216 • 1
VC-Inspector (ACL 2026 Main)
- Advancing Reference-free Evaluation of Video Captions with Factual Analysis
  Paper • 2509.16538 • Published • 1
- dipta007/VCInspector-7B
  Image-Text-to-Text • 8B • Updated • 31 • 1
- dipta007/VCInspector-3B
  Image-Text-to-Text • 4B • Updated • 45 • 1
- dipta007/ActivityNet-FG-It
  Viewer • Updated • 242k • 43
models 20
dipta007/VCInspector-7B
Image-Text-to-Text • 8B • Updated • 31 • 1
dipta007/VCInspector-3B
Image-Text-to-Text • 4B • Updated • 45 • 1
dipta007/GanitLLM-0.6B_CGRPO
Text Generation • 0.6B • Updated • 251
dipta007/GanitLLM-0.6B-SFT
Text Generation • 0.8B • Updated • 258
dipta007/GanitLLM-0.6B_SFT_GRPO
Text Generation • 0.8B • Updated • 243
dipta007/GanitLLM-0.6B_SFT_CGRPO
Text Generation • 0.8B • Updated • 243
dipta007/GanitLLM-1.7B_CGRPO
Text Generation • 2B • Updated • 241
dipta007/GanitLLM-1.7B-SFT
Text Generation • 2B • Updated • 242
dipta007/GanitLLM-1.7B_SFT_GRPO
Text Generation • 2B • Updated • 237
dipta007/GanitLLM-1.7B_SFT_CGRPO
Text Generation • 2B • Updated • 233
datasets 52
dipta007/ActivityNet-FG-It
Viewer • Updated • 242k • 43
dipta007/Ganit
Viewer • Updated • 32.3k • 69
dipta007/DistractMath-Bn
Viewer • Updated • 3.69k • 26
dipta007/dagger
Viewer • Updated • 6.48k • 28
dipta007/BanglaBias
Viewer • Updated • 200 • 54
dipta007/youcook2-retouch-prompts
Viewer • Updated • 3.18k • 22
dipta007/youcook2-retouch-videos
Viewer • Updated • 7 • 14
dipta007/APIGen-MT-5k-with-cot
Updated • 2
dipta007/APIGen-MT-5k-train-val
Viewer • Updated • 4.8k • 17
dipta007/APIGen-MT-5k-with-cot-v1-deepseek_deepseek
Viewer • Updated • 2.5k • 4