Colin Raffel's picture

Colin Raffel

craffel

·

http://colinraffel.com

AI & ML interests

None yet

Recent Activity

updated a model 14 days ago

craffel/supertoken_models

updated a model about 2 months ago

fineinstructions/pretraining_experiments

updated a model about 2 months ago

fineinstructions/pretraining_experiments

View all activity

Organizations

liked 2 models 4 months ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 309k • • 1.33k

ShantanuT01/BERT-tiny-RAID

Text Classification • 4.39M • Updated Sep 15, 2025 • 227 • 1

liked a dataset 7 months ago

nvidia/Nemotron-Post-Training-Dataset-v2

Viewer • Updated Aug 21, 2025 • 6.34M • 10.1k • 116

liked 2 datasets 8 months ago

nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 5.36k • 106

NousResearch/Hermes-3-Dataset

Viewer • Updated Jul 11, 2025 • 959k • 840 • 302

liked a model 9 months ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 49.2k • 1.59k

liked 3 models 12 months ago

OpenHands/openhands-lm-32b-v0.1

Text Generation • 33B • Updated Apr 16, 2025 • 229 • • 392

teapotai/teapotllm

Text Generation • 0.8B • Updated about 1 month ago • 148 • 183

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14, 2025 • 126k • 476

liked 4 models about 1 year ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Updated Dec 22, 2025 • 457k • 1.35k

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 58k • • 2.88k

bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF

Text Generation • 33B • Updated Jan 22, 2025 • 35.8k • 299

mistralai/Mistral-Small-24B-Instruct-2501

Updated Jul 28, 2025 • 111k • 948

liked a Space about 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a dataset about 1 year ago

nvidia/Daring-Anteater

Viewer • Updated Jun 17, 2024 • 99.5k • 2.68k • 28

liked 3 datasets over 1 year ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 1.32k • 460

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 477 • 301

LLM360/TxT360

Updated May 26, 2025 • 52.2k • 248

liked 2 models over 1 year ago

Zyphra/Zamba2-2.7B-instruct

Text Generation • 3B • Updated Feb 14, 2025 • 221 • 82

openbmb/MiniCPM-V-2_6

Image-Text-to-Text • Updated Jun 13, 2025 • 124k • 1.03k