Models

19,407

Active filters: grpo

lokahq/Trinity-Mini-DrugProt-Think

Text Generation • Updated 5 days ago • 49 • 4

webxos/microclaw-for-openclaw-version-2026.2.17

Text Generation • Updated about 7 hours ago • 346 • 5

trentmkelly/Qwen3-14B-ZeroGPT-beta

Updated Jul 3, 2025 • 5

erythropygia/Gemma-3-1b-it-OpenR1-Turkish

Text Generation • 1.0B • Updated Mar 21, 2025 • 6 • 2

trentmkelly/Llama-3.1-8b-Instruct-Pangram

Text Generation • Updated Oct 3, 2025 • 4

openmed-community/AFM-4.5B-OpenMed-GGUF

Text Generation • 5B • Updated Oct 2, 2025 • 230 • 8

Klingspor/StarPO-4B

Text Generation • 4B • Updated 15 days ago • 107 • 2

shannon-ai/shannon-1.6-pro

Text Generation • Updated 10 days ago • 1

Jarrodbarnes/KernelBench-RLVR-120b

Text Generation • 117B • Updated 17 days ago • 34 • 2

lightx2v/Wan2.1-T2V-1.3B-longcat-step1500

Text-to-Video • Updated 17 days ago • 93 • 6

snap-stanford/humanlm-opinion

Text Generation • 8B • Updated 15 days ago • 129 • 10

Jackrong/Qwen3-4B-Thinking-2507-GLM-4.7-Distilled-GGUF

Text Generation • 4B • Updated 4 days ago • 567 • 1

mradermacher/DECS_7B-i1-GGUF

8B • Updated 3 days ago • 3.64k • 1

Chun121/Qwen3-4B-RPG-Roleplay-V2

Text Generation • 4B • Updated Aug 24, 2025 • 8.54k • 40

onuryozcu/llama

Text Generation • 0.1B • Updated Mar 10, 2025 • 3

amiguel/promptTuning

8B • Updated Feb 16, 2025

sergiopaniego/Qwen2-0.5B-GRPO-test

Updated Oct 3, 2025

Novaciano/ESP-NSFW-GRPO-1B-Sin_Censura-GGUF

1B • Updated Jan 28, 2025 • 77 • 4

nbd22/Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora

Updated Jan 28, 2025

sergiopaniego/Qwen2-0.5B-GRPO

Updated Jan 31, 2025

philschmid/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Jan 30, 2025 • 9 • 8

spinech/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Apr 28, 2025 • 3

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 2, 2025 • 2 • 1

yooneo/qwen-0.5b-r1-aha

Updated Jan 31, 2025

yooneo/qwen-1.5b-r1-aha

Updated Jan 31, 2025

spinech/qwen2.5-3b-r1-rearc-stage1

Text Generation • 3B • Updated Apr 28, 2025 • 12

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO

Text Generation • 8B • Updated Feb 3, 2025 • 11 • 1

MasterControlAIML/DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured

Text Generation • 2B • Updated Feb 3, 2025 • 11 • 5

mradermacher/DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured-GGUF

2B • Updated Feb 3, 2025 • 165 • 2

hyunw3/qwen-2.5-0.5b-r1-countdown

Text Generation • 0.5B • Updated Apr 30, 2025