Prince Canuma's picture

Building on HF

Prince Canuma

prince-canuma

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

upvoted a paper about 14 hours ago

Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs

upvoted a paper about 14 hours ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

View all activity

Organizations

upvoted 4 papers about 14 hours ago

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

Paper • 2605.21226 • Published 3 days ago • 8

Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs

Paper • 2605.20315 • Published 4 days ago • 26

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published 4 days ago • 38

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 4 days ago • 124

updated 16 models 3 days ago

mlx-community/Gemma4-E2B-IT-Text-int4

Text Generation • 0.7B • Updated 3 days ago • 452 • 2

mlx-community/gemma-4-e2b-it-OptiQ-4bit

Text Generation • 1B • Updated 3 days ago • 2.97k • 2

mlx-community/gemma-4-e2b-it-mxfp8

Any-to-Any • 2B • Updated 3 days ago • 461

mlx-community/gemma-4-e2b-it-bf16

Any-to-Any • 5B • Updated 3 days ago • 1.43k

mlx-community/gemma-4-e2b-it-mxfp4

Any-to-Any • 2B • Updated 3 days ago • 391 • 1

mlx-community/gemma-4-e2b-it-nvfp4

Updated 3 days ago • 1

mlx-community/gemma-4-e2b-it-8bit

Any-to-Any • 2B • Updated 3 days ago • 1.5k • 3

mlx-community/gemma-4-e2b-it-6bit

Any-to-Any • 2B • Updated 3 days ago • 310

mlx-community/gemma-4-e2b-it-5bit

Any-to-Any • 1B • Updated 3 days ago • 143

mlx-community/gemma-4-E4B-it-OBLITERATED-mlx-8Bit

Text Generation • 8B • Updated 3 days ago • 6.11k • 2

mlx-community/gemma-4-E4B-it-OBLITERATED-mlx-fp16

Text Generation • 8B • Updated 3 days ago • 1.78k

mlx-community/gemma-4-e4b-it-4bit-MAD

Text Generation • 1B • Updated 3 days ago • 578

mlx-community/gemma-4-e4b-it-OptiQ-4bit

Text Generation • 8B • Updated 3 days ago • 11k • 18

mlx-community/gemma-4-e4b-it-nvfp4

Any-to-Any • 2B • Updated 3 days ago • 1.33k • 6

mlx-community/gemma-4-e4b-it-mxfp4

Any-to-Any • 2B • Updated 3 days ago • 834 • 2

mlx-community/gemma-4-e4b-it-mxfp8

Any-to-Any • 3B • Updated 3 days ago • 1.89k • 8