Efficient Intelligence and Systems

community

Efficient-ML

Activity Feed

AI & ML interests

Low-bit Quantization of Large Language Models (LLMs)

Recent Activity

AaronHuangWei authored a paper 8 days ago

Anchor Forcing: Anchor Memory and Tri-Region RoPE for Interactive Streaming Video Diffusion

AaronHuangWei authored a paper 8 days ago

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

AaronHuangWei authored a paper 8 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

View all activity

AaronHuangWei

authored 3 papers 8 days ago

Anchor Forcing: Anchor Memory and Tri-Region RoPE for Interactive Streaming Video Diffusion

Paper • 2603.13405 • Published Mar 12

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Paper • 2604.04911 • Published 9 days ago • 35

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 9 days ago • 107

AaronHuangWei

submitted a paper to Daily Papers 8 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 9 days ago • 107

mack-williams

authored 8 papers 2 months ago

LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation

Paper • 2510.08318 • Published Oct 9, 2025

Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention

Paper • 2602.04789 • Published Feb 4 • 3

PTQ4SAM: Post-Training Quantization for Segment Anything

Paper • 2405.03144 • Published May 6, 2024

LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit

Paper • 2405.06001 • Published May 9, 2024

AaronHuangWei

authored 2 papers 4 months ago

MC#: Mixture Compressor for Mixture-of-Experts Large Models

Paper • 2510.10962 • Published Oct 13, 2025

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 51

AaronHuangWei

submitted a paper to Daily Papers 4 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 51

AaronHuangWei

authored 5 papers 6 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 182

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 92

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Paper • 2505.13031 • Published May 19, 2025 • 4

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Paper • 2507.10548 • Published Jul 14, 2025 • 37

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

AI & ML interests

Recent Activity

Team members 9

Efficient-ML's activity