Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

shanshan wang's picture

shanshan wang

cooleel

Edge-Quant's profile picture

Gargaz's profile picture

21world's profile picture

·

AI & ML interests

None yet

Organizations

Collections 6

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20, 2025 • 47
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18, 2025 • 29
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14, 2025 • 18
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47

Prompt-to-Leaderboard

Paper • 2502.14855 • Published Feb 20, 2025 • 7
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24, 2025 • 32
Generating Skyline Datasets for Data Science Models

Paper • 2502.11262 • Published Feb 16, 2025 • 7
Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Paper • 2502.12501 • Published Feb 18, 2025 • 6

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20, 2025 • 47
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18, 2025 • 29
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14, 2025 • 18
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47

Prompt-to-Leaderboard

Paper • 2502.14855 • Published Feb 20, 2025 • 7
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24, 2025 • 32
Generating Skyline Datasets for Data Science Models

Paper • 2502.11262 • Published Feb 16, 2025 • 7
Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Paper • 2502.12501 • Published Feb 18, 2025 • 6

View 6 collections

models 0

None public yet

datasets 1

cooleel/xfund_de

Updated Dec 2, 2022 • 21

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs