Celestine Floquet

Celestine-floquet

AI & ML interests

cute voice models, anime waifus

Recent Activity

upvoted an article 25 days ago

Differential Transformer V2

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

upvoted a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

View all activity

Organizations

upvoted an article 25 days ago

Article

Differential Transformer V2

27 days ago

•

liked a Space about 1 month ago

The Smol Training Playbook

📚

2.98k

The secrets to building world-class LLMs

upvoted 2 papers about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 109

liked a Space about 1 month ago

Nomic Embeddings

🦀

Generate text embeddings using various models

liked 2 models about 1 month ago

NovaSearch/stella_en_400M_v5

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • Updated Jun 20, 2025 • 3.48M • • 876

upvoted a collection about 1 month ago

NV-Embed

Collection

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks. • 3 items • Updated 11 days ago • 17

upvoted 5 papers about 1 month ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 180

Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 17

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 306

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published Dec 30, 2025 • 112

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 64

upvoted 2 papers about 2 months ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published Dec 29, 2025 • 98

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 151

upvoted a collection 3 months ago

Z-Image

Collection

7 items • Updated 19 days ago • 141

upvoted a paper 3 months ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 128

liked a Space 3 months ago

IndexTTS 2 Demo

🏢

744

Generate expressive speech from text and voice reference

upvoted 2 papers 5 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 135

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 118

Celestine Floquet

AI & ML interests

Recent Activity

Organizations