CellFlux: Simulating Cellular Morphology Changes via Flow Matching Paper β’ 2502.09775 β’ Published Feb 13, 2025
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks Paper β’ 2310.19677 β’ Published Oct 30, 2023
No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models Paper β’ 2510.03978 β’ Published Oct 4, 2025 β’ 4
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages Paper β’ 2003.07082 β’ Published Mar 16, 2020
Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning Paper β’ 2512.20934 β’ Published Dec 24, 2025
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR Paper β’ 2601.18207 β’ Published 12 days ago β’ 19
view post Post 1610 Are you familiar with reverse residual connections or looping in language models?Excited to share my Looped-GPT blog post and codebase πhttps://github.com/sanyalsunny111/Looped-GPTTL;DR: looping during pre-training improves generalization.Plot shows GPT2 LMs pre-trained with 15.73B OWT tokensP.S. This is my first post here β I have ~4 followers and zero expectations for reach π See translation 3 replies Β· π§ 6 6 π 3 3 + Reply
SkillFactory: Self-Distillation For Learning Cognitive Behaviors Paper β’ 2512.04072 β’ Published Dec 3, 2025 β’ 5
π¨ Gelato Collection From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents β’ 5 items β’ Updated Nov 15, 2025 β’ 1
mlfoundations/gelato-osworld-agent-trajectories Viewer β’ Updated Nov 6, 2025 β’ 13.5k β’ 41 β’ 1
mlfoundations/gelato-osworld-agent-trajectories Viewer β’ Updated Nov 6, 2025 β’ 13.5k β’ 41 β’ 1
π¨ Gelato Collection From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents β’ 5 items β’ Updated Nov 15, 2025 β’ 1
π MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" β’ 14 items β’ Updated Oct 22, 2025 β’ 65