Fudan NLP Lab

university

https://nlp.fudan.edu.cn/

AI & ML interests

Natural Language Processing

Papers

CCTU: A Benchmark for Tool Use under Complex Constraints

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

authored a paper 2 months ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16, 2026 • 66

authored a paper 3 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 156

authored a paper 4 months ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 81

authored a paper 7 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 273

authored 5 papers 8 months ago

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

Paper • 2404.00884 • Published Apr 1, 2024 • 1

Multi-Programming Language Sandbox for LLMs

Paper • 2410.23074 • Published Oct 30, 2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models

Paper • 2303.10420 • Published Mar 18, 2023 • 1

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

authored 7 papers about 1 year ago

MouSi: Poly-Visual-Expert Vision-Language Models

Paper • 2401.17221 • Published Jan 30, 2024 • 9

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

Paper • 2402.10685 • Published Feb 16, 2024 • 1

Length Generalization of Causal Transformers without Position Encoding

Paper • 2404.12224 • Published Apr 18, 2024 • 1

AntLM: Bridging Causal and Masked Language Models

Paper • 2412.03275 • Published Dec 4, 2024

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Paper • 2502.14837 • Published Feb 20, 2025 • 3

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Paper • 2410.11302 • Published Oct 15, 2024

Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models

Paper • 2410.03176 • Published Oct 4, 2024