Ming Yang's picture

2

Ming Yang

Puitar619

https://github.com/PUITAR

PUITAR

AI & ML interests

RAG, Post Training

Recent Activity

upvoted a paper 5 days ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

published a dataset about 2 months ago

Puitar619/DeepSlide-Domain-Paper

upvoted a paper 2 months ago

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

View all activity

Organizations

None yet

upvoted a paper 5 days ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published 13 days ago • 61

published a dataset about 2 months ago

Puitar619/DeepSlide-Domain-Paper

Updated Mar 5 • 6

upvoted a paper 2 months ago

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Paper • 2602.10622 • Published Feb 11 • 28