4 15 5

XinghaoWang

Singhoo

xinghaow99

AI & ML interests

LLMs

Recent Activity

liked a dataset about 1 month ago

allenai/dolma3_longmino_mix-100B-1125

liked a model about 2 months ago

OpenMOSS-Team/MOVA-360p

liked a model about 2 months ago

OpenMOSS-Team/MOSS-TTS

View all activity

Organizations

liked a dataset about 1 month ago

allenai/dolma3_longmino_mix-100B-1125

Preview • Updated Feb 24 • 7.47k • 13

liked 2 models about 2 months ago

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 24.7k • 211

OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated 24 days ago • 60.7k • 371

upvoted a paper about 2 months ago

MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models

Paper • 2602.10934 • Published Feb 11 • 49

authored 3 papers 2 months ago

commented a paper 2 months ago

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published Feb 9 • 38 •

upvoted a paper 2 months ago

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published Feb 9 • 38

submitted a paper to Daily Papers 2 months ago

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published Feb 9 • 38

upvoted 2 papers 2 months ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation

Paper • 2601.23182 • Published Jan 30 • 21

upvoted 2 papers 3 months ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published Jan 21 • 75

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

upvoted a paper 4 months ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 60

upvoted 2 papers 5 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 98

upvoted 2 papers 6 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 61

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24, 2025 • 25

commented a paper 6 months ago

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24, 2025 • 25 •

XinghaoWang

AI & ML interests

Recent Activity

Organizations

Singhoo's activity