Viacheslav's picture

Viacheslav

ummagumm-a

·

ummagumm-a

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

t-tech/manifolds

new activity about 2 months ago

tfrere/research-article-template:References shown at the end of a chapter (not in the footnote)

new activity about 2 months ago

tfrere/research-article-template:References shown at the end of a chapter (not in the footnote)

View all activity

Organizations

None yet

authored a paper 6 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6, 2025 • 35

authored 3 papers about 1 year ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13, 2025 • 37

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 89

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113

authored 3 papers about 2 years ago

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Paper • 2312.12044 • Published Dec 19, 2023 • 4

In-Context Reinforcement Learning for Variable Action Spaces

Paper • 2312.13327 • Published Dec 20, 2023 • 4

Emergence of In-Context Reinforcement Learning from Noise Distillation

Paper • 2312.12275 • Published Dec 19, 2023 • 4