Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Viacheslav's picture
7 1

Viacheslav

ummagumm-a
vkurenkov's profile picture
·
  • ummagumm-a

AI & ML interests

None yet

Organizations

None yet

authored a paper 4 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6, 2025 • 35
authored 3 papers 11 months ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13, 2025 • 37

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 89

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113
authored 3 papers almost 2 years ago

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Paper • 2312.12044 • Published Dec 19, 2023 • 4

In-Context Reinforcement Learning for Variable Action Spaces

Paper • 2312.13327 • Published Dec 20, 2023 • 4

Emergence of In-Context Reinforcement Learning from Noise Distillation

Paper • 2312.12275 • Published Dec 19, 2023 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs