Preprint Vintix: Action Model via In-Context Reinforcement Learning
Andrey Polubarov and...Nikita Lyubaykin, Alexander Derevyagin, Ilya Zisman, Denis Tarasov, Alexander Nikulin, Vladislav Kurenkov
PreprintYes, Q-learning Helps Offline
In-Context RL

Denis Tarasov and...Alexander Nikulin, Ilya Zisman, Albina Klepach, Andrei Polubarov, Nikita Lyubaykin, Alexander Derevyagin, Igor Kiselev, Vladislav Kurenkov
PreprintLatent Action Learning Requires Supervision in the Presence of Distractors
Alexander Nikulin and...Ilya Zisman, Denis Tarasov, Nikita Lyubaykin, Andrei Polubarov, Igor Kiselev, Vladislav Kurenkov
PreprintObject-Centric Latent
Action Learning

Albina Klepach and...Alexander Nikulin, Ilya Zisman, Denis Tarasov, Alexander Derevyagin, Andrei Polubarov, Nikita Lyubaykin, Vladislav Kurenkov


PreprintN-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
Ilya Zisman and...Alexander Nikulin, Andrei Polubarov, Nikita Lyubaykin, Vladislav Kurenkov
ICLR 2025XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Alexander Nikulin and...Ilya Zisman, Alexey Zemtsov, Vladislav Kurenkov
NeurIPS 2024XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin and...Vladislav Kurenkov, Ilya Zisman, Artem Agarkov, Viacheslav Sinii, Sergey Kolesnikov
ICML 2024In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii and...Alexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Sergey Kolesnikov


ICML 2024 Emergence of In-Context Reinforcement Learning from Noise Distillation
 Ilya Zisman and...ladislav Kurenkov, Alexander Nikulin, Viacheslav Sinii, Sergey Kolesnikov
NeurIPS 2023Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov and...Vladislav Kurenkov, Alexander Nikulin, Sergey Kolesnikov
NeurIPS2023 CORL: Research-oriented Deep Offline Reinforcement Learning Library
Denis Tarasov and...Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov
NeurIPS 2023
Katakomba: Tools and Benchmarks for Data-Driven NetHack
Vladislav Kurenkov and...Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov


ICML 2023Anti-Exploration by Random
Network Distillation

Alexander Nikulin and...Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov
ICML 2022Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov and...Sergey Kolesnikov