1 33 10

Yan Varakin

ZDPLI

https://www.researchgate.net/profile/Yan-Varakin

ZDPLI

AI & ML interests

All areas of NLP, computational mathematics, reinforcement learning, robotics.

Organizations

upvoted an article 5 months ago

Article

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Feb 2, 2025

•

upvoted an article 6 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

Jun 26, 2025

•

120

upvoted 2 articles 8 months ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

•

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

•

upvoted a paper 8 months ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30, 2025 • 53

upvoted an article 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

266

upvoted 5 papers 8 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 92

upvoted 3 papers 11 months ago

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21, 2025 • 7

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24, 2025 • 28

upvoted 2 papers 12 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9, 2025 • 95

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7, 2025 • 27

upvoted 3 papers about 1 year ago

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Paper • 2412.01822 • Published Dec 2, 2024 • 15

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Paper • 2411.15139 • Published Nov 22, 2024 • 15

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

upvoted an article about 1 year ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Nov 19, 2024

•

Yan Varakin

AI & ML interests

Organizations

ZDPLI's activity

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Gemma 3n fully available in the open-source ecosystem!

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tune Llama 2 with DPO

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

LLaVA-o1: Let Vision Language Models Reason Step-by-Step