MC

Dreamer312

Dreamer

AI & ML interests

NLP, CV, LLM, AGENT, RL

Recent Activity

liked a model about 1 month ago

WeiboAI/VibeThinker-1.5B

liked a model 2 months ago

moonshotai/Kimi-K2-Thinking

liked a Space 3 months ago

lerobot/robot-learning-tutorial

Organizations

None yet

upvoted a paper 8 months ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20, 2025 • 76

upvoted a collection 8 months ago

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 16 days ago • 53

upvoted 2 papers 8 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation

Paper • 2409.10262 • Published Sep 16, 2024 • 1

upvoted an article 8 months ago

Mixture of Experts Explained

Article • Dec 11, 2023 • 1.03k

upvoted a collection 8 months ago

Qwen3

Collection

84 items • Updated 10 days ago • 1.55k

upvoted 2 articles 9 months ago

Proximal Policy Optimization (PPO)

Article • Aug 5, 2022

Merge Large Language Models with mergekit

Article • Jan 9, 2024 • 147

upvoted an article 10 months ago

Trace & Evaluate your Agent with Arize Phoenix

Article • Feb 28, 2025

upvoted an article 11 months ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Article • Jan 31, 2025

upvoted a paper about 1 year ago

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Paper • 2404.13013 • Published Apr 19, 2024 • 31

upvoted 4 articles over 1 year ago

A failed experiment: Infini-Attention, and why we should keep trying?

Article • Aug 14, 2024

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Article • Jul 18, 2024

Preference Optimization for Vision Language Models

Article • Jul 10, 2024

Docmatix - a huge dataset for Document Visual Question Answering

Article • Jul 18, 2024

upvoted a paper over 2 years ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 248
