Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a Space
10 days ago
ggml-org/gguf-my-repo
liked
a dataset
13 days ago
nvidia/Nemotron-VLM-Dataset-v2
liked
a model
16 days ago
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16