sergiopaniego/wordle-grpo-Qwen3-1.7B-Instruct-updated Text Generation • 2B • Updated about 1 hour ago • 48
view post Post 50 Ya está disponible el vídeo de la charla del otro día en @nerdearla sobre IA abierta, por si queréis verla! 🤠https://www.youtube.com/watch?v=p-JLn4xAkMw See translation 1 reply · Reply
huggingface-projects/Deep-RL-Course-Certification Viewer • Updated about 3 hours ago • 1.61k • 251 • 16
view post Post 805 we've just added several example scripts to TRL showing how to train models with GRPO using some of the new OpenEnv environmentstrain a model to interact with a browser (🎮 BrowserGym Env), play Wordle (🎮 Wordle Env) and moooore!TRL (GRPO + vLLM) + OpenEnv! ⚡️📝 go play with them: https://github.com/huggingface/trl/tree/main/examples/scripts/openenv📝 examples list: https://huggingface.co/docs/trl/main/en/example_overview#scripts See translation 🔥 3 3 👍 3 3 + Reply
sergiopaniego/wordle-grpo-Qwen3-1.7B-Instruct-updated Text Generation • 2B • Updated about 1 hour ago • 48
Running Wordle Grpo Qwen2.5 0.5B Instruct Updated 🚀 Track and visualize project metrics and media logs
Running Wordle Grpo Qwen2.5 0.5B Instruct Updated 🚀 Track and visualize project metrics and media logs