Sergio Paniego's picture

Sergio Paniego PRO

sergiopaniego

·

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

published a model 39 minutes ago

sergiopaniego/wordle-grpo-Qwen3-1.7B

updated a model about 1 hour ago

sergiopaniego/wordle-grpo-Qwen3-1.7B-Instruct-updated

posted an update about 3 hours ago

Ya está disponible el vídeo de la charla del otro día en @nerdearla sobre IA abierta, por si queréis verla! 🤠 https://www.youtube.com/watch?v=p-JLn4xAkMw

View all activity

Organizations

published a model 39 minutes ago

sergiopaniego/wordle-grpo-Qwen3-1.7B

Updated 39 minutes ago

updated a model about 1 hour ago

sergiopaniego/wordle-grpo-Qwen3-1.7B-Instruct-updated

Text Generation • 2B • Updated about 1 hour ago • 48

posted an update about 3 hours ago

Post

50

Ya está disponible el vídeo de la charla del otro día en @nerdearla sobre IA abierta, por si queréis verla! 🤠

https://www.youtube.com/watch?v=p-JLn4xAkMw

1 reply

·

updated a dataset about 3 hours ago

huggingface-projects/Deep-RL-Course-Certification

Viewer • Updated about 3 hours ago • 1.61k • 251 • 16

updated 2 datasets about 4 hours ago

agents-course/final-certificates

Viewer • Updated about 4 hours ago • 5 • 1.01k • 4

agents-course/course-certificates-of-excellence

Viewer • Updated about 4 hours ago • 3.81k • 579 • 5

posted an update about 23 hours ago

Post

805

we've just added several example scripts to TRL showing how to train models with GRPO using some of the new OpenEnv environments

train a model to interact with a browser (🎮 BrowserGym Env), play Wordle (🎮 Wordle Env) and moooore!

TRL (GRPO + vLLM) + OpenEnv! ⚡️

📝 go play with them: https://github.com/huggingface/trl/tree/main/examples/scripts/openenv

📝 examples list: https://huggingface.co/docs/trl/main/en/example_overview#scripts

published a model 1 day ago

sergiopaniego/wordle-grpo-Qwen3-1.7B-Instruct-updated

Text Generation • 2B • Updated about 1 hour ago • 48

updated a model 1 day ago

sergiopaniego/wordle-grpo-Qwen2.5-0.5B-Instruct-updated

Updated 1 day ago

updated a Space 1 day ago

Wordle Grpo Qwen2.5 0.5B Instruct Updated

Track and visualize project metrics and media logs

published a Space 1 day ago

Wordle Grpo Qwen2.5 0.5B Instruct Updated

Track and visualize project metrics and media logs

published a model 1 day ago

sergiopaniego/wordle-grpo-Qwen2.5-0.5B-Instruct-updated

Updated 1 day ago

updated 2 datasets 1 day ago

agents-course/final-certificates

Viewer • Updated about 4 hours ago • 5 • 1.01k • 4

agents-course/course-certificates-of-excellence

Viewer • Updated about 4 hours ago • 3.81k • 579 • 5

updated a model 2 days ago

sergiopaniego/wordle-grpo-Qwen2.5-0.5B-Instruct-test

Updated 2 days ago

updated a Space 2 days ago

Wordle Grpo Qwen2.5 0.5B Instruct Test

published a Space 2 days ago

Wordle Grpo Qwen2.5 0.5B Instruct Test

published a model 2 days ago

sergiopaniego/wordle-grpo-Qwen2.5-0.5B-Instruct-test

Updated 2 days ago

updated 2 models 2 days ago

sergiopaniego/wordle-grpo-Qwen3-0.6B-Instruct-wandb

Updated 2 days ago

sergiopaniego/wordle-grpo-Qwen3-0.6B-Instruct-wandb

Updated 2 days ago