A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret
Shu Zhao
TreezzZ
AI & ML interests
None yet
Organizations
models
14
TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo
15B
•
Updated
•
7
TreezzZ/Ferret_ParallelSearch_Qwen3-30b-a3b-instruct_ppo
31B
•
Updated
•
5
TreezzZ/Ferret_ParallelSearch_Qwen2.5-3b-instruct_ppo
3B
•
Updated
•
4
TreezzZ/Ferret_ParallelSearch_Qwen3-4b-instruct_ppo
4B
•
Updated
•
28
TreezzZ/Ferret_ExpandSearch_Qwen2.5-3b-instruct_Llama4-Maverick-17b-128e-instruct_ppo
3B
•
Updated
•
70
TreezzZ/Ferret_ParallelSearch_Qwen2.5-7b-instruct_ppo
8B
•
Updated
•
4
TreezzZ/Ferret_Search-R1_Qwen2.5-3b-instruct_ppo
3B
•
Updated
•
5
TreezzZ/ExpandSearch-3b-instruct-Squeezer-LLaMA4-Maverick
3B
•
Updated
•
9
TreezzZ/ParallelSearch-7b-base
8B
•
Updated
•
6
•
1
TreezzZ/ParallelSearch-7b-instruct
8B
•
Updated
•
7
•
2