Tulu3 with distraction mitigation data
Collection
LLMs and LRMs can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data on which a model can be fine-tuned to avoid such distraction.
5 items • Updated • 2