groupfairnessllm/tulu-3-preference-data-with-distraction
Viewer
•
Updated
•
1.5k
•
27
LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract