weblab-llm-competition-2025-bridge/neko-prelim-DPO_Adversarial_vanilla_ver2 Viewer • Updated 4 days ago • 1.8k • 4
weblab-llm-competition-2025-bridge/neko-prelim-wj-vanilla_benign_v2 Viewer • Updated 4 days ago • 1.6k • 7
weblab-llm-competition-2025-bridge/neko-prelim-wj-Adversarial_harmful_v2 Viewer • Updated 4 days ago • 1.76k • 15
weblab-llm-competition-2025-bridge/neko-prelim-wj-Adversarial_benign_v2 Viewer • Updated 4 days ago • 1.76k • 6
weblab-llm-competition-2025-bridge/neko-prelim-dna_vanilla_harmful_v2 Viewer • Updated 4 days ago • 1.2k • 4
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_PHYBench Viewer • Updated 4 days ago • 100 • 4
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_OlympiadBench Viewer • Updated 4 days ago • 472 • 4
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_OpenMathReasoning Viewer • Updated 4 days ago • 5.17k • 7
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_MixtureOfThoughts Preview • Updated 4 days ago • 7
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_PhysReason Viewer • Updated 4 days ago • 202 • 6
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_GPQA_Diamond Viewer • Updated 4 days ago • 61 • 10
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_OpenThoughts-114k Viewer • Updated 4 days ago • 6k • 17
weblab-llm-competition-2025-bridge/neko-prelim-HLE_RL_Olympiadbench-v2 Viewer • Updated 4 days ago • 21 • 7
weblab-llm-competition-2025-bridge/neko-prelim-HLE_SFT_LIMO-v2 Viewer • Updated 4 days ago • 800 • 16
weblab-llm-competition-2025-bridge/team-pont-neuf-sft-dataset Viewer • Updated 15 days ago • 742 • 21 • 1
weblab-llm-competition-2025-bridge/difficult_problem_dataset_v4_500 Viewer • Updated Sep 19 • 5.05k • 28
weblab-llm-competition-2025-bridge/team-truthowl-mixed-reasoning-dataset Viewer • Updated Sep 1 • 22.9k • 54
weblab-llm-competition-2025-bridge/team-camino-TheUniversityofMinnesota_pastGraduateWrittenExams_Physics Viewer • Updated Aug 31 • 68 • 14
weblab-llm-competition-2025-bridge/team-camino-GraduateMathExam_Kyoto Viewer • Updated Aug 31 • 225 • 3
weblab-llm-competition-2025-bridge/team-camino-Nemotron-CrossThink-QA_reasoning_Phi-4-reasoning-plus_0803_n20480_test_5k Viewer • Updated Aug 31 • 5.14k • 5
weblab-llm-competition-2025-bridge/team-camino-Omni-MATH_difficulty5plus_qa Viewer • Updated Aug 31 • 4.43k • 7
weblab-llm-competition-2025-bridge/team-akiyama-short_cleaned_NuminaMath-RL-Verifiable_with_proof Viewer • Updated Aug 30 • 147k • 17