Date: 2025-10-14
Single-process model-parallel SFT with LoRA in FP16 (T4×2). Answer-only loss. Time-capped.
See usage snippet in repo.
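
As a rough illustration of what "answer-only loss" usually means in SFT, the sketch below masks prompt tokens out of the labels so that only answer tokens contribute to the loss. This is not the repo's actual code: the helper name `build_answer_only_labels` and the token IDs are hypothetical, and the only assumption about the training stack is that the loss skips label `-100`, which is the default `ignore_index` of PyTorch's cross-entropy loss.

```python
# Minimal sketch of answer-only label masking (hypothetical helper, not the repo's code).
IGNORE_INDEX = -100  # default ignore_index of PyTorch's cross-entropy loss


def build_answer_only_labels(prompt_ids, answer_ids):
    """Concatenate prompt and answer tokens; mask the prompt positions in the labels."""
    input_ids = list(prompt_ids) + list(answer_ids)
    # Prompt positions get IGNORE_INDEX so they add nothing to the loss;
    # answer positions keep their token IDs as training targets.
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(answer_ids)
    return input_ids, labels


# Made-up token IDs, for illustration only.
input_ids, labels = build_answer_only_labels([101, 7, 8, 9], [42, 43, 2])
print(input_ids)
print(labels)
```

The model still sees the full prompt as context; the mask only changes which positions are scored.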
- Base model