Update README.md
Browse files
README.md
CHANGED
|
@@ -34,7 +34,7 @@ The base checkpoint is Unsloth’s 4-bit MXFP4 build (`unsloth/Qwen3-14B-unsloth
|
|
| 34 |
- Task: Binary classification (P = synthesizable, U = unsynthesizable)
|
| 35 |
- Training Objective: QLoRA with focal loss (gamma = 2.0, alpha_P = 8.12, alpha_U = 1.0)
|
| 36 |
- Max Sequence Length (train): 2048 tokens; Evaluation: 180 tokens
|
| 37 |
-
- Dataset: 316,442 train (`
|
| 38 |
- Adapter Size: ~981 MB (`adapter_model.safetensors`)
|
| 39 |
|
| 40 |
### Prompt & Thinking Prefill
|
|
|
|
| 34 |
- Task: Binary classification (P = synthesizable, U = unsynthesizable)
|
| 35 |
- Training Objective: QLoRA with focal loss (gamma = 2.0, alpha_P = 8.12, alpha_U = 1.0)
|
| 36 |
- Max Sequence Length (train): 2048 tokens; Evaluation: 180 tokens
|
| 37 |
+
- Dataset: 316,442 train (`train_pu_hem.jsonl`) / 79,114 validation (`validate_pu_hem.jsonl`) samples (~11% P / 89% U)
|
| 38 |
- Adapter Size: ~981 MB (`adapter_model.safetensors`)
|
| 39 |
|
| 40 |
### Prompt & Thinking Prefill
|