# 🧠 Model Card: Sam-2.5-2
## Overview
Sam-2.5-2 is a fine-tuned variant of Sam2.5, optimized for chain-of-thought reasoning on GSM8K. It retains the base model's modular, ablation-ready architecture and demonstrates strong generalization across arithmetic and logic-heavy prompts.
## 🔧 Architecture

| Component | Value |
|---|---|
| Base Model | Sam2.5 |
| Layers | Unchanged |
| Heads | Unchanged |
| FF Multiplier | Unchanged |
| Dropout | Unchanged |
| Tokenizer | AutoTokenizer |
| Shared Weights | `lm_head` → `embed` (cloned during save) |
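
The shared-weights row means the output projection and the input embedding are a single tensor viewed from two modules. A minimal PyTorch sketch of that tie (dimensions and module names here are illustrative, not Sam-2.5-2's actual ones):

```python
import torch.nn as nn

# Illustrative sizes only; the real Sam-2.5-2 dimensions are not stated here.
vocab_size, d_model = 32000, 768

embed = nn.Embedding(vocab_size, d_model)
lm_head = nn.Linear(d_model, vocab_size, bias=False)
lm_head.weight = embed.weight  # lm_head → embed: one tensor, two views

# Both modules now read and write the same underlying storage.
assert lm_head.weight.data_ptr() == embed.weight.data_ptr()
```

The "(cloned during save)" note matters because serializers such as safetensors reject tensors that share storage; see the Checkpointing section for a sketch of the save path.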
## 🧪 Training Details

| Parameter | Value |
|---|---|
| Dataset | GSM8K |
| Epochs | 2 |
| Batch Size | 2 |
| Max Length | 512 tokens |
| Optimizer | AdamW |
| Learning Rate | 1e-4 |
| Replay Mixing | None |
| Early Stopping | Manual checkpointing |
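
For reference, the hyperparameters above map directly onto a standard Hugging Face `TrainingArguments` setup. This is a hedged sketch rather than the actual training script; the output directory and save strategy are assumptions:

```python
from transformers import TrainingArguments

# Values taken from the table above; output_dir and save_strategy are assumed.
args = TrainingArguments(
    output_dir="checkpoints",
    num_train_epochs=2,
    per_device_train_batch_size=2,
    learning_rate=1e-4,
    optim="adamw_torch",     # AdamW
    save_strategy="epoch",   # pairs with the per-epoch manual checkpointing
)
# Max Length (512) is applied at tokenization time, e.g.
# tokenizer(batch, truncation=True, max_length=512).
```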
## 📊 Performance Metrics

| Metric | Epoch 1 | Epoch 2 |
|---|---|---|
| Final Train Loss | 0.7826 | 2.7956 |
| Validation Loss | 2.5932 | 1.8989 |
| Perplexity | 13.37 | 6.68 |
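
The perplexity row is simply the exponentiated validation loss; a quick check:

```python
import math

# Perplexity = exp(validation loss), matching the table above.
for epoch, val_loss in [(1, 2.5932), (2, 1.8989)]:
    print(f"epoch {epoch}: perplexity = {math.exp(val_loss):.2f}")
# epoch 1: perplexity = 13.37
# epoch 2: perplexity = 6.68
```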
## 📝 Output Quality

- ✅ Fluent chain-of-thought steps
- ✅ Accurate arithmetic reasoning
- ✅ Consistent use of scratchpad format (`<<...>>`; illustrated below)
- ✅ Stable token alignment across nested logic
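
The `<<...>>` markers follow GSM8K's calculator-annotation convention, where each bracket pair states an arithmetic step as `expression=result`. A small illustrative checker (the regex and `eval`-based verification are a sketch, not part of the model's pipeline):

```python
import re

# GSM8K-style annotations embed arithmetic as <<expression=result>>.
sample = "Natalia sold 48/2 = <<48/2=24>>24 clips in May."

for expr, result in re.findall(r"<<([^=>]+)=([^>]+)>>", sample):
    # eval on untrusted text is unsafe in general; fine for this toy check.
    assert abs(eval(expr) - float(result)) < 1e-6
```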
## 💾 Checkpointing

- Safe save logic applied to avoid shared-memory errors (sketched below)
- Format: `.safetensors`
- Best model: `checkpoints/epoch_2_loss_1.8989/`
- Final model: `checkpoints/final/`
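
A minimal sketch of that safe-save step, assuming the `lm_head` → `embed` tie from the Architecture section (module names and the output path are illustrative): safetensors refuses to serialize tensors that share storage, so tied weights are cloned into independent buffers before writing.

```python
import torch.nn as nn
from safetensors.torch import save_file

# Tiny stand-in for the tied model; real module names may differ.
embed = nn.Embedding(100, 16)
lm_head = nn.Linear(16, 100, bias=False)
lm_head.weight = embed.weight  # shared storage would make save_file raise

# Clone each tensor so no two entries share memory, then save.
state = {
    "embed.weight": embed.weight.detach().clone().contiguous(),
    "lm_head.weight": lm_head.weight.detach().clone().contiguous(),
}
save_file(state, "model.safetensors")
```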