hf_nld

BabyLM GPT-BERT baseline model.

Main branch

main contains the final model.

Checkpoint branches

Intermediate checkpoints are stored as separate branches. Branch names follow the checkpoint filenames, for example:

  • ckpt-100K
  • ckpt-1M
  • ckpt-10M
  • ckpt-100M
Downloads last month
2,507
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support