hf_nld
BabyLM GPT-BERT baseline model.
Main branch
The main branch contains the final model.
Checkpoint branches
Intermediate checkpoints are stored as separate branches of this repository.
Each branch is named after its checkpoint file, for example:
ckpt-100K
ckpt-1M
ckpt-10M
ckpt-100M
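To load a specific checkpoint, the branch name can be passed as the revision when downloading. The snippet below is a minimal sketch, not the official loading recipe. The helper load_checkpoint and the list EXAMPLE_BRANCHES are illustrative names, the repository id must be supplied by the caller, and the use of AutoModel with trust_remote_code=True is an assumption about how this model is packaged.

```python
# Branch names copied from the examples above; the repository may contain others.
EXAMPLE_BRANCHES = ["ckpt-100K", "ckpt-1M", "ckpt-10M", "ckpt-100M"]


def load_checkpoint(repo_id: str, branch: str = "main"):
    """Load tokenizer and model from one branch of a Hub repository.

    `repo_id` is the full "<namespace>/<name>" id of this repository
    (not stated here, so it is left to the caller). `branch` is "main"
    for the final model or a checkpoint branch such as "ckpt-1M".
    """
    # Imported lazily so the module can be imported without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=branch)
    # Assumption: the architecture ships custom modeling code, so
    # trust_remote_code=True is needed. Drop it if the model loads without it.
    model = AutoModel.from_pretrained(
        repo_id, revision=branch, trust_remote_code=True
    )
    return tokenizer, model
```

For example, load_checkpoint("<namespace>/hf_nld", "ckpt-1M") would fetch the 1M checkpoint, with <namespace> replaced by the actual owner of the repository. Omitting the branch argument loads the final model from main.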