alessiodevoto's picture
Push model using huggingface_hub.
f2786dc verified
{
"dataset_name": "kmfoda/booksum",
"head_dim": 6,
"model_name": "MaxJeblick/llama2-0b-unit-test",
"n_sink": 4,
"num_heads": 2,
"num_layers": 2,
"num_samples": 100,
"sample_seq_len": 1000
}