small-scale pretraining experiments of mine
			
	
	- 
	
	
	  BEE-spoke-data/smol_llama-101M-GQAText Generation • 0.1B • Updated • 873 • 29
- 
	
	
	  BEE-spoke-data/smol_llama-220M-GQAText Generation • 0.2B • Updated • 1.18k • 13
- 
	
	
	  BEE-spoke-data/smol_llama-220M-GQA-fineweb_eduText Generation • 0.2B • Updated • 15 • 1
- 
	
	
	  BEE-spoke-data/smol_llama-81M-tiedText Generation • 81.3M • Updated • 559 • 9

 
								 
								






