Models Trained on Redpajama-CC

AI & ML interests
Data Selection for Language Models

models 9

Data-Selection/PDS-470M
Text Generation • 0.5B • 6

Data-Selection/PDS-160M
Text Generation • 0.2B • 8

Data-Selection/PDS-1B
Text Generation • 1

Data-Selection/PDS-1.7B
Text Generation • 2

Data-Selection/BSL-1.7B
Text Generation • 1

Data-Selection/data_scorer

Data-Selection/BSL-1B
Text Generation

Data-Selection/BSL-470M
Text Generation • 2

Data-Selection/BSL-160M
Text Generation • 56

datasets 0
None public yet