Does this model use the same format as openchat 3.5?
#2
by
						
tarruda
	
							
						- opened
							
					
Seeing the same thing with llama.cpp (not python) and the same GGUF:Therefore, Jane is faster than Rahul.abbabbababbabbbababbabbbababbabbababbabababbabababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbababbab
Just changed the EOS token, should be good now!
alpayariyak
	
				
		changed discussion status to
		closed
			
Issue is fixed on latest GGUF upload, thanks.



