Question regarding the Stage 1 training procedure
#1
by
						
floschne
	
							
						- opened
							
					
Hi, and first of all thanks for making your models and datasets open-source!
I just read your paper and was wondering how, i.e., with which data, you trained the MLP projector in Stage 1? Did you use multilingual image captions or english-only?