This model is the MaskBit tokenizer with 16-bit tokens, i.e. a vocabulary of 2^16 = 65,536 codes. It uses a downsampling factor of 16 and was trained on ImageNet at a resolution of 256×256.
You can find more details on the project page and in the paper.
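
The numbers above imply a fixed token budget per image. The following is a minimal sketch of that arithmetic, assuming square inputs and one 16-bit code per latent position. The helper names `token_grid_summary` and `bits_to_index` are hypothetical and are not part of the MaskBit codebase, and the most-significant-bit-first ordering is an assumption made for illustration only.

```python
def token_grid_summary(resolution: int = 256, downsample: int = 16, bits: int = 16) -> dict:
    """Derive the token-grid size and vocabulary size from the stated settings."""
    side = resolution // downsample          # latent positions per side
    num_tokens = side * side                 # tokens per image
    return {
        "grid": (side, side),
        "num_tokens": num_tokens,
        "vocab_size": 2 ** bits,             # distinct codes a token can take
        "bits_per_image": num_tokens * bits,
    }


def bits_to_index(bits: list) -> int:
    """Pack a list of 0/1 values into an integer index (MSB first, by assumption)."""
    index = 0
    for b in bits:
        index = (index << 1) | int(b)
    return index


if __name__ == "__main__":
    summary = token_grid_summary()
    print(summary)
    print(bits_to_index([1] + [0] * 15))
```

Under these assumptions a 256×256 image maps to a 16×16 grid of 256 tokens, each drawn from 65,536 possible codes.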