ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models Paper • 2505.12534 • Published May 18 • 3
Scaling Image Tokenizers with Grouped Spherical Quantization Paper • 2412.02632 • Published Dec 3, 2024 • 10