CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers Paper • 2305.17455 • Published May 27, 2023
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models Paper • 2507.14204 • Published Jul 14
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Paper • 2301.13741 • Published Jan 31, 2023 • 1
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Paper • 2510.05069 • Published Oct 6 • 12