Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data Paper • 2505.05427 • Published May 8 • 4
InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation Paper • 2509.24663 • Published Sep 29 • 13
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 28 items • Updated Sep 1 • 56
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8 • 27
CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction Paper • 1807.02478 • Published Jul 4, 2018
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder Paper • 2304.04052 • Published Apr 8, 2023
FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation Paper • 1810.10147 • Published Oct 24, 2018
ConPET: Continual Parameter-Efficient Tuning for Large Language Models Paper • 2309.14763 • Published Sep 26, 2023 • 1