Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data Paper • 2505.05427 • Published May 8, 2025
InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation Paper • 2509.24663 • Published Sep 29, 2025
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 28 items • Updated Sep 1, 2025
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8, 2025
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated Sep 8, 2025
CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction Paper • 1807.02478 • Published Jul 4, 2018
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder Paper • 2304.04052 • Published Apr 8, 2023