AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons Paper • 2503.05731 • Published Feb 19 • 3
LM4HPC: Towards Effective Language Model Application in High-Performance Computing Paper • 2306.14979 • Published Jun 26, 2023
AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions Paper • 2509.13523 • Published Sep 16 • 7
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models Paper • 2508.17467 • Published Aug 24
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference Paper • 2509.04377 • Published Sep 4
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies Paper • 2310.04610 • Published Oct 6, 2023 • 1
Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study Paper • 2211.02092 • Published Nov 3, 2022