MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models Paper • 2508.17467 • Published Aug 24, 2025
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference Paper • 2509.04377 • Published Sep 4, 2025
LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference Paper • 2509.02753 • Published Sep 2, 2025
ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models Paper • 2510.01582 • Published Oct 2, 2025
AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons Paper • 2503.05731 • Published Feb 19, 2025 • 3
LM4HPC: Towards Effective Language Model Application in High-Performance Computing Paper • 2306.14979 • Published Jun 26, 2023
AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions Paper • 2509.13523 • Published Sep 16, 2025 • 7
Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting Paper • 2509.25631 • Published Sep 30, 2025 • 2
FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices using a Computing Power Aware Scheduler Paper • 2309.14675 • Published Sep 26, 2023 • 1
EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants Paper • 2502.20309 • Published Feb 27, 2025
Pathology Image Compression with Pre-trained Autoencoders Paper • 2503.11591 • Published Mar 14, 2025 • 5
PixCell: A generative foundation model for digital histopathology images Paper • 2506.05127 • Published Jun 5, 2025 • 3
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies Paper • 2310.04610 • Published Oct 6, 2023 • 1
Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study Paper • 2211.02092 • Published Nov 3, 2022
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20, 2024 • 30
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation Paper • 2304.05430 • Published Apr 11, 2023
A Survey of Techniques for Optimizing Transformer Inference Paper • 2307.07982 • Published Jul 16, 2023