EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test
Paper: arXiv:2503.01840
Post-trained EAGLE3 draft model for moonshotai/Kimi-K2.5, based on lightseekorg/kimi-k2.5-eagle3.
This model is a speculative decoding draft model trained with the EAGLE3 architecture. It accelerates inference of Kimi-K2.5 by drafting multiple candidate tokens per step, which the target model then verifies in a single parallel forward pass.
Starting from the base EAGLE3 draft model lightseekorg/kimi-k2.5-eagle3, this checkpoint was further post-trained on open-source coding datasets.
This model is intended to be used as a draft model with EAGLE3-compatible inference engines such as vLLM or SGLang.
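As a sketch of how a draft model like this is typically wired up, the launch command below starts an SGLang server with EAGLE3 speculative decoding. The flag names follow recent SGLang releases and may differ across versions; `<this-repo>` is a placeholder for this model's repository id, and the speculative-decoding step/top-k values are illustrative assumptions, not tuned recommendations.

```shell
# Serve Kimi-K2.5 with this EAGLE3 draft model via SGLang.
# NOTE: flag names may vary by SGLang version; values below are illustrative.
python -m sglang.launch_server \
  --model-path moonshotai/Kimi-K2.5 \
  --speculative-algorithm EAGLE3 \
  --speculative-draft-model-path <this-repo> \
  --speculative-num-steps 3 \
  --speculative-eagle-topk 4 \
  --speculative-num-draft-tokens 8
```

vLLM exposes an analogous `speculative_config` option for EAGLE3 draft models; consult the engine's documentation for the exact parameter names in your installed version.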