YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
How to use To use Eagle3 with SGLang, first replace the qwen3.py file in SGLang鈥檚 directory (sglang/python/sglang/srt/models/) with the qwen3.py file from this project.
The launch command for using Eagle3 with SGLang is:
python3 -m sglang.launch_server --model Qwen/Qwen3-4B-Instruct-2507 --speculative-algorithm EAGLE3 --speculative-draft-model-path Tengyunw/qwen3_4b_eagle3 --speculative-num-steps 6 --speculative-eagle-topk 10 --speculative-num-draft-tokens 32 --mem-fraction 0.9 --cuda-graph-max-bs 2 --dtype bfloat16
- Downloads last month
- 8
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
馃檵
Ask for provider support