togethercomputer
/

Aurora-Spec-Qwen3-Coder-Next-FP8

Text Generation

speculative-decoding

inference-time-training

code-generation

Model card Files Files and versions

jisenli commited on Feb 24

Commit

6c184ec

·

verified ·

1 Parent(s): 8c36afc

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -170,7 +170,7 @@ def main():
     llm = sgl.Engine(
         model_path="Qwen/Qwen3-Coder-Next-FP8",
         speculative_draft_model_path="togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8",
-        speculative_algorithm="EAGLE",
         speculative_num_steps=5,
         speculative_eagle_topk=1,
         speculative_num_draft_tokens=6,
@@ -199,7 +199,7 @@ if __name__ == "__main__":
 python -m sglang.launch_server \
     --model-path Qwen/Qwen3-Coder-Next-FP8 \
     --speculative-draft-model-path togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8 \
-    --speculative-algorithm EAGLE \
     --speculative-num-steps 5 \
     --speculative-eagle-topk 1 \
     --speculative-num-draft-tokens 6 \
@@ -259,7 +259,7 @@ If you have downloaded the models locally, replace the HuggingFace model paths w
 python -m sglang.launch_server \
     --model-path /path/to/Qwen3-Coder-Next-FP8 \
     --speculative-draft-model-path /path/to/Aurora-Spec-Qwen3-Coder-Next-FP8 \
-    --speculative-algorithm EAGLE \
     --speculative-num-steps 5 \
     --speculative-eagle-topk 1 \
     --speculative-num-draft-tokens 6 \

     llm = sgl.Engine(
         model_path="Qwen/Qwen3-Coder-Next-FP8",
         speculative_draft_model_path="togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8",
+        speculative_algorithm="EAGLE3",
         speculative_num_steps=5,
         speculative_eagle_topk=1,
         speculative_num_draft_tokens=6,
 python -m sglang.launch_server \
     --model-path Qwen/Qwen3-Coder-Next-FP8 \
     --speculative-draft-model-path togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8 \
+    --speculative-algorithm EAGLE3 \
     --speculative-num-steps 5 \
     --speculative-eagle-topk 1 \
     --speculative-num-draft-tokens 6 \
 python -m sglang.launch_server \
     --model-path /path/to/Qwen3-Coder-Next-FP8 \
     --speculative-draft-model-path /path/to/Aurora-Spec-Qwen3-Coder-Next-FP8 \
+    --speculative-algorithm EAGLE3 \
     --speculative-num-steps 5 \
     --speculative-eagle-topk 1 \
     --speculative-num-draft-tokens 6 \