Text Generation
Transformers
Safetensors
step3p5
conversational
custom_code
fp8
WinstonDeng commited on
Commit
433124f
·
verified ·
1 Parent(s): d16e6d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -10,11 +10,12 @@ base_model:
10
 
11
  <div align="center" style="display: flex; justify-content: center; align-items: center;">
12
  <img src="stepfun.svg" width="25" style="margin-right: 10px;"/>
13
- <h1 style="margin: 0; border-bottom: none;">Step-3.5-Flash</h1>
14
  </div>
15
 
16
  [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20HF-StepFun/STEP3p5-preview)](https://huggingface.co/stepfun-ai/Step-3.5-Flash-FP8)
17
  [![ModelScope](https://img.shields.io/badge/ModelScope-StepFun/STEP3p5-preview)](https://huggingface.co/stepfun-ai/step3p5_preview/tree/main)
 
18
  [![Paper](https://img.shields.io/badge/Paper-Arxiv-red)](https://huggingface.co/stepfun-ai/Step-3.5-Flash-FP8)
19
  [![License](https://img.shields.io/badge/License-Apache%202.0-green)]()
20
 
@@ -22,7 +23,7 @@ base_model:
22
 
23
  ## 1. Introduction
24
 
25
- **Step 3.5 Flash** is our most capable open-source foundation model, engineered to deliver frontier reasoning and agentic capabilities with exceptional efficiency. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token. This "intelligence density" allows it to rival the reasoning depth of top-tier proprietary models, while maintaining the agility required for real-time interaction.
26
 
27
  ## 2. Key Capabilities
28
 
@@ -213,7 +214,7 @@ vllm serve <MODEL_PATH_OR_HF_ID> \
213
  1. Install SGLang.
214
  ```bash
215
  # via Docker
216
- docker pull lmsysorg/sglang:latest
217
  # or from source (pip)
218
  pip install "sglang[all] @ git+https://github.com/sgl-project/sglang.git"
219
  ```
 
10
 
11
  <div align="center" style="display: flex; justify-content: center; align-items: center;">
12
  <img src="stepfun.svg" width="25" style="margin-right: 10px;"/>
13
+ <h1 style="margin: 0; border-bottom: none;">Step 3.5 Flash</h1>
14
  </div>
15
 
16
  [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20HF-StepFun/STEP3p5-preview)](https://huggingface.co/stepfun-ai/Step-3.5-Flash-FP8)
17
  [![ModelScope](https://img.shields.io/badge/ModelScope-StepFun/STEP3p5-preview)](https://huggingface.co/stepfun-ai/step3p5_preview/tree/main)
18
+ [![Webpage](https://img.shields.io/badge/Webpage-Blog-blue)](https://static.stepfun.com/blog/step-3.5-flash/)
19
  [![Paper](https://img.shields.io/badge/Paper-Arxiv-red)](https://huggingface.co/stepfun-ai/Step-3.5-Flash-FP8)
20
  [![License](https://img.shields.io/badge/License-Apache%202.0-green)]()
21
 
 
23
 
24
  ## 1. Introduction
25
 
26
+ **Step 3.5 Flash** ([visit website](https://static.stepfun.com/blog/step-3.5-flash/)) is our most capable open-source foundation model, engineered to deliver frontier reasoning and agentic capabilities with exceptional efficiency. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token. This "intelligence density" allows it to rival the reasoning depth of top-tier proprietary models, while maintaining the agility required for real-time interaction.
27
 
28
  ## 2. Key Capabilities
29
 
 
214
  1. Install SGLang.
215
  ```bash
216
  # via Docker
217
+ docker pull lmsysorg/sglang:dev-pr-18084
218
  # or from source (pip)
219
  pip install "sglang[all] @ git+https://github.com/sgl-project/sglang.git"
220
  ```