Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
open-r1
dapo
trl
conversational
text-generation-inference
Instructions to use kangdawei/DAPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use kangdawei/DAPO with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="kangdawei/DAPO") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("kangdawei/DAPO") model = AutoModelForCausalLM.from_pretrained("kangdawei/DAPO") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use kangdawei/DAPO with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "kangdawei/DAPO" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kangdawei/DAPO", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/kangdawei/DAPO
- SGLang
How to use kangdawei/DAPO with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "kangdawei/DAPO" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kangdawei/DAPO", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "kangdawei/DAPO" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kangdawei/DAPO", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use kangdawei/DAPO with Docker Model Runner:
docker model run hf.co/kangdawei/DAPO
Training in progress, step 170
Browse files- dynamic_sampling_log.txt +79 -0
- model.safetensors +1 -1
dynamic_sampling_log.txt
CHANGED
|
@@ -1284,3 +1284,82 @@ Step Attempts Total Prompts Valid Prompts Efficiency
|
|
| 1284 |
159 5 30 6 20.0 %
|
| 1285 |
159 10 60 6 10.0 %
|
| 1286 |
160 1 6 6 100.0 %
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1284 |
159 5 30 6 20.0 %
|
| 1285 |
159 10 60 6 10.0 %
|
| 1286 |
160 1 6 6 100.0 %
|
| 1287 |
+
160 2 12 6 50.0 %
|
| 1288 |
+
160 3 18 6 33.3 %
|
| 1289 |
+
160 4 24 6 25.0 %
|
| 1290 |
+
160 4 24 6 25.0 %
|
| 1291 |
+
160 2 12 6 50.0 %
|
| 1292 |
+
160 7 42 6 14.3 %
|
| 1293 |
+
160 2 12 6 50.0 %
|
| 1294 |
+
161 2 12 6 50.0 %
|
| 1295 |
+
161 5 30 6 20.0 %
|
| 1296 |
+
161 2 12 6 50.0 %
|
| 1297 |
+
161 1 6 6 100.0 %
|
| 1298 |
+
161 2 12 6 50.0 %
|
| 1299 |
+
161 1 6 6 100.0 %
|
| 1300 |
+
161 4 24 6 25.0 %
|
| 1301 |
+
161 2 12 6 50.0 %
|
| 1302 |
+
162 9 54 6 11.1 %
|
| 1303 |
+
162 5 30 6 20.0 %
|
| 1304 |
+
162 3 18 6 33.3 %
|
| 1305 |
+
162 2 12 6 50.0 %
|
| 1306 |
+
162 6 36 6 16.7 %
|
| 1307 |
+
162 1 6 6 100.0 %
|
| 1308 |
+
162 4 24 6 25.0 %
|
| 1309 |
+
162 1 6 6 100.0 %
|
| 1310 |
+
163 4 24 6 25.0 %
|
| 1311 |
+
163 1 6 6 100.0 %
|
| 1312 |
+
163 2 12 6 50.0 %
|
| 1313 |
+
163 7 42 6 14.3 %
|
| 1314 |
+
163 7 42 6 14.3 %
|
| 1315 |
+
163 5 30 6 20.0 %
|
| 1316 |
+
163 10 60 6 10.0 %
|
| 1317 |
+
163 1 6 6 100.0 %
|
| 1318 |
+
164 3 18 6 33.3 %
|
| 1319 |
+
164 5 30 6 20.0 %
|
| 1320 |
+
164 10 60 6 10.0 %
|
| 1321 |
+
164 3 18 6 33.3 %
|
| 1322 |
+
164 3 18 6 33.3 %
|
| 1323 |
+
164 1 6 6 100.0 %
|
| 1324 |
+
164 2 12 6 50.0 %
|
| 1325 |
+
164 5 30 6 20.0 %
|
| 1326 |
+
165 4 24 6 25.0 %
|
| 1327 |
+
165 3 18 6 33.3 %
|
| 1328 |
+
165 4 24 6 25.0 %
|
| 1329 |
+
165 3 18 6 33.3 %
|
| 1330 |
+
165 1 6 6 100.0 %
|
| 1331 |
+
165 4 24 6 25.0 %
|
| 1332 |
+
165 6 36 6 16.7 %
|
| 1333 |
+
165 4 24 6 25.0 %
|
| 1334 |
+
166 1 6 6 100.0 %
|
| 1335 |
+
166 3 18 6 33.3 %
|
| 1336 |
+
166 2 12 6 50.0 %
|
| 1337 |
+
166 4 24 6 25.0 %
|
| 1338 |
+
166 2 12 6 50.0 %
|
| 1339 |
+
166 2 12 6 50.0 %
|
| 1340 |
+
166 3 18 6 33.3 %
|
| 1341 |
+
166 2 12 6 50.0 %
|
| 1342 |
+
167 2 12 6 50.0 %
|
| 1343 |
+
167 9 54 6 11.1 %
|
| 1344 |
+
167 3 18 6 33.3 %
|
| 1345 |
+
167 3 18 6 33.3 %
|
| 1346 |
+
167 3 18 6 33.3 %
|
| 1347 |
+
167 2 12 6 50.0 %
|
| 1348 |
+
167 1 6 6 100.0 %
|
| 1349 |
+
167 5 30 6 20.0 %
|
| 1350 |
+
168 3 18 6 33.3 %
|
| 1351 |
+
168 4 24 6 25.0 %
|
| 1352 |
+
168 10 60 6 10.0 %
|
| 1353 |
+
168 2 12 6 50.0 %
|
| 1354 |
+
168 4 24 6 25.0 %
|
| 1355 |
+
168 4 24 6 25.0 %
|
| 1356 |
+
168 6 36 6 16.7 %
|
| 1357 |
+
168 4 24 6 25.0 %
|
| 1358 |
+
169 6 36 6 16.7 %
|
| 1359 |
+
169 2 12 6 50.0 %
|
| 1360 |
+
169 1 6 6 100.0 %
|
| 1361 |
+
169 3 18 6 33.3 %
|
| 1362 |
+
169 4 24 6 25.0 %
|
| 1363 |
+
169 2 12 6 50.0 %
|
| 1364 |
+
169 9 54 6 11.1 %
|
| 1365 |
+
169 5 30 6 20.0 %
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 3554214752
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9dd192b9b8f974834b874de0a750b3d0c5ad60fc9bc620ad906e5fc42ed46426
|
| 3 |
size 3554214752
|