rokugatsu
/

LLM2025_Advanced_DPO_5

Text Generation

Model card Files Files and versions

LLM2025_Advanced_DPO_5

8.06 GB

Ctrl+K

Ctrl+K

1 contributor

History: 5 commits

rokugatsu's picture

Upload DPO-trained Qwen3-4B-Instruct-2507 model

6a79da5 verified 3 months ago

.gitattributes

1.57 kB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
README.md

2.41 kB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
added_tokens.json

707 Bytes
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
chat_template.jinja

2.63 kB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
config.json

1.57 kB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
generation_config.json

211 Bytes
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
merges.txt

1.67 MB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
model-00001-of-00002.safetensors

4.97 GB
xet

Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
model-00002-of-00002.safetensors

3.08 GB
xet

Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
model.safetensors.index.json

32.9 kB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
special_tokens_map.json

613 Bytes
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
tokenizer.json

11.4 MB
xet

Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
tokenizer_config.json

5.41 kB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago
vocab.json

2.78 MB
Upload DPO-trained Qwen3-4B-Instruct-2507 model 3 months ago