Steven ZHANG's picture

1 1

Steven ZHANG PRO

STEVENZHANG904

·

AI & ML interests

AI for Science, Multimodal ML, AI for Info Sec

Recent Activity

updated a collection 19 days ago

Qwen3 Small SFT — MAS Traces

updated a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-planner-sft

published a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-planner-sft

View all activity

Organizations

updated a collection 19 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-planner-sft

Text Generation • 4B • Updated 19 days ago • 32

published a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-planner-sft

Text Generation • 4B • Updated 19 days ago • 32

updated a collection 19 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-verifier-sft

Text Generation • 4B • Updated 19 days ago • 34

published a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-verifier-sft

Text Generation • 4B • Updated 19 days ago • 34

updated a collection 19 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-executor-sft

Text Generation • 4B • Updated 19 days ago • 27

published a model 19 days ago

STEVENZHANG904/Qwen3-4B-Instruct-2507-executor-sft

Text Generation • 4B • Updated 19 days ago • 27

updated a collection 22 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 22 days ago

STEVENZHANG904/Qwen3-1.7B-planner-sft

Text Generation • 2B • Updated 22 days ago • 42

published a model 22 days ago

STEVENZHANG904/Qwen3-1.7B-planner-sft

Text Generation • 2B • Updated 22 days ago • 42

updated a collection 23 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 23 days ago

STEVENZHANG904/Qwen3-1.7B-verifier-sft

Text Generation • 2B • Updated 23 days ago • 43

published a model 23 days ago

STEVENZHANG904/Qwen3-1.7B-verifier-sft

Text Generation • 2B • Updated 23 days ago • 43

updated a collection 23 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 23 days ago

STEVENZHANG904/Qwen3-0.6B-verifier-sft

Text Generation • 0.6B • Updated 23 days ago • 40

published a model 23 days ago

STEVENZHANG904/Qwen3-0.6B-verifier-sft

Text Generation • 0.6B • Updated 23 days ago • 40

updated a collection 23 days ago

Qwen3 Small SFT — MAS Traces

Qwen3-0.6B / 1.7B SFT-distilled from Qwen3-32B on Divij/qwen3-32b-mas-traces (planner/executor/verifier). 4 epochs, bf16. • 9 items • Updated 19 days ago

updated a model 23 days ago

STEVENZHANG904/Qwen3-1.7B-executor-sft

Text Generation • 2B • Updated 23 days ago • 43