6 1 12

Jiangze Yan

Kadins

Kidand

AI & ML interests

Main areas of interest include large language models and multimodal models.

Recent Activity

authored a paper 4 days ago

DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models

authored a paper 4 days ago

HiMPO: Hindsight-Informed Memory Policy Optimization for Less-Entangled Credit in Long-Horizon Agents

authored a paper 4 days ago

HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

View all activity

Organizations

authored 3 papers 4 days ago

DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models

Paper • 2503.04472 • Published Jan 12

HiMPO: Hindsight-Informed Memory Policy Optimization for Less-Entangled Credit in Long-Horizon Agents

Paper • 2606.16285 • Published 6 days ago • 1

HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

Paper • 2603.10359 • Published Mar 11

upvoted a paper 4 days ago

HiMPO: Hindsight-Informed Memory Policy Optimization for Less-Entangled Credit in Long-Horizon Agents

Paper • 2606.16285 • Published 6 days ago • 1

updated a model 12 months ago

Kadins/deepseek-r1-32b-dapo-0702

Text Generation • 33B • Updated Jul 2, 2025 • 1

published a model 12 months ago

Kadins/deepseek-r1-32b-dapo-0702

Text Generation • 33B • Updated Jul 2, 2025 • 1

New activity in BAAI/TACO about 1 year ago

The dataset cannot be downloaded using load_dataset().

#6 opened about 1 year ago by

Kadins

updated a model over 1 year ago

Kadins/DeepSeek-R1-Distill-Qwen-7B-GRPO

Text Generation • 8B • Updated Mar 8, 2025 • 2

published a model over 1 year ago

Kadins/DeepSeek-R1-Distill-Qwen-7B-GRPO

Text Generation • 8B • Updated Mar 8, 2025 • 2

updated a model over 1 year ago

Kadins/Qwen-2.5-7B-Math-RL

Text Generation • 8B • Updated Mar 5, 2025 • 7

published a model over 1 year ago

Kadins/Qwen-2.5-7B-Math-RL

Text Generation • 8B • Updated Mar 5, 2025 • 7

updated a model over 1 year ago

Kadins/Qwen-2.5-7B-Instruct-GRPO

Text Generation • 15B • Updated Mar 3, 2025 • 8

published a model over 1 year ago

Kadins/Qwen-2.5-7B-Instruct-GRPO

Text Generation • 15B • Updated Mar 3, 2025 • 8

updated 2 models over 1 year ago

Kadins/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Feb 27, 2025 • 6

Kadins/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Feb 27, 2025 • 6

published a model over 1 year ago

Kadins/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Feb 27, 2025 • 6

Jiangze Yan

AI & ML interests

Recent Activity

Organizations

Kadins's activity

The dataset cannot be downloaded using load_dataset().