Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Ruiyi Wang
ruiyiwang
Follow
https://ruiyiw.github.io
RuiyiWang153
ruiyiw
AI & ML interests
social agents, LLM reasoning, reinforcement learning
Organizations
None yet
ruiyiwang
's models
18
Sort: Recently updated
ruiyiwang/swegym-qwen-8b-lines-complex-easy-grpo-final
Updated
Nov 26, 2025
ruiyiwang/alfworld-qwen-7b-sft-admissible
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen-8b-tests-complex-hard-grpo-final-short
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen-8b-lines-complex-medium-grpo-final-short
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen-8b-lines-complex-hard-grpo-final-short
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen-8b-tests-complex-medium-grpo-final-short
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen-8b-lines-complex-easy-grpo-final-short
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen-8b-tests-complex-easy-grpo-final
Updated
Nov 26, 2025
ruiyiwang/swegym-qwen3-8b-env-lines-complexity-medium-grpo-basic
Updated
Nov 25, 2025
ruiyiwang/swegym-qwen3-8b-env-lines-complexity-hard-grpo-basic
Updated
Nov 25, 2025
ruiyiwang/swegym-qwen3-8b-env-tests-complexity-easy-grpo-basic
Updated
Nov 25, 2025
ruiyiwang/swegym-qwen3-8b-env-tests-complexity-hard-grpo-basic
Updated
Nov 25, 2025
ruiyiwang/swegym-qwen3-8b-env-lines-complexity-easy-grpo-basic
Updated
Nov 25, 2025
ruiyiwang/swegym-qwen-8b-lines-complex-easy-grpo-basic
Updated
Nov 25, 2025
ruiyiwang/swegym-qwen3-8b-env-tests-complexity-medium-grpo-basic
Updated
Nov 24, 2025
ruiyiwang/SFT-alfworld-text-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-text-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025