Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
2
Ruiyi Wang
ruiyiwang
Follow
https://ruiyiw.github.io
RuiyiWang153
ruiyiw
AI & ML interests
social agents, LLM reasoning, reinforcement learning
Recent Activity
updated
a dataset
28 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits
published
a dataset
28 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits
updated
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3
View all activity
Organizations
None yet
ruiyiwang
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a dataset
28 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits
Viewer
•
Updated
28 days ago
•
8.9k
•
217
published
a dataset
28 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits
Viewer
•
Updated
28 days ago
•
8.9k
•
217
updated
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3
Updated
Apr 23
published
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3
Updated
Apr 23
updated
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-2
Updated
Apr 23
published
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-2
Updated
Apr 23
updated
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4
Updated
Apr 22
published
a model
about 2 months ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4
Updated
Apr 22
New activity in
PEARLS-Lab/robocasa-composite-raw-videos
3 months ago
Remove root-level episode files (should be under task folders)
#1 opened 3 months ago by
ruiyiwang
updated
a model
6 months ago
ruiyiwang/alfworld-qwen-7b-sft-admissible
Updated
Nov 26, 2025
published
a model
6 months ago
ruiyiwang/alfworld-qwen-7b-sft-admissible
Updated
Nov 26, 2025
liked
a dataset
7 months ago
PEARLS-Lab/meow-tea-taro-dataset
Updated
Mar 5
•
139
•
2
liked
a model
7 months ago
KAKA22/CodeRM-8B
Text Generation
•
8B
•
Updated
Jan 20, 2025
•
43
•
•
6
updated
a dataset
7 months ago
ruiyiwang/meow-tea-oolong-dataset
Viewer
•
Updated
Nov 21, 2025
•
13.1k
•
7
updated
3 models
7 months ago
ruiyiwang/SFT-alfworld-text-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-text-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
published
3 models
7 months ago
ruiyiwang/SFT-alfworld-text-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-text-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
ruiyiwang/SFT-alfworld-visual-only-Qwen2.5-VL-7B-Instruct
Updated
Nov 20, 2025
Load more