Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1040.7
TFLOPS
3
20
26
Erfan Shayegani 😈
Erfan-Shayegani
Follow
Fishtiks's profile picture
shtefcs's profile picture
taesiri's profile picture
9 followers
·
8 following
https://erfanshayegani.github.io/
Erf_Shayegani
erfanshayegani
erfan-shayegani
AI & ML interests
AI Safety - Responsible AI - Multi-Modal Alignment
Recent Activity
updated
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingRight
published
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingRight
updated
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingLeft
View all activity
Organizations
Erfan-Shayegani
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingRight
Updated
12 days ago
published
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingRight
Updated
12 days ago
updated
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingLeft
Updated
12 days ago
published
a model
12 days ago
Erfan-Shayegani/smolgrpo2-paddingLeft
Updated
12 days ago
updated
a model
12 days ago
Erfan-Shayegani/Qwen2-0-5B-GRPO-vllm-trl
Updated
12 days ago
published
a model
12 days ago
Erfan-Shayegani/Qwen2-0-5B-GRPO-vllm-trl
Updated
12 days ago
updated
a model
13 days ago
Erfan-Shayegani/wordle-grpo-Qwen3-1.7B-test
Text Generation
•
2B
•
Updated
13 days ago
•
16
published
a model
14 days ago
Erfan-Shayegani/wordle-grpo-Qwen3-1.7B-test
Text Generation
•
2B
•
Updated
13 days ago
•
16
upvoted
an
article
14 days ago
view article
Article
GRPO for GUI Grounding Done Right
Jun 11, 2025
•
37
published
a model
15 days ago
Erfan-Shayegani/browsergym-grpo-functiongemma-270m-it
Updated
15 days ago
updated
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-GRPO-corrected-formatreward
Updated
15 days ago
published
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-GRPO-corrected-formatreward
Updated
15 days ago
liked
a Space
15 days ago
Running
RL
1
BrowserGym Environment Server
🌐
1
Control a simulated environment via text actions
updated
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-GRPO
Updated
15 days ago
published
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-GRPO
Updated
15 days ago
updated
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-2
Updated
15 days ago
published
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-2
Updated
15 days ago
updated
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking
Updated
15 days ago
published
a model
15 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking
Updated
15 days ago
updated
a model
16 days ago
Erfan-Shayegani/Qwen-3B-GRPO-gsm8k
Updated
16 days ago
Load more