Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
4
2
2
Rohan Surana
rohan2810
Follow
rohan2810
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization
submitted
a paper
3 days ago
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking
upvoted
a
paper
10 days ago
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning
View all activity
Organizations
None yet
rohan2810
's models
32
Sort: Recently updated
rohan2810/llama-pii-ori
Updated
Dec 4, 2024
rohan2810/llama-pii-syn
Updated
Dec 4, 2024
Previous
1
2
Next