Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Rohan Surana's picture
4 2 2

Rohan Surana

rohan2810
  • rohan2810

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization
submitted a paper 3 days ago
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking
upvoted a paper 10 days ago
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning
View all activity

Organizations

None yet

rohan2810 's models 32

rohan2810/llama-pii-ori

Updated Dec 4, 2024

rohan2810/llama-pii-syn

Updated Dec 4, 2024
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs