Tencent-Hunyuan-Multimodal-RL

company
https://TODO

AI & ML interests

None defined yet.

Recent Activity

sumailmao updated a collection about 11 hours ago: Flow-DPPO: GenEval2 Reward LoRA Adapters
P2333 submitted a paper about 14 hours ago: Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
zhouxiangxin submitted a paper about 15 hours ago: Rethinking the Divergence Regularization in LLM RL

Members: Xiangxin Zhou, Lazy Beaver, Boye Niu, Ruoyu, Jiarui Yao, Jiaqi Tang, Tianyu Pang, PU JIAN, sumail, Lvfang Tao

Tencent-Hunyuan-Multimodal-RL's papers (2)

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
Submitted by Tianyu Pang

Rethinking the Divergence Regularization in LLM RL
Submitted by Xiangxin Zhou