Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Tony Congqian Wang's picture
6 14 1

Tony Congqian Wang

TonyCWang

AI & ML interests

None yet

Organizations

None yet

commented 3 papers 7 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229 •
9

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 45 •
4

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57 •
4
commented a paper 8 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 •
50
New activity in timm/vit_little_patch16_reg4_gap_256.sbb_in1k 9 months ago

Loss exploding to nan

31
#1 opened 10 months ago by
tony0278611
commented 2 papers 11 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 •
50

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 •
50
New activity in timm/mobilenetv4_conv_aa_large.e230_r448_in12k_ft_in1k 11 months ago

Training recipe

#2 opened 11 months ago by
TonyCWang
commented 2 papers 11 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 •
50

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 •
50
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs