
ZZ

ZR8

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago
Post-Trained MoE Can Skip Half Experts via Self-Distillation
new activity 9 months ago
renjiepi/G-LLaVA-7B-align:Is this model only pretrained and not finetuned
upvoted a paper 9 months ago
Towards a Unified View of Large Language Model Post-Training

Organizations

Hugging Face Discord Community

upvoted a paper 13 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 14 days ago • 30
New activity in renjiepi/G-LLaVA-7B-align 9 months ago

Is this model only pretrained and not finetuned

#1 opened 9 months ago by ZR8
upvoted a paper 9 months ago

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 76
liked a dataset 10 months ago

JiayuLei/RadGenome-Brain_MRI

Viewer • Updated Jul 10, 2024 • 3 • 46 • 7
liked 2 models 11 months ago

csuhan/Tar-1.5B

Any-to-Any • 3B • Updated Jul 2, 2025 • 615 • 2

chaoyi-wu/RadFM

Updated Aug 31, 2023 • 20
liked a model about 1 year ago

microsoft/rad-dino

Image Feature Extraction • 86.6M • Updated 19 days ago • 382k • 75
upvoted a paper about 1 year ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14, 2025 • 28