ZZ

ZR8

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 14 days ago • 30

New activity in renjiepi/G-LLaVA-7B-align 9 months ago

Is this model only pretrained and not finetuned

#1 opened 9 months ago by

upvoted a paper 9 months ago

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 76

liked a dataset 10 months ago

JiayuLei/RadGenome-Brain_MRI

Viewer • Updated Jul 10, 2024 • 3 • 46 • 7

liked 2 models 11 months ago

csuhan/Tar-1.5B

Any-to-Any • 3B • Updated Jul 2, 2025 • 615 • 2

chaoyi-wu/RadFM

Updated Aug 31, 2023 • 20

liked a model about 1 year ago

microsoft/rad-dino

Image Feature Extraction • 86.6M • Updated 19 days ago • 382k • 75

upvoted a paper about 1 year ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14, 2025 • 28