Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Rummy's picture
6 14

Rummy

yang31210999
wanng's profile picture John6666's profile picture zunhai's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
updated a model 14 days ago
yang31210999/result-weight-similarity-0327_ICML_Rebuttal
published a model 14 days ago
yang31210999/result-weight-similarity-0327_ICML_Rebuttal
View all activity

Organizations

Tsinghua IIGroup's profile picture

yang31210999 's models 129

yang31210999/Qwen3-1.7B-AWQ-2b

2B • Updated Jul 26, 2025

yang31210999/Qwen3-0.6B-AWQ-3b

0.6B • Updated Jul 26, 2025

yang31210999/Qwen3-0.6B-AWQ-2b

0.6B • Updated Jul 26, 2025 • 1

yang31210999/Shadow-Checkpoints

Updated Jul 26, 2025

yang31210999/Llama3.1-1B-Neo-BAAI-1000k

Text Generation • 2B • Updated Feb 28, 2025 • 8 • 2

yang31210999/Llama-3.1-Minitron-4B-Depth-Neo-BAAI-100k

Text Generation • 5B • Updated Feb 28, 2025 • 7 • 1

yang31210999/Llama-3.2-1B-Instruct-Neo-BAAI-10k

Text Generation • 1B • Updated Feb 28, 2025 • 3

yang31210999/H200-pile-0.01-15-10-5-neo-rank64-lr2e-4

Updated Oct 23, 2024

yang31210999/1023-eval-matmulfree-370M-ckpt27

Updated Oct 23, 2024
  • Previous
  • 1
  • ...
  • 3
  • 4
  • 5
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs