Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
419.2
TFLOPS
Yaowei Zheng
hiyouga
93
77
184
Follow
nwkcahugfc's profile picture
TinyCici's profile picture
sauh811's profile picture
3,561 followers
·
38 following
https://github.com/hiyouga
code_hiyouga
hiyouga
hiyouga
AI & ML interests
LLM Training System
Recent Activity
posted
an
update
26 days ago
Follow my X account — I'll be sharing thoughts and findings on building open-source AI Agent projects, Agent Memory, and Observability. Thanks for connecting! https://x.com/code_hiyouga
liked
a dataset
28 days ago
DEVAI-benchmark/DEVAI
updated
a Space
about 2 months ago
hiyouga/LLaMA-Board
View all activity
Organizations
hiyouga
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
28 days ago
DEVAI-benchmark/DEVAI
Preview
•
Updated
Oct 24, 2024
•
423
•
22
liked
a dataset
8 months ago
neulab/agent-data-collection
Preview
•
Updated
Mar 9
•
3.24k
•
114
liked
2 models
10 months ago
microsoft/VibeVoice-1.5B
Text-to-Speech
•
3B
•
Updated
Jan 22
•
237k
•
2.41k
internlm/Intern-S1-mini
Image-Text-to-Text
•
9B
•
Updated
Mar 29
•
7.82k
•
115
liked
a dataset
11 months ago
nvidia/Llama-Nemotron-VLM-Dataset-v1
Viewer
•
Updated
Oct 22, 2025
•
2.86M
•
3.32k
•
166
liked
2 models
11 months ago
janhq/Jan-v1-4B
Text Generation
•
4B
•
Updated
Aug 23, 2025
•
1.15k
•
•
355
openbmb/MiniCPM-V-4
Image-Text-to-Text
•
4B
•
Updated
Sep 15, 2025
•
218k
•
465
liked
a dataset
11 months ago
allenai/WildChat-4.8M
Viewer
•
Updated
Aug 11, 2025
•
3.2M
•
65.8k
•
164
liked
a model
11 months ago
openai/gpt-oss-20b
Text Generation
•
22B
•
Updated
Aug 26, 2025
•
7M
•
•
4.74k
liked
a dataset
11 months ago
JT-LM/JIUTIAN-TReB
Updated
13 days ago
•
36
•
4
liked
a Space
12 months ago
Runtime error
19
Megatron Memory Estimator
👁
19
Estimate GPU memory usage for Megatron models
liked
a model
12 months ago
moonshotai/Kimi-K2-Instruct
Text Generation
•
1T
•
Updated
Apr 23
•
483k
•
•
2.37k
liked
a dataset
12 months ago
data-for-agents/insta-150k-v3
Viewer
•
Updated
May 28, 2025
•
146k
•
449
•
16
liked
a model
12 months ago
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text
•
10B
•
Updated
Oct 25, 2025
•
585k
•
776
liked
a dataset
about 1 year ago
Saigyouji-Yuyuko1000/dapo17k
Viewer
•
Updated
Jun 23, 2025
•
17.9k
•
211
•
2
liked
2 models
about 1 year ago
reducto/RolmOCR
Image-Text-to-Text
•
8B
•
Updated
Apr 2, 2025
•
232k
•
587
nanonets/Nanonets-OCR-s
Image-Text-to-Text
•
4B
•
Updated
Jun 20, 2025
•
17.1k
•
1.59k
liked
a dataset
about 1 year ago
open-thoughts/OpenThoughts3-1.2M
Viewer
•
Updated
Jun 9, 2025
•
1.2M
•
18.5k
•
244
liked
2 models
about 1 year ago
open-thoughts/OpenThinker3-7B
Text Generation
•
8B
•
Updated
Jun 9, 2025
•
63.4k
•
•
136
ByteDance-Seed/BAGEL-7B-MoT
Any-to-Any
•
15B
•
Updated
Jan 9
•
886
•
1.21k
Load more