Shouren Wang
PRO
ShourenWSR
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
Updated a model 5 days ago: ShourenWSR/Qwen3-8B-Base-Instruct-Stage1-to-be-deleted
Published a model 5 days ago: ShourenWSR/Qwen3-8B-Base-Instruct-Stage1-to-be-deleted
Updated a model 5 days ago: ShourenWSR/Qwen3-4B-Base-Instruct-Stage2-Superior-65k-27k-to-be-deleted
Organizations
None yet
ShourenWSR's models (72)
Sort: Recently updated
ShourenWSR/Qwen3-8B-Base-Instruct-Stage1-to-be-deleted • 308k • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Base-Instruct-Stage2-Superior-65k-27k-to-be-deleted • 7B • Updated 5 days ago • 11
ShourenWSR/Qwen3-4B-Base-Instruct-Stage2-Superior-27k-27k-to-be-deleted • 7B • Updated 5 days ago • 13
ShourenWSR/Qwen3-4B-Base-Instruct-Stage1-to-be-deleted • 4B • Updated 5 days ago • 10
ShourenWSR/Qwen3-4B-V2-Superior-Hybrid-30k • 7B • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Base-Superior-65k-27k • 7B • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Instruct-Superior-65k-27k • 7B • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Instruct-Superior-27k-27k • 7B • Updated 5 days ago • 11
ShourenWSR/Qwen3-4B-Instruct-NaiveMix-140k • 7B • Updated 5 days ago • 11
ShourenWSR/Qwen3-4B-Base-NaiveMix-140k • 7B • Updated 5 days ago • 11
ShourenWSR/Qwen3-4B-NaiveMix-140k • 7B • Updated 5 days ago • 11
ShourenWSR/Qwen3-4B-Baseline-2Phase-Superior-65k-27k-Phase2 • 196k • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Baseline-2Phase-Superior-65k-27k-Phase1 • 196k • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Baseline-2Phase-NaiveMix-140k-Phase2 • 196k • Updated 5 days ago • 12
ShourenWSR/Qwen3-4B-Baseline-2Phase-NaiveMix-140k-Phase1 • 196k • Updated 5 days ago • 11
ShourenWSR/Qwen2.5-7B-PL-MoE-Superior-27k-27k-v2 • 13B • Updated Mar 30 • 2
ShourenWSR/Qwen3-4B-PL-MoE-V2-Superior-Hybrid-8k • Feature Extraction • 7B • Updated Mar 30 • 3
ShourenWSR/Qwen2.5-7B-PL-MoE-Superior-27k-27k-ckpt2400 • 13B • Updated Mar 30 • 2
ShourenWSR/Qwen3-4B-PL-MoE-Initialized-V2 • 7B • Updated Mar 29 • 2
ShourenWSR/Qwen3-4B-PL-MoE-Superior-27k-27k-v2 • Feature Extraction • 7B • Updated Mar 29 • 2
ShourenWSR/Qwen3-4B-PL-MoE-Superior-27k-27k-ckpt2400 • 7B • Updated Mar 28 • 2
ShourenWSR/Phi4-Mini-PL-MoE-Superior-27k-27k • Feature Extraction • 6B • Updated Mar 28 • 9
ShourenWSR/Qwen3-4B-PL-MoE-Superior-27k-27k • Feature Extraction • 7B • Updated Mar 25 • 8
ShourenWSR/Phi4-Mini-PL-MoE-Initialized • 6B • Updated Mar 23 • 2
ShourenWSR/Qwen3-8B-PL-MoE-Initialized • 14B • Updated Mar 23 • 1
ShourenWSR/Qwen3-4B-PL-MoE-Initialized • 7B • Updated Mar 22 • 4
ShourenWSR/Qwen3-4B-Base-Instruct-PL-MoE-Initialized • 7B • Updated Mar 20 • 2
ShourenWSR/Qwen3-4B-Base-Instruct • Text Generation • 4B • Updated Mar 20 • 1
ShourenWSR/Qwen3-4B-Base-PL-MoE-Initialized • 7B • Updated Mar 19 • 1
ShourenWSR/LLaMA3.1-8B-PL-MoE-Initialized • 14B • Updated Mar 19 • 12