Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Masayuki Yamada
PRO
beachcities
Follow
0 followers
·
1 following
beachcities
masayukiyamada
AI & ML interests
None yet
Recent Activity
updated
a model
about 20 hours ago
beachcities/qwen3-4b-sft-v5a-hybrid-merged
published
a model
about 20 hours ago
beachcities/qwen3-4b-sft-v5a-hybrid-merged
updated
a model
about 21 hours ago
beachcities/qwen3-4b-sft-v5-hybrid
View all activity
Organizations
None yet
beachcities
's models
24
Sort: Recently updated
beachcities/qwen3-4b-sft-v5a-hybrid-merged
4B
•
Updated
about 20 hours ago
•
10
beachcities/qwen3-4b-sft-v5-hybrid
Text Generation
•
Updated
about 21 hours ago
•
5
beachcities/qwen3-4b-sft-v5-hybrid-merged
4B
•
Updated
about 21 hours ago
•
4
beachcities/qwen3-4b-sft-v4a-dpo
Text Generation
•
4B
•
Updated
1 day ago
•
28
beachcities/qwen3-4b-sft-v4a-masked
Text Generation
•
Updated
1 day ago
•
13
beachcities/qwen3-4b-sft-v4a-masked-merged
4B
•
Updated
1 day ago
•
23
beachcities/qwen3-4b-sft-v3a-lr5e5-merged
Text Generation
•
4B
•
Updated
1 day ago
•
45
beachcities/qwen3-4b-sft-v3b-emptythink-merged
4B
•
Updated
1 day ago
•
11
beachcities/qwen3-4b-sft-v3a-lr5e5
Text Generation
•
Updated
2 days ago
•
26
beachcities/qwen3-4b-sft-dpo-v25mix-structeval
Text Generation
•
4B
•
Updated
4 days ago
•
53
beachcities/qwen3-4b-sft-v25mix-structeval
Text Generation
•
Updated
4 days ago
•
14
beachcities/qwen3-4b-sft-dpo-v3-structeval
Text Generation
•
4B
•
Updated
4 days ago
•
29
beachcities/qwen3-4b-sft-dpo-v2mix-structeval
Text Generation
•
4B
•
Updated
5 days ago
•
27
beachcities/qwen3-4b-sft-v2mix-structeval
Text Generation
•
Updated
5 days ago
•
15
beachcities/qwen3-4b-sft-dpo-v2-structeval
Text Generation
•
4B
•
Updated
5 days ago
•
87
beachcities/qwen3-4b-sft-v4-structeval
Text Generation
•
Updated
5 days ago
•
26
beachcities/qwen3-4b-sft-dpo-structeval
Text Generation
•
4B
•
Updated
5 days ago
•
33
beachcities/qwen3-4b-sft-structeval
Text Generation
•
Updated
7 days ago
•
28
beachcities/Qwen3-4B-DPO-Final-Day3
Text Generation
•
4B
•
Updated
8 days ago
•
9
beachcities/planB-merged-qwen3-4b-2507
4B
•
Updated
8 days ago
•
12
beachcities/ppo-BipedalWalker-v3-A100-SOTA
Reinforcement Learning
•
Updated
Dec 19, 2025
beachcities/ppo-LunarLander-v3-A100-SOTA
Reinforcement Learning
•
Updated
Dec 18, 2025
beachcities/ppo-LunarLander-v2-expert
Reinforcement Learning
•
Updated
Dec 18, 2025
beachcities/ppo-LunarLander-v3
Reinforcement Learning
•
Updated
Dec 18, 2025