Hugging Face
Takayama
kazuyamaa
1 follower · 1 following
kazukitakayamas
AI & ML interests
None yet
Recent Activity
Updated a model about 2 months ago: kazuyamaa/alfworld-lambda-grpo-v004
Published a model about 2 months ago: kazuyamaa/alfworld-lambda-grpo-v004
Updated a model about 2 months ago: kazuyamaa/alfworld-lambda-grpo-v002-hull
Organizations
None yet
kazuyamaa's models: 46
kazuyamaa/Qwen3-4B-PPO-3000data-v1-Full • Text Generation • 8B • Updated Nov 23, 2025 • 2
kazuyamaa/Qwen3-4B-PPO-3000data-v1 • Reinforcement Learning • Updated Nov 23, 2025 • 3
kazuyamaa/Qwen3-4B-reward-v1 • Updated Nov 20, 2025
kazuyamaa/Qwen3-8B-Math-gspo_v1 • Text Generation • 8B • Updated Oct 28, 2025 • 2
kazuyamaa/Qwen3-8B-Math-GRPO_v2 • Text Generation • 8B • Updated Oct 19, 2025 • 1
kazuyamaa/Qwen3-8B-Math-GRPO • Text Generation • 8B • Updated Oct 19, 2025 • 1
kazuyamaa/DeepSeek-R1-Distill-Qwen-32B-axolotl-sft-v1.0 • Updated Jul 14, 2025 • 4
kazuyamaa/Qwen2.5-3B-Instruct-GRPO-v002 • Text Generation • 3B • Updated May 7, 2025 • 1
kazuyamaa/Qwen2.5-3B-Instruct-GRPO-v001 • Text Generation • 2B • Updated May 6, 2025 • 1
kazuyamaa/gemma3-1b-GRPO-inst • Updated Apr 26, 2025
kazuyamaa/code-trans-gemma-2-2b-dpo • Text Generation • 3B • Updated Mar 22, 2025 • 2
kazuyamaa/code-trans-gemma-2-2b-sft • Text Generation • 3B • Updated Mar 22, 2025 • 3
kazuyamaa/gemma-2-2b-code-translate-dpo-merged • 3B • Updated Mar 21, 2025
kazuyamaa/gemma-2-2b-sft-merged • 3B • Updated Mar 20, 2025
kazuyamaa/DeepSeek-R1-Distill-Qwen-14B-axolotl-int-v1.0-merged • 15B • Updated Mar 15, 2025
kazuyamaa/llm-jp-3-13b_r128_int_20241209_2 • Updated Dec 23, 2024