Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
Barış Deniz Sağlam
bdsaglam
Follow
0 followers
·
2 following
bdsaglam
AI & ML interests
language models, reinforcement learning
Organizations
None yet
bdsaglam
's models
217
Sort: Recently updated
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250402_102637
Updated
Apr 2
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250401_104917
Updated
Apr 2
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250401_104703
Updated
Apr 1
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250401_104358
Updated
Apr 1
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250330_182708
Updated
Mar 30
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250330_131151
Updated
Mar 30
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250330_131059
Updated
Mar 30
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250330_130458
Updated
Mar 30
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250330_122351
Updated
Mar 30
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250329_222430
Updated
Mar 30
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged-ragent-grpo-20250329_195339
Updated
Mar 29
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250329_163553
Updated
Mar 29
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250329_161619
Updated
Mar 29
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250329_155340
Updated
Mar 29
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250329_155027
Updated
Mar 29
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique
Updated
Mar 29
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-merged
8B
•
Updated
Mar 26
•
6
bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-musique-scaled
Updated
Mar 24
bdsaglam/Qwen2.5-1.5B-Instruct-ragent-musique-ragent-grpo-musique
Updated
Mar 23
bdsaglam/Qwen2.5-1.5B-Instruct-ragent-musique
2B
•
Updated
Mar 12
•
6
bdsaglam/Qwen2.5-1.5B-Instruct-musique-grpo
Updated
Mar 12
•
3
bdsaglam/Qwen2.5-1.5B-Instruct-raga-gsm8k-grpo
2B
•
Updated
Mar 10
•
5
bdsaglam/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-calc
Updated
Mar 7
bdsaglam/Meta-Llama-3-8B-Instruct-GRPO-merged
Updated
Feb 27
bdsaglam/Meta-Llama-3-8B-Instruct-GRPO
Updated
Feb 27
bdsaglam/Qwen2.5-1.5B-Instruct-GRPO-merged
Text Generation
•
2B
•
Updated
Feb 26
•
8
bdsaglam/Qwen2.5-1.5B-Instruct-GRPO
2B
•
Updated
Feb 26
•
7
bdsaglam/erx-llama-3-8b-tiny
Updated
Feb 19
•
5
bdsaglam/erx-llama-3-8b-low
Updated
Feb 19
•
7
bdsaglam/erx-llama-3-8b
Updated
Feb 16
•
4
Previous
1
2
3
4
5
6
...
8
Next