Barış Deniz Sağlam

bdsaglam

AI & ML interests

language models, reinforcement learning

Recent Activity

upvoted an article about 2 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

liked a Space 2 months ago

AdithyaSK/rl-environments-guide

upvoted an article 4 months ago

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Organizations

None yet

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 169

upvoted an article 4 months ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Weyaxi • Jan 2 • 23

upvoted 2 articles over 1 year ago

Article

Training and Finetuning Reranker Models with Sentence Transformers

tomaarsen • Mar 26, 2025 • 195

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

chansung • Aug 22, 2024 • 13

upvoted a paper over 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

upvoted 2 articles over 1 year ago

Article

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

Kseniase • Feb 13, 2025 • 18

Article

🦸🏻#11: How Do Agents Plan and Reason?

Kseniase • Feb 24, 2025 • 17
