4 7 6

Daru Okta Buana

daruokta

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

updated a dataset 9 days ago

daruokta/t5gemma2-indonesia-chat-formatted

upvoted an article 12 days ago

The Five Technologies driving AI 3.0

View all activity

Organizations

upvoted an article 4 days ago

Article

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

karina-zadorozhny

•

Jan 19

• 30

updated a dataset 9 days ago

daruokta/t5gemma2-indonesia-chat-formatted

Viewer • Updated 9 days ago • 39.8k • 219 • 1

upvoted 2 articles 12 days ago

Article

The Five Technologies driving AI 3.0

marcusinthesky

•

15 days ago

• 2

Article

Models are Markup, Tokens are Features...

marcusinthesky

•

14 days ago

• 1

updated a model 14 days ago

daruokta/t5gemma-2-1b-1b-instruct-chat-indo-v2

Text Generation • Updated 13 days ago

published a model 14 days ago

daruokta/t5gemma-2-1b-1b-instruct-chat-indo-v2

Text Generation • Updated 13 days ago

updated a model 14 days ago

daruokta/t5gemma-2-1b-1b-instruct-chat-indo-v2-exp

Text Generation • Updated 13 days ago • 5

published a model 14 days ago

daruokta/t5gemma-2-1b-1b-instruct-chat-indo-v2-exp

Text Generation • Updated 13 days ago • 5

New activity in google/t5gemma-2-270m-270m 23 days ago

Training Scripts

#3 opened 6 months ago by

khatrimann

liked a model about 2 months ago

froggeric/Qwen-Fixed-Chat-Templates

Updated 21 days ago • 607

upvoted an article about 2 months ago

Article

🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷

RDTvlokip

•

May 5

• 6

liked 2 datasets about 2 months ago

LorthGyu/indonesian-chat

Viewer • Updated Mar 8 • 200 • 29 • 1

riynkk/openai-chat-id-evol-instruct

Viewer • Updated Jan 19 • 59k • 6 • 1

published a dataset 2 months ago

daruokta/t5gemma2-indonesia-chat-formatted

Viewer • Updated 9 days ago • 39.8k • 219 • 1

updated a Space 2 months ago

ml-intern sandbox

🌍

upvoted an article 2 months ago

Article

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Nicolas-BZRD

•

Apr 7

• 28

upvoted an article 3 months ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 62

New activity in sol-r/t5gemma-2-4b-4b-instruct-control 3 months ago

Training code

#1 opened 3 months ago by

daruokta

published a model 3 months ago

daruokta/t5gemma2-v4-stage2-checkpoints

0.7B • Updated Mar 23 • 4

updated a model 3 months ago

daruokta/t5gemma2-v4-stage2-checkpoints

0.7B • Updated Mar 23 • 4

Daru Okta Buana

AI & ML interests

Recent Activity

Organizations

daruokta's activity

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

The Five Technologies driving AI 3.0

Models are Markup, Tokens are Features...

Training Scripts

🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷

ml-intern sandbox

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Multimodal Embedding & Reranker Models with Sentence Transformers

Training code