Ruthwik

rusty263

AI & ML interests

None yet

Recent Activity

liked a model 25 days ago

nvidia/LocateAnything-3B

updated a collection 5 months ago

Datasets

Organizations

None yet

upvoted an article 7 months ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

ariG23498 • Jan 19, 2025 • 53

upvoted 5 articles about 1 year ago

Article

Exploring Quantization Backends in Diffusers

derekl35, marcsun13, sayakpaul • May 21, 2025 • 45

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 260

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

derekl35, marcsun13, sayakpaul, merve, linoyts • Jun 19, 2025 • 106

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

edbeeching, kashif, ybelkada, lewtun, lvwerra, nazneen, natolambert • Apr 5, 2023 • 48

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

kashif, edbeeching, lewtun, lvwerra, osanseviero • Jan 18, 2024 • 84

upvoted an article about 2 years ago

Article

Vision Language Models Explained

merve, edbeeching • Apr 11, 2024 • 538
