Amirhossein Azhinehfar PRO

AmirHaz · AmirHaz220

AI & ML interests

None yet

Organizations

upvoted an article 10 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas, +2 • Dec 9, 2022 • 417

upvoted a paper about 1 year ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

upvoted a collection over 1 year ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated Dec 24, 2025 • 244