Yulianghua

lianghua

AI & ML interests

None yet

Recent Activity

upvoted an article 9 days ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

liked a Space 4 months ago

HuggingFaceTB/smol-training-playbook

liked a model 6 months ago

meituan-longcat/LongCat-Flash-Chat

Organizations

upvoted an article 9 days ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

liked a Space 4 months ago

The Smol Training Playbook

📚

3.04k

The secrets to building world-class LLMs

liked a model 6 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 34.4k • 527

upvoted an article 8 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025 • 764

liked 3 Spaces 9 months ago

The Ultimate Guide to LLM Training | The Ultra-Scale Playbook

🔥

262

Learn about every aspect of LLM training

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training

The Ultra-Scale Playbook

🌌

3.74k

The ultimate guide to training LLMs on large GPU clusters

upvoted an article 10 months ago

Article

DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background

Feb 28, 2025

upvoted an article about 1 year ago

Article

Open R1: Update #3

Mar 11, 2025 • 297

liked a model over 1 year ago

TencentBAC/Conan-embedding-v1

0.3B • Updated Nov 27, 2024 • 323k • 166

upvoted a collection almost 2 years ago

Zephyr ORPO

Collection

Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12, 2024 • 18

liked a model almost 2 years ago

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 3.09M • 6.49k

liked a dataset almost 2 years ago

Open-Orca/OpenOrca

Viewer • Updated Feb 19, 2025 • 2.94M • 17.1k • 1.51k

liked a Space about 2 years ago

LLaMA Board

🦙

216

Fine-tuning large language model with Gradio UI

liked 5 models about 2 years ago

BAAI/bge-m3
