10 1

Ning Xie

andyxning

andyxning

AI & ML interests

Inference

Recent Activity

upvoted an article 1 day ago

Mixture of Experts Explained

liked a Space 1 day ago

nanotron/ultrascale-playbook

upvoted an article 8 months ago

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

View all activity

Organizations

None yet

upvoted an article 1 day ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.14k

upvoted 3 articles 8 months ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

jsulz, yuchenglow, znation, saba9

•

Feb 12, 2025

• 81

Article

XetHub is joining Hugging Face!

yuchenglow, julien-c

•

Aug 8, 2024

• 117

Article

From Files to Chunks: Improving HF Storage Efficiency

jsulz, erinys

•

Nov 20, 2024

• 73

upvoted an article 9 months ago

Article

Deploy LLMs with Hugging Face Inference Endpoints

philschmid

•

Jul 4, 2023

• 17

upvoted 2 articles 12 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

joaogante

•

May 11, 2023

• 79

Article

How to generate text: using different decoding methods for language generation with Transformers

patrickvonplaten

•

Mar 1, 2020

• 299

upvoted an article about 1 year ago

Article

Welcome to Inference Providers on the Hub 🔥

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c

•

Jan 28, 2025

• 494

Ning Xie

AI & ML interests

Recent Activity

Organizations

andyxning's activity

Mixture of Experts Explained

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

XetHub is joining Hugging Face!

From Files to Chunks: Improving HF Storage Efficiency

Deploy LLMs with Hugging Face Inference Endpoints

Assisted Generation: a new direction toward low-latency text generation

How to generate text: using different decoding methods for language generation with Transformers

Welcome to Inference Providers on the Hub 🔥