Articles:
- KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)
- Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation (Sep 16, 2025)
- You could have designed state of the art positional encoding (Nov 25, 2024)

Space (Running): The Ultra-Scale Playbook 🌌, the ultimate guide to training LLMs on large GPU clusters