27 5

Austin Liu

Austin362667

austin362667

AI & ML interests

None yet

Recent Activity

upvoted an article 1 day ago

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

upvoted an article about 1 month ago

Understanding Vector Quantization in VQ-VAE

upvoted an article about 2 months ago

Pallas for people who know JAX but not kernels yet

View all activity

Organizations

upvoted an article 1 day ago

Article

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

nvidia

•

2 days ago

• 26

upvoted an article about 1 month ago

Article

Understanding Vector Quantization in VQ-VAE

ariG23498

•

Aug 28, 2024

• 64

upvoted an article about 2 months ago

Article

Pallas for people who know JAX but not kernels yet

ariG23498

•

Apr 29

• 21

updated a dataset about 2 months ago

Austin362667/fineweb10B_sp8192

Updated Apr 29 • 12

published a dataset about 2 months ago

Austin362667/fineweb10B_sp8192

Updated Apr 29 • 12

upvoted an article 2 months ago

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 72

upvoted a paper 3 months ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 56

updated 3 models 3 months ago

published a model 3 months ago

Austin362667/Qwen3-0.6B-MLX-bf16-python-5k-alpaca-resampled-Qwen-4B

Text Generation • 0.6B • Updated Mar 16 • 36 •

updated a dataset 3 months ago

Austin362667/python_code_instructions_5k_alpaca_qwen3_4B_resampled

Viewer • Updated Mar 15 • 5k • 9

published a dataset 3 months ago

Austin362667/python_code_instructions_5k_alpaca_qwen3_4B_resampled

Viewer • Updated Mar 15 • 5k • 9

published 2 models 3 months ago

Austin362667/Qwen3-1.7B-MLX-bf16-python-18k-alpaca

Text Generation • 2B • Updated Mar 16 • 79 •

Austin362667/Qwen3-0.6B-MLX-bf16-python-18k-alpaca

Text Generation • 0.6B • Updated Mar 16 • 37 •

updated a dataset 3 months ago

Austin362667/python_code_instructions_5_alpaca_qwen3_4B_resampled

Viewer • Updated Mar 15 • 5.01k • 10

published a dataset 4 months ago

Austin362667/python_code_instructions_5_alpaca_qwen3_4B_resampled

Viewer • Updated Mar 15 • 5.01k • 10

upvoted 2 articles 4 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

joaogante

•

May 11, 2023

• 79

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

ybelkada, timdettmers

•

Aug 17, 2022

• 136

upvoted a collection 4 months ago

SiliconMind-V1

Collection

5 items • Updated 9 days ago • 2

Austin Liu

AI & ML interests

Recent Activity

Organizations

Austin362667's activity

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

Understanding Vector Quantization in VQ-VAE

Pallas for people who know JAX but not kernels yet

The PR you would have opened yourself

Assisted Generation: a new direction toward low-latency text generation

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes