Tung-Lin Wu

tunglinwood

AI & ML interests

None yet

Organizations

None yet

upvoted a collection about 13 hours ago

Gemma 4

12 items • Updated 15 days ago • 827

New activity in moonshotai/Kimi-Audio-7B-Instruct 3 months ago

Add Kimi-Audio EOS and pad token ids

#20 opened 3 months ago by

updated a model 3 months ago

tunglinwood/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated Feb 12 • 5

New activity in ResembleAI/chatterbox-turbo 5 months ago

Supported languages

#3 opened 5 months ago by

liked a model 5 months ago

ResembleAI/chatterbox-turbo

Text-to-Speech • Updated Dec 15, 2025 • 648

upvoted an article 5 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato +1 • Nov 25, 2025 • 389

New activity in tencent/KaLM-Embedding-Gemma3-12B-2511 6 months ago

Rerank

#5 opened 6 months ago by

upvoted a collection 9 months ago

DeepSeek-V3.1

3 items • Updated Mar 2 • 262

New activity in deepseek-ai/DeepSeek-R1-0528 12 months ago

Do you have deepseek-r1-0528-awq plan?

#68 opened 12 months ago by

upvoted 2 collections about 1 year ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.79k

GLM-4-0414

GLM-4-0414 series model • 6 items • Updated Mar 2 • 135

upvoted 2 papers about 1 year ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3, 2025 • 225

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11, 2025 • 10

upvoted a collection about 1 year ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 6 items • Updated Mar 2 • 166

published a Space about 1 year ago

Chatui

liked a Space about 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper about 1 year ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 145

upvoted an article about 1 year ago

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen • May 28, 2024 • 275

liked a model about 1 year ago

PharMolix/BioMedGPT-R1

Updated Mar 26, 2025 • 4 • 17

liked a Space about 1 year ago

GAIA Leaderboard

Submit and score your model on the GAIA benchmark