view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 28 days ago β’ 63
meta-llama/Llama-4-Scout-17B-16E-Instruct Any-to-Any β’ 109B β’ Updated May 22, 2025 β’ 243k β’ 1.17k
deepseek-ai/DeepSeek-V3-0324 Text Generation β’ 685B β’ Updated Mar 27, 2025 β’ 215k β’ β’ 3.08k
cross-encoder/ms-marco-MiniLM-L6-v2 Text Ranking β’ 22.7M β’ Updated Aug 29, 2025 β’ 4.79M β’ 177
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity β’ 22.7M β’ Updated Mar 6, 2025 β’ 144M β’ β’ 4.29k