End-to-end speaker segmentation for overlap-aware resegmentation Paper • 2104.04045 • Published Apr 8, 2021 • 2
Training Datasets Collection A collection of pseudo-labelled datasets used to train the Distil-Whisper model. • 9 items • Updated Mar 21, 2024 • 14
Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context +6 Jul 23, 2024 • 241
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 171
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 118
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM • 9 items • Updated Sep 26, 2024 • 57
Gemma 2 2B Release Collection The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 10 • 81
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Paper • 2403.16627 • Published Mar 25, 2024 • 22
DBRX Collection DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 96
Orca 2: Teaching Small Language Models How to Reason Paper • 2311.11045 • Published Nov 18, 2023 • 77
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning Paper • 2301.13688 • Published Jan 31, 2023 • 9