-
ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF
Text Generation • 0.5B • Updated • 853 • 7 -
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF
Text Generation • 2B • Updated • 4.62k • 14 -
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF
Text Generation • 3B • Updated • 3.76k • 7 -
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF
Text Generation • 8B • Updated • 4.17k • 8
Collections
Discover the best community collections!
Collections trending this week
-
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Text Generation • 18B • Updated • 58.5k • 498 -
DavidAU/Gemma-The-Writer-N-Restless-Quill-10B-Uncensored-GGUF
Text Generation • 10B • Updated • 11.8k • 131 -
DavidAU/Llama-3.2-8X4B-MOE-V2-Dark-Champion-Instruct-uncensored-abliterated-21B-GGUF
Text Generation • 21B • Updated • 7.4k • 113 -
DavidAU/L3-Stheno-Maid-Blackroot-Grand-HORROR-16B-GGUF
Text Generation • 17B • Updated • 2.12k • 86
-
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Paper • 2406.11271 • Published • 21 -
mlfoundations/MINT-1T-HTML
Viewer • Updated • 623M • 35k • 91 -
mlfoundations/MINT-1T-ArXiv
Viewer • Updated • 5.6M • 3.17k • 55 -
mlfoundations/MINT-1T-PDF-CC-2024-18
Updated • 47.8k • 22
-
Whisper
📉2.72kTranscribe audio or YouTube video to text
-
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 51 -
openai/whisper-large-v2
Automatic Speech Recognition • Updated • 80k • 1.78k -
openai/whisper-large
Automatic Speech Recognition • Updated • 102k • 532
-
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series
Paper • 2401.03955 • Published • 12 -
ibm-granite/granite-timeseries-ttm-r1
Time Series Forecasting • 805k • Updated • 2.41M • 321 -
ibm-granite/granite-timeseries-patchtst
Time Series Forecasting • 616k • Updated • 9.18k • 18 -
ibm-granite/granite-timeseries-patchtsmixer
Time Series Forecasting • 196k • Updated • 366 • 21
-
Adaptive Weighting in Knowledge Distillation: An Axiomatic Framework for Multi-Scale Teacher Ensemble Optimization
Paper • 2601.17910 • Published -
Hallucinations Live in Variance
Paper • 2601.07058 • Published -
Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement
Paper • 2601.13100 • Published -
Multi-Teacher Ensemble Distillation: A Mathematical Framework for Probability-Domain Knowledge Aggregation
Paper • 2601.09165 • Published
-
ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF
Text Generation • 0.5B • Updated • 853 • 7 -
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF
Text Generation • 2B • Updated • 4.62k • 14 -
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF
Text Generation • 3B • Updated • 3.76k • 7 -
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF
Text Generation • 8B • Updated • 4.17k • 8
-
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Text Generation • 18B • Updated • 58.5k • 498 -
DavidAU/Gemma-The-Writer-N-Restless-Quill-10B-Uncensored-GGUF
Text Generation • 10B • Updated • 11.8k • 131 -
DavidAU/Llama-3.2-8X4B-MOE-V2-Dark-Champion-Instruct-uncensored-abliterated-21B-GGUF
Text Generation • 21B • Updated • 7.4k • 113 -
DavidAU/L3-Stheno-Maid-Blackroot-Grand-HORROR-16B-GGUF
Text Generation • 17B • Updated • 2.12k • 86
-
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Paper • 2406.11271 • Published • 21 -
mlfoundations/MINT-1T-HTML
Viewer • Updated • 623M • 35k • 91 -
mlfoundations/MINT-1T-ArXiv
Viewer • Updated • 5.6M • 3.17k • 55 -
mlfoundations/MINT-1T-PDF-CC-2024-18
Updated • 47.8k • 22
-
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series
Paper • 2401.03955 • Published • 12 -
ibm-granite/granite-timeseries-ttm-r1
Time Series Forecasting • 805k • Updated • 2.41M • 321 -
ibm-granite/granite-timeseries-patchtst
Time Series Forecasting • 616k • Updated • 9.18k • 18 -
ibm-granite/granite-timeseries-patchtsmixer
Time Series Forecasting • 196k • Updated • 366 • 21
-
Whisper
📉2.72kTranscribe audio or YouTube video to text
-
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 51 -
openai/whisper-large-v2
Automatic Speech Recognition • Updated • 80k • 1.78k -
openai/whisper-large
Automatic Speech Recognition • Updated • 102k • 532
-
Adaptive Weighting in Knowledge Distillation: An Axiomatic Framework for Multi-Scale Teacher Ensemble Optimization
Paper • 2601.17910 • Published -
Hallucinations Live in Variance
Paper • 2601.07058 • Published -
Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement
Paper • 2601.13100 • Published -
Multi-Teacher Ensemble Distillation: A Mathematical Framework for Probability-Domain Knowledge Aggregation
Paper • 2601.09165 • Published