The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper β’ 2509.26507 β’ Published Sep 30, 2025 β’ 547
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism Paper β’ 1909.08053 β’ Published Sep 17, 2019 β’ 5
EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts Paper β’ 2408.12226 β’ Published Aug 22, 2024
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns? Paper β’ 2407.00996 β’ Published Jul 1, 2024
Running on Zero Featured 482 Llama 2 7B Chat π 482 Chat with the Llamaβ2 7B model for instant AI responses
anjulRajendraSharma/wav2vec2-indian-english Automatic Speech Recognition β’ Updated Jan 12, 2024 β’ 1 β’ 1
Running on Zero Featured 2.7k Whisper π 2.7k Transcribe audio or YouTube videos into text with Whisper
openai/whisper-medium Automatic Speech Recognition β’ 0.8B β’ Updated Feb 29, 2024 β’ 518k β’ 275