vumichien/ettin-encoder-32m-arxiv-classification-512 Text Classification β’ 32M β’ Updated 16 days ago β’ 34
vumichien/ettin-encoder-32m-arxiv-classification-512 Text Classification β’ 32M β’ Updated 16 days ago β’ 34
vumichien/ettin-encoder-32m-arxiv-classification-8192 Text Classification β’ 32M β’ Updated 16 days ago β’ 27
vumichien/ettin-encoder-32m-arxiv-classification-8192 Text Classification β’ 32M β’ Updated 16 days ago β’ 27
vumichien/ettin-encoder-32m-imdb-sentiment Text Classification β’ 32M β’ Updated 16 days ago β’ 31
vumichien/ettin-encoder-32m-imdb-sentiment Text Classification β’ 32M β’ Updated 16 days ago β’ 31
Sleeping Agents 421 Whisper Speaker Diarization π 421 Generate speakerβlabeled transcripts from video or audio
vumichien/wav2vec2-xls-r-300m-japanese Automatic Speech Recognition β’ Updated Feb 7, 2023 β’ 4 β’ 1
vumichien/wav2vec2-xls-r-300m-japanese Automatic Speech Recognition β’ Updated Feb 7, 2023 β’ 4 β’ 1
vumichien/wav2vec2-xls-r-300m-japanese-large Automatic Speech Recognition β’ Updated May 5, 2022 β’ 2
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook π 3.22k The secrets to building world-class LLMs
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper β’ 2509.25531 β’ Published Sep 29, 2025 β’ 10