WebOrganizer/Corpus-200B
Preview • Updated • 5.12k • 11
How to use allenai/dolma3-fasttext-weborganizer-topic-classifier with fastText:
from huggingface_hub import hf_hub_download
import fasttext
model = fasttext.load_model(hf_hub_download("allenai/dolma3-fasttext-weborganizer-topic-classifier", "model.bin"))A model trained to assign a document to one of the 24 Web Organizer topics. Part of the Dolma 3 pipeline.