text8-dataset / dataset_info.json
{
"name": "text8",
"description": "text8: the first 100 MB (10^8 bytes) of a cleaned English Wikipedia dump, widely used as a corpus for training word embeddings",
"source": "http://mattmahoney.net/dc/text8.zip",
"license": "Public domain",
"format": "text",
"usage": "Use for training word embeddings and language models"
}