Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
huggingface
/
data-measurements-tool
like
100
Running
App
Files
Files
Community
7
Fetching metadata from the HF Docker repository...
refs/pr/8
data-measurements-tool
/
cache_dir
/
HuggingFaceM4
/
OBELICS_default_train_text
2.09 GB
Ctrl+K
Ctrl+K
14 contributors
History:
1 commit
Ezi Ozoani
All changes minus temp.jsonl from last 3 commits
3d4f393
almost 3 years ago
associations
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
base_dset
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
lengths
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
text_dset
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
text_duplicates
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
.DS_Store
8.2 kB
xet
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
general_stats_dict.json
117 Bytes
xet
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
sorted_top_vocab.json
8.11 kB
xet
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
tokenized_df.json
844 MB
xet
All changes minus temp.jsonl from last 3 commits
almost 3 years ago
vocab_counts.json
39 MB
xet
All changes minus temp.jsonl from last 3 commits
almost 3 years ago