cuber12/dropout-decay
English
dropout
streaming
language-modeling
transformer
mps
reproducibility
License: mit
main: dropout-decay/docs (88.1 kB)
2 contributors
History: 18 commits
Latest commit: Mandeep Sidhu, "Make research artifacts self contained" (618af58), 1 day ago
File | Size | Last commit | Updated
formula_coefficient_methodology.md | 16.1 kB | Use absolute regime names for streaming reports | 3 days ago
openwebtext10k_streaming_report.md | 12 kB | Document regime runbook and schedule provenance | 1 day ago
plan.md | 36.5 kB | Make research artifacts self contained | 1 day ago
tinystories_streaming_report.md | 7.33 kB | Use absolute regime names for streaming reports | 3 days ago
wikitext103_streaming_report.md | 16.1 kB | Add WikiText-103 five-seed streaming validation | 2 days ago