Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
Duplicated from
Tonic/SmolFactory
natesgituser
/
SmolFactory
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
d60ab6c
SmolFactory
217 kB
Ctrl+K
Ctrl+K
3 contributors
History:
19 commits
Tonic
solves oom error with more reasonable configuration
d60ab6c
unverified
11 months ago
config
solves oom error with more reasonable configuration
11 months ago
.gitignore
852 Bytes
first commit
11 months ago
A100_LARGE_SCALE_GUIDE.md
5.96 kB
adds A100 large experiments
11 months ago
CLOUD_DEPLOYMENT_GUIDE.md
11.8 kB
adds A100 large experiments
11 months ago
CLOUD_TRAINING_GUIDE.md
11.5 kB
adds A100 large experiments
11 months ago
DEPLOYMENT_GUIDE.md
9.45 kB
adds A100 large experiments
11 months ago
PUSH_GUIDE.md
9.98 kB
adds A100 large experiments
11 months ago
README.md
7.25 kB
adds A100 large experiments
11 months ago
TRACKIO_INTEGRATION.md
6.24 kB
adds A100 large experiments
11 months ago
app.py
12.3 kB
adds A100 large experiments
11 months ago
cloud_deployment.sh
8.44 kB
adds A100 large experiments
11 months ago
config.py
992 Bytes
first commit
11 months ago
create_sample_dataset.py
1.4 kB
first commit
11 months ago
data.py
13.2 kB
solves dataset dict issue
11 months ago
deploy_trackio_space.py
7.23 kB
adds A100 large experiments
11 months ago
model.py
7.81 kB
only enable distributed process group if available - it's not
11 months ago
monitoring.py
11.6 kB
adds A100 large experiments
11 months ago
push_to_huggingface.py
15.5 kB
adds A100 large experiments
11 months ago
requirements.txt
690 Bytes
improves requirements and dependencies
11 months ago
requirements_core.txt
263 Bytes
improves requirements and dependencies
11 months ago
requirements_minimal.txt
265 Bytes
removes flash attention from reqs
11 months ago
requirements_space.txt
302 Bytes
adds A100 large experiments
11 months ago
run_a100_large_experiment.py
5.63 kB
solves oom error with more reasonable configuration
11 months ago
test_monitoring.py
5.39 kB
adds A100 large experiments
11 months ago
test_setup.py
6.07 kB
first commit
11 months ago
train.py
6.13 kB
adds A100 large experiments
11 months ago
trainer.py
9.58 kB
removes max sequences length argument from trainer
11 months ago