Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Roman190928
/
12MTRANSFORMER
like
0
Text Generation
Transformers
starhopp3r/TinyChat
English
meh
bad
horrible
dont-use
skilless
12m
good
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
total steps = 9250
total time(seconds) = 4650.91
final param count = 12,717,312
final loss = 1.287906
avg loss = 2.080128
took too long to train on 1xT4
total steps = 9250
total time(seconds) = 4650.91
final param count = 12,717,312
final loss = 1.287906
avg loss = 2.080128
took too long to train on 1xT4
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
Roman190928/12MTRANSFORMER
Finetunes
1 model
Dataset used to train
Roman190928/12MTRANSFORMER
starhopp3r/TinyChat
Viewer
•
Updated
Oct 2
•
1M
•
519
•
15