HW4 - Efficient Training and Inference

Author: Mira Xiao (minxiao)

GPU VRAM usage: 19977MiB / 23028MiB

{'eval_loss': 3.1775877475738525, 'eval_runtime': 305.9799, 'eval_samples_per_second': 49.023, 'eval_steps_per_second': 3.503, 'epoch': 1.0}

{'train_runtime': 11754.0517, 'train_samples_per_second': 24.247, 'train_steps_per_second': 1.732, 'train_loss': 3.5308782758627038, 'epoch': 1.0}
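The throughput figures in the training log can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the logged rates are averages over the whole run, and the helper name `derive_totals` is hypothetical rather than part of any library. Under that assumption, the ratio of samples to steps suggests an effective batch size, which the card itself does not state.

```python
# Values copied from the training log above.
train_log = {
    "train_runtime": 11754.0517,
    "train_samples_per_second": 24.247,
    "train_steps_per_second": 1.732,
}


def derive_totals(log: dict) -> dict:
    """Estimate totals from average rates (hypothetical helper, not a library API)."""
    runtime = log["train_runtime"]
    samples = log["train_samples_per_second"] * runtime
    steps = log["train_steps_per_second"] * runtime
    return {
        "approx_samples": samples,
        "approx_steps": steps,
        # Samples per optimizer step; an inferred value, not one reported by the card.
        "approx_effective_batch": samples / steps,
    }


totals = derive_totals(train_log)
for name, value in totals.items():
    print(f"{name}: {value:.1f}")
```

Because the logged rates are rounded to three decimals, the derived totals are approximate.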

Training time: 3:15:53 (hh:mm:ss)

Perplexity (2048): 23.99
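The reported perplexity is consistent with the exponential of the evaluation loss. The sketch below assumes that `eval_loss` is the mean per-token cross-entropy in nats, which is the usual convention for causal language models but is not stated on this card; the function name `perplexity_from_loss` is hypothetical.

```python
import math

# Value copied from the evaluation log above.
eval_loss = 3.1775877475738525


def perplexity_from_loss(loss: float) -> float:
    """Perplexity as exp(mean cross-entropy); assumes the loss is measured in nats."""
    return math.exp(loss)


ppl = perplexity_from_loss(eval_loss)
print(f"Perplexity: {ppl:.2f}")
```

The result agrees with the 23.99 reported above to two decimal places.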

Model size: 0.2B params (Safetensors)

Tensor type: BF16