Grogros/dmWM-llama-3.2-1B-Instruct-DistillationWM

Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
conversational
text-generation-inference
dmWM-llama-3.2-1B-Instruct-DistillationWM / checkpoint-5000
4.96 GB
  • 1 contributor
History: 1 commit
Grogros
Training in progress, step 5000, checkpoint
4defcf3 verified 11 months ago
  • config.json: 926 Bytes
  • generation_config.json: 184 Bytes
  • model.safetensors: 4.94 GB (xet)
  • optimizer.pt: 3.72 MB (xet)
  • rng_state.pth: 14.2 kB (xet)
  • scheduler.pt: 1.06 kB (xet)
  • special_tokens_map.json: 325 Bytes
  • tokenizer.json: 17.2 MB (xet)
  • tokenizer_config.json: 54.6 kB
  • trainer_state.json: 87.4 kB
  • training_args.bin: 5.3 kB (xet)