1b_or_base / README.md
nytopop's picture
Upload folder using huggingface_hub
1c10e2b verified
metadata
library_name: transformers
language:
  - en
pipeline_tag: text-to-speech
base_model:
  - amuvarma/1b-extended_pretrain-checkpoint-192000
  - meta-llama/Llama-3.2-1B-Instruct

noteworthy changes

  • tokenizer omits <|audio|> to prevent finetunes resizing the embeddings unneccessarily
  • config.json uses the correct EOS to end generation on end of audio
  • generation_config.json uses the correct EOS to end generation on end of audio + has a reasonable default temperature
  • chat_template.jinja maps user messages to transcripts and assistant messages to audio for simple conversational context management