SpeechCombine

This repository contains the model checkpoints, sample training data, and evaluation data for
the paper [Unlocking Speech鈥揟ext Compositional Powers: Instruction-Following Speech Language Models without Instruction Tuning]

馃搧 Repository structure

  • Exps/: SpeechCombine checkpoint and tokenizer
  • eval/: Eevaluation data and evaluator checkpoint
  • tts/: TTS checkpoint and speaker embedding
  • training_data.jsonl: SpeechCombine training data

馃敆 Citation

If this resource is useful for your research, please consider citing our paper.


License: CC BY-NC 4.0

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support