AodenT commited on
Commit
7ab505f
·
verified ·
1 Parent(s): 2cadfd6

Removed LLM compute amortization comment

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -75,9 +75,9 @@ Miso TTS 8B uses two transformer components:
75
  - A smaller decoder transformer that autoregressively predicts higher-order
76
  audio codebooks within each frame.
77
 
78
- The model follows Sesame's compute-amortized decoder design: codebook 0 is
79
  predicted from the backbone hidden state, while codebooks 1 through 31 are
80
- predicted by the audio decoder.
81
 
82
  ---
83
 
 
75
  - A smaller decoder transformer that autoregressively predicts higher-order
76
  audio codebooks within each frame.
77
 
78
+ Codebook 0 is
79
  predicted from the backbone hidden state, while codebooks 1 through 31 are
80
+ predicted by the audio decoder autoregressively in codebook depth.
81
 
82
  ---
83