Instructions to use facebook/mms-tts-lif with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use facebook/mms-tts-lif with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="facebook/mms-tts-lif")# Load model directly from transformers import AutoTokenizer, AutoModelForTextToWaveform tokenizer = AutoTokenizer.from_pretrained("facebook/mms-tts-lif") model = AutoModelForTextToWaveform.from_pretrained("facebook/mms-tts-lif") - Notebooks
- Google Colab
- Kaggle
mms-tts-lif
#1
by SystemSolution21 - opened
from transformers import VitsModel, AutoTokenizer
import torch
import scipy
from IPython.display import Audio
Initialize Model
model = VitsModel.from_pretrained("facebook/mms-tts-lif")
tokenizer = AutoTokenizer.from_pretrained("facebook/mms-tts-lif")
Input Limbu text
text = """
ᤛᤣᤘᤠᤖᤥ ᤀᤠᤍᤠᤱᤒᤠ ᤀᤠᤍᤠᤱᤔᤠᤛᤣ ᤀᤠᤏᤡ ᤕᤠᤰᤌᤢᤱ ᤛᤡᤰᤁᤢᤶ ᤏᤡᤱᤘᤠ ᤋᤅᤡ ᤖᤥ ᤀᤠᤏᤡ ᤕᤠᤰᤌᤢᤱ ᤐᤠᤏ᤻ᤍᤠᤱᤜᤠ ᤐᤠᤖᤡᤖᤥ ᤛᤰᤛᤰᤜᤠ
ᤏᤡᤖᤢᤶᤗᤥ ᤛᤡᤖᤡᤈᤱᤃᤠ ᤛᤠᤵᤔᤢᤴᤎᤢᤶᤜᤠ ᤜᤢ ᤀᤠᤛᤡᤖᤥ ᤛᤡᤖᤡᤈᤱᤃᤠ ᤐᤠᤏ᤻ᤍᤠᤱᤅᤡᤴᤃ ᤀᤠᤏᤡ ᤕᤠᤰᤌᤢᤱ ᤔᤠ᤺ᤐᤠᤏ᤻ᤗᤥ
"""
inputs = tokenizer(text, return_tensors="pt")
Call model
with torch.no_grad():
output = model(**inputs).waveform
Output save as .wav file
scipy.io.wavfile.write("techno.wav", rate=model.config.sampling_rate, data=output)
Display output
Audio(output, rate=model.config.sampling_rate)