Sunbird/salt
Viewer • Updated • 80.8k • 217 • 15
How to use cxlrd/luganda-tts-v3 with NeMo:
# tag did not correspond to a valid NeMo domain.
Text-to-Speech system for Luganda using NVIDIA NeMo.
| Model | Description | Size |
|---|---|---|
luganda_fastpitch.nemo |
FastPitch spectrogram generator | 187 MB |
luganda_hifigan.nemo |
HiFi-GAN neural vocoder | 339 MB |
from nemo.collections.tts.models import FastPitchModel, HifiGanModel
fastpitch = FastPitchModel.restore_from("luganda_fastpitch.nemo")
hifigan = HifiGanModel.restore_from("luganda_hifigan.nemo")
text = "Oli otya?"
spec = fastpitch.generate_spectrogram(tokens=fastpitch.parse(text))
audio = hifigan.convert_spectrogram_to_audio(spec=spec)