Text-to-Audio
Safetensors
MLX
English
mlx-audio
stable_audio_3
audio-generation
music
diffusion
text-to-speech
speech
speech generation
voice cloning
tts
Instructions to use mlx-community/stable-audio-3-small-music with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use mlx-community/stable-audio-3-small-music with MLX:
# Download the model from the Hub
pip install huggingface_hub[hf_xet]
huggingface-cli download --local-dir stable-audio-3-small-music mlx-community/stable-audio-3-small-music
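The same download can also be scripted from Python. The following is a minimal sketch, assuming `huggingface_hub` is installed; the helper names `default_local_dir` and `download_model` are hypothetical and only mirror the `--local-dir` convention of the CLI command above.

```python
from pathlib import Path

REPO_ID = "mlx-community/stable-audio-3-small-music"


def default_local_dir(repo_id: str) -> Path:
    """Hypothetical helper: use the repo name (without the org) as the folder."""
    return Path(repo_id.split("/")[-1])


def download_model(repo_id: str = REPO_ID, local_dir=None) -> str:
    """Fetch every file of the repo into a local folder and return its path."""
    # Imported lazily so the helper above works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    target = Path(local_dir) if local_dir else default_local_dir(repo_id)
    return snapshot_download(repo_id=repo_id, local_dir=str(target))


if __name__ == "__main__":
    # Requires network access to the Hub.
    print(download_model())
```

Calling `download_model()` with no arguments should place the files in `stable-audio-3-small-music`, the same folder the CLI command targets.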
mlx-community/stable-audio-3-small-music
This model was converted to MLX format from stabilityai/stable-audio-3-small-music using mlx-audio version 0.4.3.
Refer to the original model card for more details on the model.
Use with mlx-audio
pip install -U mlx-audio
CLI Example:
python -m mlx_audio.tts.generate --model mlx-community/stable-audio-3-small-music --text "Hello, this is a test."
Python Example:
from mlx_audio.tts.utils import load_model
from mlx_audio.tts.generate import generate_audio

model = load_model("mlx-community/stable-audio-3-small-music")
generate_audio(
    model=model,
    text="Hello, this is a test.",
    ref_audio="path_to_audio.wav",
    file_prefix="test_audio",
)
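To render several prompts in one session, the model can be loaded once and reused. The sketch below is an illustration rather than a documented mlx-audio workflow: it uses only the `load_model` and `generate_audio` calls and the `model`, `text`, and `file_prefix` arguments shown above, and it assumes `ref_audio` can be omitted. The helpers `prompt_to_prefix` and `generate_many` and the sample prompts are hypothetical.

```python
import re

MODEL_ID = "mlx-community/stable-audio-3-small-music"


def prompt_to_prefix(prompt: str, index: int = 0, max_len: int = 40) -> str:
    """Hypothetical helper: turn a prompt into a filesystem-safe file prefix."""
    slug = re.sub(r"[^a-z0-9]+", "_", prompt.lower()).strip("_")
    slug = slug[:max_len].rstrip("_") or "audio"
    # The index keeps prefixes unique even when two prompts share a slug.
    return f"{index:02d}_{slug}"


def generate_many(prompts, model_id: str = MODEL_ID):
    """Load the model once, then write one output per prompt."""
    # Imported lazily: mlx-audio is only needed when generation actually runs.
    from mlx_audio.tts.utils import load_model
    from mlx_audio.tts.generate import generate_audio

    model = load_model(model_id)
    prefixes = []
    for i, prompt in enumerate(prompts):
        prefix = prompt_to_prefix(prompt, i)
        # Assumption: ref_audio is optional and can be left out here.
        generate_audio(model=model, text=prompt, file_prefix=prefix)
        prefixes.append(prefix)
    return prefixes


if __name__ == "__main__":
    # Illustrative prompts only; requires mlx-audio and Apple silicon.
    print(generate_many(["Lo-fi piano, 80 BPM", "Upbeat synthwave"]))
```

Deriving the prefix from the prompt keeps output files identifiable, and the numeric index avoids overwriting when prompts repeat.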
Model size: 0.6B params
Tensor type: F32
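As a rough guide to the download and memory footprint, 0.6B parameters stored as F32 take 4 bytes each. The sketch below works through that arithmetic; it treats the rounded 0.6B figure as exact and counts weights only, so activations and runtime overhead are not included. The function name `weight_bytes` is hypothetical.

```python
# Bytes per element for common tensor types.
BYTES_PER_ELEMENT = {"F32": 4, "F16": 2, "BF16": 2}


def weight_bytes(num_params: int, dtype: str = "F32") -> int:
    """Estimate the raw size of the weights in bytes."""
    return num_params * BYTES_PER_ELEMENT[dtype]


if __name__ == "__main__":
    size = weight_bytes(600_000_000, "F32")
    print(f"{size / 1e9:.1f} GB ({size / 2**30:.2f} GiB)")
```

Under these assumptions the weights come to about 2.4 GB.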
Quantized
Model tree for mlx-community/stable-audio-3-small-music
Base model: stabilityai/stable-audio-3-small-music-base