Linux command
piper 命令
网络
涉及管道、覆盖或删除,执行前请先确认路径和参数。
常用示例
Synthesize speech
echo "Hello world" | piper --model [en_US-lessac-medium] --output_file [output.wav]
List models
piper --list-models
Use specific model
piper -m [model.onnx] -c [model.json] < [input.txt] > [output.wav]
Set speaker
piper --model [model.onnx] --speaker [0] < [input.txt]
Adjust speaking rate
piper --model [model.onnx] --length_scale [1.5] < [input.txt]
JSON input mode
echo '{"text": "Hello"}' | piper --model [model.onnx] --json-input
说明
piper is a fast, local neural text-to-speech system that generates natural-sounding speech from text using ONNX-based voice models. It runs entirely offline after model download, requiring no internet connection or cloud API. Each model is trained for a specific language and voice. Multi-speaker models support selecting different voice variants via speaker ID. The length_scale and noise_scale parameters control speaking rate and variation, allowing fine-tuning of output characteristics. Input is read from stdin and output as WAV audio. JSON input mode enables structured text processing with per-utterance settings. Models are available for many languages through the Piper project's model repository.
参数
- -m, --model _FILE_
- ONNX model file.
- -c, --config _FILE_
- JSON config file.
- --output_file _FILE_
- Output WAV file.
- --output_dir _DIR_
- Output directory.
- --speaker _ID_
- Speaker ID for multi-speaker.
- --length_scale _FLOAT_
- Speaking rate (higher=slower).
- --noise_scale _FLOAT_
- Variation in speech.
- --json-input
- JSON input format.
- --list-models
- Show available models.
FAQ
What is the piper command used for?
piper is a fast, local neural text-to-speech system that generates natural-sounding speech from text using ONNX-based voice models. It runs entirely offline after model download, requiring no internet connection or cloud API. Each model is trained for a specific language and voice. Multi-speaker models support selecting different voice variants via speaker ID. The length_scale and noise_scale parameters control speaking rate and variation, allowing fine-tuning of output characteristics. Input is read from stdin and output as WAV audio. JSON input mode enables structured text processing with per-utterance settings. Models are available for many languages through the Piper project's model repository.
How do I run a basic piper example?
Run `echo "Hello world" | piper --model [en_US-lessac-medium] --output_file [output.wav]` in a terminal, then adjust file names, paths, flags, or remote targets for your system.
What does -m, --model _FILE_ do in piper?
ONNX model file.