You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
* 替换了vocoder * 修改了vocoder_train * 减谱法 * 美化UI;语音增强;MFCC特征可视化 * 修复了训练fregan模型时的报错 * 增加了可以分析音频特征的独立文件 * 现已支持Fre-GAN声码器的训练 * 修复了训练fregan时保存模型的BUG * 删除了无用的文件 * 优化了识别声码器模型的方式 |
3 years ago | |
---|---|---|
.. | ||
LJSpeech-1.1 | 3 years ago | |
.gitignore | 3 years ago | |
LICENSE | 3 years ago | |
README.md | 3 years ago | |
config.json | 3 years ago | |
discriminator.py | 3 years ago | |
dwt.py | 3 years ago | |
generator.py | 3 years ago | |
inference.py | 3 years ago | |
loss.py | 3 years ago | |
meldataset.py | 3 years ago | |
modules.py | 3 years ago | |
requirements.txt | 3 years ago | |
stft_loss.py | 3 years ago | |
train.py | 3 years ago | |
utils.py | 3 years ago |
README.md
Fre-GAN Vocoder
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Training:
python train.py --config config.json
Citation:
@misc{kim2021fregan,
title={Fre-GAN: Adversarial Frequency-consistent Audio Synthesis},
author={Ji-Hoon Kim and Sang-Hoon Lee and Ji-Hyun Lee and Seong-Whan Lee},
year={2021},
eprint={2106.02297},
archivePrefix={arXiv},
primaryClass={eess.AS}
}
Note
- For more complete and end to end Voice cloning or Text to Speech (TTS) toolbox please visit Deepsync Technologies.