This is the accompanying code for the papers Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis (accepted at Interspeech 2024) and Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables (published at ISMIR 2023).
The instructions on how to train and evaluate the model will be provided soon.
- The latest version of the code should be able to load the old checkpoints (under ckpts/), but training the v1 vocoder is not guaranteed to work. If you want to use the old code base that was made for the ISMIR 2023 paper, please check out the ismir23 tag or commit 6d323da.
If you find this code useful, please consider citing the following papers:
@inproceedings{ycy2023golf,
title = {Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables},
author = {Yu, Chin-Yun and Fazekas, György},
booktitle={Proc. International Society for Music Information Retrieval},
year={2023},
pages={667--675}
}
@misc{ycy2024golf,
title = {Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis},
author = {Yu, Chin-Yun and Fazekas, György},
year={2024},
eprint={2406.05128},
archivePrefix={arXiv},
primaryClass={eess.AS}
}