PaddleSpeech r1.3.0
HighLIght
S2T
- Support U2/U2++ Conformer dy2static, and U2/U2++ C++ High Performance Streaming ASR Deployment. @zh794390558
- Add Wav2vec2ASR-en, wav2vec2.0 fine-tuning for ASR on LibriSpeech. @Zth9730
- Add Whisper CLI and Demos, support multi language recognition and translation. @zxcd
- Add Wav2vec2 CLI and Demos, support ASR and Feature Extraction. @Zth9730
- Add whisper. #2640 #2704 by @zxcd
- Fix gpu training hang. #2478 by @Zth9730
- Support u2++ based cli and server. #2489 #2510 by @Zth9730
- Add wav2vec2-en. #2518 #2527 #2637 by @Zth9730
- Add wav2vec2-zh cli. #2697 by @Zth9730
T2S
- Add seek for BytesIO. #2484 by @ZapBird
- Add mix finetune. #2525 #2647 by @lym0302
- Add streaming TTS fastdeploy serving. #2528 by @HexToString
- Add SSML for Chinese Text Frontend. #2531 by @david-95
- Add end-to-end Prosody Prediction pipeline (including using prosody labels in Acoustic Model). #2548 #2615 #2693 by @WongLaw
- Add Adversarial Loss for Chinese English mixed TTS. #2588 by @lym0302
- Fix frontend bugs. #2539 #2606 by @yt605155624
- Add TN for English unit. #2629 by @WongLaw
- Add male voice for TTS. #2660 by @lym0302
- Add double byte char for zh normalization. #2661 by @david-95
- Add TTS Paddle-Lite x86 inference. #2636 #2667 by @yt605155624
- Add greek char and fix #2571. #2683 by @david-95
- Add Slim for TTS. #2729 by @yt605155624
Audio
- Move paddlespeech/audio to paddleaudio. #2706 by @SmileGoat
Demo
- Add TTSAndroid demo. #2703 by @yt605155624
New Contributors
- @ZapBird made their first contribution in #2484
- @HexToString made their first contribution in #2528
- @dahu1 made their first contribution in #2554
- @kFoodie made their first contribution in #2664
- @zxcd made their first contribution in #2640
- @michael-skynorth made their first contribution in #2666
- @heyudage made their first contribution in #2688
Full Changelog: r1.2.0...r1.3.0