PaddleSpeech r1.3.0

SmileGoat released this 14 Dec 06:38

c54c950

HighLIght

S2T

Support U2/U2++ Conformer dy2static, and U2/U2++ C++ High Performance Streaming ASR Deployment. @zh794390558
Add Wav2vec2ASR-en, wav2vec2.0 fine-tuning for ASR on LibriSpeech. @Zth9730
Add Whisper CLI and Demos, support multi language recognition and translation. @zxcd
Add Wav2vec2 CLI and Demos, support ASR and Feature Extraction. @Zth9730
Add whisper. #2640 #2704 by @zxcd
Fix gpu training hang. #2478 by @Zth9730
Support u2++ based cli and server. #2489 #2510 by @Zth9730
Add wav2vec2-en. #2518 #2527 #2637 by @Zth9730
Add wav2vec2-zh cli. #2697 by @Zth9730

T2S

Add seek for BytesIO. #2484 by @ZapBird
Add mix finetune. #2525 #2647 by @lym0302
Add streaming TTS fastdeploy serving. #2528 by @HexToString
Add SSML for Chinese Text Frontend. #2531 by @david-95
Add end-to-end Prosody Prediction pipeline (including using prosody labels in Acoustic Model). #2548 #2615 #2693 by @WongLaw
Add Adversarial Loss for Chinese English mixed TTS. #2588 by @lym0302
Fix frontend bugs. #2539 #2606 by @yt605155624
Add TN for English unit. #2629 by @WongLaw
Add male voice for TTS. #2660 by @lym0302
Add double byte char for zh normalization. #2661 by @david-95
Add TTS Paddle-Lite x86 inference. #2636 #2667 by @yt605155624
Add greek char and fix #2571. #2683 by @david-95
Add Slim for TTS. #2729 by @yt605155624

Audio

Move paddlespeech/audio to paddleaudio. #2706 by @SmileGoat

Demo

Add TTSAndroid demo. #2703 by @yt605155624

New Contributors

@ZapBird made their first contribution in #2484
@HexToString made their first contribution in #2528
@dahu1 made their first contribution in #2554
@kFoodie made their first contribution in #2664
@zxcd made their first contribution in #2640
@michael-skynorth made their first contribution in #2666
@heyudage made their first contribution in #2688

Full Changelog: r1.2.0...r1.3.0

Contributors

zh794390558, zxcd, and 12 other contributors

Assets 2