
Commit 26f916d

yt605155624 and kslz authored
[cherry-pick][r1.1] fix g2pw input type for windows (#2289)
* [TTS] Specify the input data type for G2PW, test=tts (#2288)
* fix ONNXRuntimeError Specify data type (int64), test=tts
* Tactron2→Tacotron2, test=doc
* update version

Co-authored-by: 李子 <54951765+kslz@users.noreply.github.com>
1 parent 2aa2fa8 commit 26f916d


10 files changed, +17 -17 lines changed


docs/source/released_model.md

Lines changed: 1 addition & 1 deletion
@@ -67,7 +67,7 @@ WaveRNN | CSMSC |[WaveRNN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tr
 Model Type | Dataset| Example Link | Pretrained Models
 :-------------:| :------------:| :-----: | :-----: |
 GE2E| AISHELL-3, etc. |[ge2e](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/other/ge2e)|[ge2e_ckpt_0.3.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/ge2e/ge2e_ckpt_0.3.zip)
-GE2E + Tactron2| AISHELL-3 |[ge2e-tactron2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc0)|[tacotron2_aishell3_ckpt_vc0_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_aishell3_ckpt_vc0_0.2.0.zip)
+GE2E + Tacotron2| AISHELL-3 |[ge2e-Tacotron2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc0)|[tacotron2_aishell3_ckpt_vc0_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_aishell3_ckpt_vc0_0.2.0.zip)
 GE2E + FastSpeech2 | AISHELL-3 |[ge2e-fastspeech2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc1)|[fastspeech2_nosil_aishell3_vc1_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_aishell3_vc1_ckpt_0.5.zip)

docs/source/tts/quick_start.md

Lines changed: 2 additions & 2 deletions
@@ -7,7 +7,7 @@ The examples in PaddleSpeech are mainly classified by datasets, the TTS datasets
 * VCTK (English multiple speakers)

 The models in PaddleSpeech TTS have the following mapping relationship:
-* tts0 - Tactron2
+* tts0 - Tacotron2
 * tts1 - TransformerTTS
 * tts2 - SpeedySpeech
 * tts3 - FastSpeech2
@@ -17,7 +17,7 @@ The models in PaddleSpeech TTS have the following mapping relationship:
 * voc3 - MultiBand MelGAN
 * voc4 - Style MelGAN
 * voc5 - HiFiGAN
-* vc0 - Tactron2 Voice Clone with GE2E
+* vc0 - Tacotron2 Voice Clone with GE2E
 * vc1 - FastSpeech2 Voice Clone with GE2E

 ## Quick Start

docs/source/tts/quick_start_cn.md

Lines changed: 2 additions & 2 deletions
@@ -9,7 +9,7 @@

 PaddleSpeech 的 TTS 模型具有以下映射关系:

-* tts0 - Tactron2
+* tts0 - Tacotron2
 * tts1 - TransformerTTS
 * tts2 - SpeedySpeech
 * tts3 - FastSpeech2
@@ -19,7 +19,7 @@ PaddleSpeech 的 TTS 模型具有以下映射关系:
 * voc3 - MultiBand MelGAN
 * voc4 - Style MelGAN
 * voc5 - HiFiGAN
-* vc0 - Tactron2 Voice Clone with GE2E
+* vc0 - Tacotron2 Voice Clone with GE2E
 * vc1 - FastSpeech2 Voice Clone with GE2E

 ## 快速开始

docs/tutorial/tts/tts_tutorial.ipynb

Lines changed: 1 addition & 1 deletion
@@ -769,7 +769,7 @@
 "```\n",
 "我们在每个数据集的 README.md 介绍了子目录和模型的对应关系, 在 TTS 中有如下对应关系:\n",
 "```text\n",
-"tts0 - Tactron2\n",
+"tts0 - Tacotron2\n",
 "tts1 - TransformerTTS\n",
 "tts2 - SpeedySpeech\n",
 "tts3 - FastSpeech2\n",

examples/aishell3/README.md

Lines changed: 2 additions & 2 deletions
@@ -1,12 +1,12 @@
 # Aishell3

-* tts0 - Tactron2
+* tts0 - Tacotron2
 * tts1 - TransformerTTS
 * tts2 - SpeedySpeech
 * tts3 - FastSpeech2
 * voc0 - WaveFlow
 * voc1 - Parallel WaveGAN
 * voc2 - MelGAN
 * voc3 - MultiBand MelGAN
-* vc0 - Tactron2 Voice Cloning with GE2E
+* vc0 - Tacotron2 Voice Cloning with GE2E
 * vc1 - FastSpeech2 Voice Cloning with GE2E

examples/csmsc/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@

 # CSMSC

-* tts0 - Tactron2
+* tts0 - Tacotron2
 * tts1 - TransformerTTS
 * tts2 - SpeedySpeech
 * tts3 - FastSpeech2

examples/ljspeech/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@

 # LJSpeech

-* tts0 - Tactron2
+* tts0 - Tacotron2
 * tts1 - TransformerTTS
 * tts2 - SpeedySpeech
 * tts3 - FastSpeech2

examples/vctk/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@

 # VCTK

-* tts0 - Tactron2
+* tts0 - Tacotron2
 * tts1 - TransformerTTS
 * tts2 - SpeedySpeech
 * tts3 - FastSpeech2

paddlespeech/t2s/frontend/g2pw/dataset.py

Lines changed: 5 additions & 5 deletions
@@ -81,12 +81,12 @@ def prepare_onnx_input(tokenizer,
         position_ids.append(position_id)

     outputs = {
-        'input_ids': np.array(input_ids),
-        'token_type_ids': np.array(token_type_ids),
-        'attention_masks': np.array(attention_masks),
+        'input_ids': np.array(input_ids).astype(np.int64),
+        'token_type_ids': np.array(token_type_ids).astype(np.int64),
+        'attention_masks': np.array(attention_masks).astype(np.int64),
         'phoneme_masks': np.array(phoneme_masks).astype(np.float32),
-        'char_ids': np.array(char_ids),
-        'position_ids': np.array(position_ids),
+        'char_ids': np.array(char_ids).astype(np.int64),
+        'position_ids': np.array(position_ids).astype(np.int64),
     }
     return outputs
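
The casts above matter because `np.array` applied to a list of Python integers picks a platform-dependent default integer type. On Windows with NumPy releases before 2.0 that default is 32-bit, which presumably is what triggered the ONNXRuntimeError named in the commit message when the model expected int64 inputs. The sketch below illustrates the explicit cast; `to_int64` and the sample token ids are hypothetical and are not part of PaddleSpeech.

```python
import numpy as np


def to_int64(values):
    """Hypothetical helper: build an array and force a 64-bit integer dtype."""
    return np.array(values).astype(np.int64)


# Made-up token ids, used only for illustration.
input_ids = [[101, 2769, 102]]

# Without a cast, the integer dtype follows the platform default.
default_arr = np.array(input_ids)

# With the cast, the dtype is int64 on every platform.
fixed_arr = to_int64(input_ids)

# Simulate the 32-bit default and confirm the cast still yields int64.
from_int32 = to_int64(np.array(input_ids, dtype=np.int32))

print("default dtype:", default_arr.dtype)
print("cast dtype:", fixed_arr.dtype)
```

The cast leaves the values unchanged and only pins the dtype, so it is harmless on platforms where the default is already int64.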

setup.py

Lines changed: 1 addition & 1 deletion
@@ -28,7 +28,7 @@

 HERE = Path(os.path.abspath(os.path.dirname(__file__)))

-VERSION = '1.1.1'
+VERSION = '1.1.2'
 COMMITID = 'none'

 base = [
