Kosp2e: Korean Speech to English Translation Corpus
Paper
•
2107.02875
•
Published
This is the ESPnet2 recipe for the KoSP2E (Korean Speech Perception and Production Experiment) dataset.
The KoSP2E dataset is a large-scale Korean speech corpus designed for speech perception and production experiments. This recipe provides a full ASR pipeline using ESPnet2 with both Transformer and Conformer architectures.
Environment
| dataset | Snt | Wrd | Corr | Sub | Del | Ins | Err | S.Err |
|---|---|---|---|---|---|---|---|---|
| test | 2320 | 22337 | 77.1 | 20.4 | 2.6 | 4.4 | 27.4 | 76.4 |
| dataset | Snt | Wrd | Corr | Sub | Del | Ins | Err | S.Err |
|---|---|---|---|---|---|---|---|---|
| test | 2320 | 84267 | 92.5 | 5.7 | 1.8 | 1.7 | 9.2 | 76.4 |
| dataset | Snt | Wrd | Corr | Sub | Del | Ins | Err | S.Err |
|---|---|---|---|---|---|---|---|---|
| test | 2320 | 65361 | 89.4 | 8.6 | 2.0 | 2.1 | 12.7 | 76.4 |