ljspeech¶
Created by Keith Ito and Linda Johnson
version |
1.0.0 |
license |
|
usage |
unrestricted |
languages |
eng |
format |
wav |
channel |
1 |
sampling rate |
22050 |
bit depth |
16 |
duration |
0 days 23:55:17.076281179 |
files |
13100, duration distribution: 1.1 s |
repository |
audb-public |
Description¶
LJSpeech consists of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours. The texts were published between 1884 and 1964, and are in the public domain. The audio was recorded in 2016-17 by the LibriVox project and is also in the public domain. The audio clips were segmented automatically based on silences in the recording. Clip boundaries generally align with sentence or clause boundaries, but not always. The text was matched to the audio manually, and a quality assurance pass was done to ensure that the text accurately matched the words spoken in the audio. The original LibriVox recordings were distributed as 128 kbps MP3 files. As a result, they may contain artifacts introduced by the MP3 encoding.
Tables¶
Click on a row to toggle a preview.
ID |
Type |
Columns |
---|---|---|
files |
filewise |
transcription, normalized-transcription, speaker |
speaker |
misc |
gender |
Schemes¶
ID |
Dtype |
Labels |
Mappings |
---|---|---|---|
gender |
str |
female, male, other |
|
normalized-transcription |
str |
||
speaker |
int |
0 |
gender |
transcription |
str |