ljspeech¶

Created by Keith Ito and Linda Johnson

version	1.0.0
license	CC0-1.0
usage	unrestricted
languages	eng
format	wav
channel	1
sampling rate	22050
bit depth	16
duration	0 days 23:55:17.076281179
files	13100, duration distribution: 1.1 s 10.1 s
repository	audb-public

Description¶

LJSpeech consists of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours. The texts were published between 1884 and 1964, and are in the public domain. The audio was recorded in 2016-17 by the LibriVox project and is also in the public domain. The audio clips were segmented automatically based on silences in the recording. Clip boundaries generally align with sentence or clause boundaries, but not always. The text was matched to the audio manually, and a quality assurance pass was done to ensure that the text accurately matched the words spoken in the audio. The original LibriVox recordings were distributed as 128 kbps MP3 files. As a result, they may contain artifacts introduced by the MP3 encoding.

Example¶

wavs/LJ002-0146.wav

../_images/ljspeech-1.0.0-player-waveform.png

Tables¶

Click on a row to toggle a preview.

files

filewise

transcription, normalized-transcription, speaker

file	transcription	normalized-transcription	speaker
wavs/LJ001-0001.wav	Printing, in the only sense with which we are at present concerned, differs from most if not from...	Printing, in the only sense with which we are at present concerned, differs from most if not from...	0
wavs/LJ001-0002.wav	in being comparatively modern.	in being comparatively modern.	0
wavs/LJ001-0003.wav	For although the Chinese took impressions from wood blocks engraved in relief for centuries befor...	For although the Chinese took impressions from wood blocks engraved in relief for centuries befor...	0
wavs/LJ001-0004.wav	produced the block books, which were the immediate predecessors of the true printed book,	produced the block books, which were the immediate predecessors of the true printed book,	0
wavs/LJ001-0005.wav	the invention of movable metal letters in the middle of the fifteenth century may justly be consi...	the invention of movable metal letters in the middle of the fifteenth century may justly be consi...	0
13100 rows x 3 columns

speaker

misc

gender

speaker	gender
0	female
1 row x 1 column

Schemes¶

ID	Dtype	Labels	Mappings
gender	str	female, male, other
normalized-transcription	str
speaker	int	0	gender
transcription	str