ljspeech

Created by Keith Ito and Linda Johnson

version

1.0.0

license

CC0-1.0

usage

unrestricted

languages

eng

format

wav

channel

1

sampling rate

22050

bit depth

16

duration

0 days 23:55:17.076281179

files

13100, duration distribution: 1.1 s ljspeech-1.0.0-file-duration-distribution 10.1 s

repository

audb-public

Description

LJSpeech consists of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours. The texts were published between 1884 and 1964, and are in the public domain. The audio was recorded in 2016-17 by the LibriVox project and is also in the public domain. The audio clips were segmented automatically based on silences in the recording. Clip boundaries generally align with sentence or clause boundaries, but not always. The text was matched to the audio manually, and a quality assurance pass was done to ensure that the text accurately matched the words spoken in the audio. The original LibriVox recordings were distributed as 128 kbps MP3 files. As a result, they may contain artifacts introduced by the MP3 encoding.

Example

wavs/LJ002-0146.wav

../_images/ljspeech-1.0.0-player-waveform.png

Tables

Click on a row to toggle a preview.

ID

Type

Columns

files

filewise

transcription, normalized-transcription, speaker

file

transcription

normalized-transcription

speaker

wavs/LJ001-0001.wav

Printing, in the only sense with which we are at present concerned, differs from most if not from...

Printing, in the only sense with which we are at present concerned, differs from most if not from...

0

wavs/LJ001-0002.wav

in being comparatively modern.

in being comparatively modern.

0

wavs/LJ001-0003.wav

For although the Chinese took impressions from wood blocks engraved in relief for centuries befor...

For although the Chinese took impressions from wood blocks engraved in relief for centuries befor...

0

wavs/LJ001-0004.wav

produced the block books, which were the immediate predecessors of the true printed book,

produced the block books, which were the immediate predecessors of the true printed book,

0

wavs/LJ001-0005.wav

the invention of movable metal letters in the middle of the fifteenth century may justly be consi...

the invention of movable metal letters in the middle of the fifteenth century may justly be consi...

0

13100 rows x 3 columns

speaker

misc

gender

speaker

gender

0

female

1 row x 1 column

Schemes

ID

Dtype

Labels

Mappings

gender

str

female, male, other

normalized-transcription

str

speaker

int

0

gender

transcription

str