stream()¶

audb.stream(name, table, *, version=None, map=None, batch_size=16, shuffle=False, buffer_size=100000, only_metadata=False, bit_depth=None, channels=None, format=None, mixdown=False, sampling_rate=None, full_path=True, cache_root=None, num_workers=1, timeout=86400, verbose=True)[source]¶

Stream table and media files of a database.

Loads only the first batch_size rows of a table into memory, and downloads only the related media files, if any media files are requested.

By setting bit_depth, channels, format, mixdown, and sampling_rate we can request a specific flavor of the database. In that case media files are automatically converted to the desired properties (see also audb.Flavor).

Parameters

name (str) – name of database
table (str) – name of table
version (Optional[str]) – version string, latest if None
map (Optional[dict[str, str | Sequence[str]]]) – map scheme or scheme fields to column values. For example if your table holds a column speaker with speaker IDs, which is assigned to a scheme that contains a dict mapping speaker IDs to age and gender entries, map={'speaker': ['age', 'gender']} will replace the column with two new columns that map ID values to age and gender, respectively. To also keep the original column with speaker IDS, you can do map={'speaker': ['speaker', 'age', 'gender']}
batch_size (int) – number of table rows to return in one iteration
shuffle (bool) – if True, it first reads buffer_size rows from the table and selects batch_size randomly from them
buffer_size (int) – number of table rows to be loaded when shuffle is True
only_metadata (bool) – load only header and tables of database
bit_depth (Optional[int]) – bit depth, one of 16, 24, 32
channels (Union[int, Sequence[int], None]) – channel selection, see audresample.remix(). Note that media files with too few channels will be first upsampled by repeating the existing channels. E.g. channels=[0, 1] upsamples all mono files to stereo, and channels=[1] returns the second channel of all multi-channel files and all mono files
format (Optional[str]) – file format, one of 'flac', 'wav'
mixdown (bool) – apply mono mix-down
sampling_rate (Optional[int]) – sampling rate in Hz, one of 8000, 16000, 22050, 24000, 44100, 48000
full_path (bool) – replace relative with absolute file paths
cache_root (Optional[str]) – cache folder where databases are stored. If not set audb.default_cache_root() is used
num_workers (Optional[int]) – number of parallel jobs or 1 for sequential processing. If None will be set to the number of processors on the machine multiplied by 5
timeout (float) – maximum time in seconds before giving up acquiring a lock to the database cache folder. None is returned in this case
verbose (bool) – show debug messages

Return type

DatabaseIterator

Returns

database object

Raises

ValueError – if table is requested that is not part of the database
ValueError – if a non-supported bit_depth, format, or sampling_rate is requested
RuntimeError – if a flavor is requested, but the database contains media files, that don’t contain audio, e.g. text files

Examples

>>> import numpy as np
>>> np.random.seed(1)
>>> db = audb.stream(
...     "emodb",
...     "files",
...     version="1.4.1",
...     batch_size=4,
...     shuffle=True,
...     only_metadata=True,
...     full_path=False,
...     verbose=False,
... )
>>> next(db)
                                 duration  speaker transcription
file
wav/14a05Fb.wav 0 days 00:00:03.128687500       14           a05
wav/15a05Eb.wav 0 days 00:00:03.993562500       15           a05
wav/12a05Nd.wav    0 days 00:00:03.185875       12           a05
wav/13a07Na.wav 0 days 00:00:01.911687500       13           a07