Free Spoken Digit Dataset (load_digits)

API reference

sequentia.datasets.load_digits(numbers=range(0, 10), random_state=None)

Load audio samples of spoken digits from the Free Spoken Digit Dataset.

The Free Spoken Digit Dataset (FSDD) consists of 3000 recordings of the spoken digits 0-9: 50 recordings of each digit by each of 6 speakers.
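The figures above imply 300 recordings per digit (6 speakers × 50 recordings). The sketch below only restates that arithmetic. The helper name expected_recording_count is hypothetical and is not part of sequentia. It also assumes that selecting a subset of digits keeps every recording of those digits, which the description suggests but does not state outright.

```python
def expected_recording_count(numbers=range(10), speakers=6, recordings_per_speaker=50):
    """Number of recordings implied by the dataset description.

    Hypothetical helper for illustration only; not part of sequentia.
    """
    digits = set(numbers)  # duplicates in `numbers` are counted once
    if not digits <= set(range(10)):
        raise ValueError("digits must be in the range 0-9")
    return len(digits) * speakers * recordings_per_speaker


full = expected_recording_count()            # all ten digits
subset = expected_recording_count([0, 1, 2])  # three digits only
```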

Parameters
numbers: array-like of int

Subset of digits to include in the dataset. Defaults to 0-9.

random_state: numpy.random.RandomState, int, optional

A random state object or seed for reproducible randomness.

Returns
dataset: sequentia.datasets.Dataset

A dataset object representing the loaded digits.
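As a minimal usage sketch, the wrapper below calls load_digits with the two documented parameters, numbers and random_state. The wrapper name load_digit_subset, its default arguments, and its input validation are assumptions added for illustration; only the load_digits call itself comes from the signature above. Running it requires sequentia to be installed.

```python
def load_digit_subset(numbers=(0, 1, 2), seed=42):
    """Load only the requested digits with a fixed seed.

    Hypothetical convenience wrapper; not part of sequentia.
    """
    digits = sorted({int(n) for n in numbers})
    if not digits or any(d < 0 or d > 9 for d in digits):
        raise ValueError("numbers must be a non-empty subset of 0-9")

    # Imported here so that defining the wrapper does not require sequentia.
    from sequentia.datasets import load_digits

    # Passing an int seed makes the loader's randomness reproducible.
    return load_digits(numbers=digits, random_state=seed)
```

Calling load_digit_subset() would return the Dataset object described under Returns, restricted to the digits 0, 1, and 2.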