Free Spoken Digit Dataset (load_digits
)
API reference
- sequentia.datasets.load_digits(numbers=range(0, 10), random_state=None)[source]
Load audio samples of spoken digits from the Free Spoken Digit Dataset.
The Free Spoken Digit Dataset (FSDD) consists of 3000 recordings of the spoken digits 0-9.
The dataset consists of 50 recordings of each digit by 6 individual speakers.
- Parameters
- numbers: array-like of int
Subset of digits to include in the dataset. Defaults to 0-9.
- random_state: numpy.random.RandomState, int, optional
A random state object or seed for reproducible randomness.
- Returns
- dataset:class:sequentia.datasets.Dataset
A dataset object representing the loaded digits.