API reference

sequentia.datasets.load_digits(*, digits=[0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

Loads MFCC features of spoken digit audio samples from the Free Spoken Digit Dataset.

The Free Spoken Digit Dataset (FSDD) consists of 3000 recordings of the spoken digits 0-9.

This version provides 13 MFCC features for each recording, with 50 recordings of each digit from each of 6 individual speakers (50 × 10 × 6 = 3000 recordings in total).
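As a rough illustration of how the `digits` argument relates to the dataset size, the sketch below computes the number of recordings one would expect for a given subset, assuming recordings are spread evenly across digits and speakers as described above. The helper `expected_recordings` is hypothetical and not part of sequentia. The final call to `load_digits` uses only the documented keyword argument and is skipped if the library is not installed.

```python
from typing import Iterable

# Figures taken from the dataset description above.
RECORDINGS_PER_DIGIT_PER_SPEAKER = 50
SPEAKERS = 6


def expected_recordings(digits: Iterable[int]) -> int:
    """Hypothetical helper (not part of sequentia): number of recordings
    expected for a subset of digits, assuming an even split."""
    unique = set(digits)
    invalid = sorted(d for d in unique if d not in range(10))
    if invalid:
        raise ValueError(f"digits must be in 0-9, got {invalid}")
    return RECORDINGS_PER_DIGIT_PER_SPEAKER * SPEAKERS * len(unique)


print(expected_recordings(range(10)))  # 3000
print(expected_recordings([0, 1, 2]))  # 900

# Optional: load the same subset with sequentia, if it is available.
try:
    from sequentia.datasets import load_digits
except ImportError:
    load_digits = None  # sequentia not installed; skip the live call

if load_digits is not None:
    data = load_digits(digits=[0, 1, 2])
    print(type(data).__name__)
```

Note that `digits` is keyword-only (it follows the bare `*` in the signature), so it has to be passed by name.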


Parameters:

digits (Iterable[int]) – Subset of digits to include in the dataset.


Returns:

A dataset object representing the loaded digits.

Return type: