Digits

API reference

sequentia.datasets.load_digits(*, digits={0, 1, 2, 3, 4, 5, 6, 7, 8, 9})

Load a dataset of MFCC features of spoken digit audio samples from the Free Spoken Digit Dataset.

The Free Spoken Digit Dataset (FSDD) consists of 3000 recordings of the spoken digits 0-9.

This version consists of 13 MFCC features of 50 recordings for each digit by 6 individual speakers.

Parameters:

digits (set[int]) – Subset of digits to include in the dataset.

Returns:

A dataset object representing the loaded digits.

Return type:

SequentialDataset