fluent_speech_commands

(s3prl.dataio.corpus.fluent_speech_commands)

Parse the Fluent Speech Commands corpus.

Authors:
  • Leo 2022
  • Cheng Liang 2022

FluentSpeechCommands

class s3prl.dataio.corpus.fluent_speech_commands.FluentSpeechCommands(dataset_root: str, n_jobs: int = 4)

Bases: Corpus

Parse the Fluent Speech Commands dataset.

Parameters:

dataset_root (str) – The root directory of the Fluent Speech Commands dataset

property all_data

Return all the data points as a dict in the following format:

data_id1:
    path: (str) The waveform path
    speakerId: (str) The speaker name
    transcription: (str) The transcription
    action: (str) The action
    object: (str) The object that the action targets
    location: (str) The location where the action happens

data_id2:
    ...
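
As a rough illustration of this layout, the sketch below builds a small dict by hand in the documented format and groups data ids by their (action, object, location) triple. The ids, paths, speaker names, and slot values are made-up placeholders rather than entries from the real corpus, and `group_by_intent` is a hypothetical helper that is not part of s3prl.

```python
from collections import defaultdict

# Hand-written stand-in for the dict returned by `all_data`.
# Every id, path, and value below is a placeholder chosen for illustration.
all_data = {
    "utt-001": {
        "path": "/data/fsc/wavs/utt-001.wav",
        "speakerId": "spk-A",
        "transcription": "Turn on the lights in the kitchen",
        "action": "activate",
        "object": "lights",
        "location": "kitchen",
    },
    "utt-002": {
        "path": "/data/fsc/wavs/utt-002.wav",
        "speakerId": "spk-B",
        "transcription": "Turn up the volume",
        "action": "increase",
        "object": "volume",
        "location": "none",
    },
    "utt-003": {
        "path": "/data/fsc/wavs/utt-003.wav",
        "speakerId": "spk-B",
        "transcription": "Kitchen lights on",
        "action": "activate",
        "object": "lights",
        "location": "kitchen",
    },
}


def group_by_intent(data):
    """Map each (action, object, location) triple to the data ids that carry it."""
    groups = defaultdict(list)
    for data_id, point in data.items():
        key = (point["action"], point["object"], point["location"])
        groups[key].append(data_id)
    return dict(groups)


intents = group_by_intent(all_data)
```

With a real corpus object, the same helper could presumably be applied to its `all_data` property directly, since only the documented keys are read.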

property data_split

Return a list:

train_data, valid_data, test_data

Each is a dict following the format specified in all_data.
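
A minimal sketch of consuming this property: the three dicts are unpacked in the documented order and their sizes are reported. `split_sizes` is a hypothetical helper, and `fake_corpus` is a stand-in object used so that the snippet runs without the dataset; in practice one would pass a `FluentSpeechCommands` instance instead.

```python
from types import SimpleNamespace


def split_sizes(corpus):
    """Return the number of data points in each split of a corpus-like object."""
    # Documented order: train, valid, test.
    train_data, valid_data, test_data = corpus.data_split
    return {
        "train": len(train_data),
        "valid": len(valid_data),
        "test": len(test_data),
    }


# Hypothetical stand-in exposing only the `data_split` attribute.
fake_corpus = SimpleNamespace(
    data_split=[{"a": {}, "b": {}}, {"c": {}}, {"d": {}}],
)
sizes = split_sizes(fake_corpus)
```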

property data_split_ids

Return a list:

train_ids, valid_ids, test_ids

Each is a list of data_ids. A data_id can be used as a key into all_data.
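
The lookup described above might be sketched as follows: pick one split's ids, then use each id as a key into `all_data`. `paths_for_split` and `fake_corpus` are hypothetical names introduced for illustration; only the `all_data` and `data_split_ids` attributes come from the documentation.

```python
from types import SimpleNamespace


def paths_for_split(corpus, split_index):
    """Collect waveform paths for one split (0 = train, 1 = valid, 2 = test)."""
    ids = corpus.data_split_ids[split_index]
    all_data = corpus.all_data  # read once, then index by data_id
    return [all_data[data_id]["path"] for data_id in ids]


# Hypothetical stand-in with placeholder ids and paths.
fake_corpus = SimpleNamespace(
    all_data={
        "a": {"path": "a.wav"},
        "b": {"path": "b.wav"},
        "c": {"path": "c.wav"},
    },
    data_split_ids=[["a"], ["b"], ["c"]],
)
test_paths = paths_for_split(fake_corpus, 2)
```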

classmethod download_dataset(tgt_dir: str) → None

Download and unzip the dataset to tgt_dir/fluent_speech_commands_dataset

Parameters:

tgt_dir (str) – The root directory containing many different datasets
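
One possible way to wrap this call is sketched below: the download is skipped when the target directory already exists. `fsc_dir` and `ensure_downloaded` are hypothetical helpers, and the existence check is an assumption of this sketch; the documentation does not say whether `download_dataset` itself skips an existing copy.

```python
from pathlib import Path


def fsc_dir(tgt_dir):
    """Directory that the documentation says the dataset is unzipped into."""
    return Path(tgt_dir) / "fluent_speech_commands_dataset"


def ensure_downloaded(tgt_dir):
    """Download the corpus only if its directory is not already present."""
    target = fsc_dir(tgt_dir)
    if not target.is_dir():
        # Imported lazily so the path helper above works without s3prl installed.
        from s3prl.dataio.corpus.fluent_speech_commands import FluentSpeechCommands

        FluentSpeechCommands.download_dataset(str(tgt_dir))
    return target
```

Whether the returned directory is exactly what `dataset_root` expects is not stated here, so it is worth checking before passing it to the constructor.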

static dataframe_to_datapoints(df: DataFrame, unique_name_fn: callable)