dataset#
(s3prl.dataio.dataset)
Authors: |
|
Dataset#
DiarizationDataset#
EncodeCategories#
EncodeCategory#
EncodeMultiLabel#
EncodeText#
FrameLabelDataset#
- class s3prl.dataio.dataset.FrameLabelDataset(df: DataFrame, num_class: int, frame_shift: int, chunk_secs: float, step_secs: float, use_unfull_chunks: bool = True, load_audio_conf: Optional[dict] = None, sample_rate: int = 16000)[source][source]#
Bases:
Dataset
- Parameters:
df (pd.DataFrame) – the dataframe should have the following columns record_id (str), wav_path (str), duration (float), utt_id (str), label (int), start_sec (float), end_sec (float)
LoadAudio#
- class s3prl.dataio.dataset.LoadAudio(filepaths: List[str], start_secs: Optional[List[float]] = None, end_secs: Optional[List[float]] = None, sox_effects: Optional[Tuple[Tuple[str]]] = None, individual_sox_effects: Optional[List[Tuple[Tuple[str]]]] = None, max_secs: Optional[float] = None, generator: Optional[Random] = None, sample_rate: int = 16000)[source][source]#
Bases:
Dataset
- Parameters:
start_secs – use None if load from start
end_secs – use None if load to end