torchaudio.prototype.pipelines
The pipelines subpackage contains APIs for accessing models with pretrained weights, along with relevant utilities.
RNN-T Streaming/Non-Streaming ASR
Pretrained Models
| Pre-trained Emformer-RNNT-based ASR pipeline capable of performing both streaming and non-streaming inference. | |
| Pre-trained Emformer-RNNT-based ASR pipeline capable of performing both streaming and non-streaming inference. | 
HiFiGAN Vocoder
Interface
HiFiGANVocoderBundle defines a HiFiGAN vocoder pipeline capable of transforming mel spectrograms into waveforms.
| Data class that bundles associated information to use pretrained HiFiGAN vocoder models. |
Pretrained Models
| HiFiGAN Vocoder pipeline, trained on The LJ Speech Dataset [Ito and Johnson, 2017]. | 
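The mel-spectrogram-to-waveform flow described above can be sketched as a small helper. This is a minimal illustration, not the library's reference usage: the accessor names `get_mel_transform()` and `get_vocoder()` are not listed on this page and are assumed here, so check them against the installed torchaudio version. The helper name `resynthesize` is hypothetical.

```python
def resynthesize(bundle, waveform):
    """Round-trip a waveform through a vocoder bundle.

    Assumptions (not stated on this page):
      - `bundle` exposes `get_mel_transform()` and `get_vocoder()`.
      - `waveform` is already at the sample rate the bundle expects.
    """
    # Assumed accessor: returns a callable mapping waveform -> mel spectrogram.
    mel_transform = bundle.get_mel_transform()
    # Assumed accessor: returns a callable mapping mel spectrogram -> waveform.
    vocoder = bundle.get_vocoder()

    mel_spec = mel_transform(waveform)
    return vocoder(mel_spec)
```

With a real bundle, the waveform would typically be resampled to the bundle's sample rate first, and the call would be wrapped in `torch.no_grad()` since only inference is needed.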
VGGish
Interface
| VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models. | |
| Implementation of VGGish model [Hershey et al., 2017]. | |
| Converts raw waveforms to batches of examples to use as inputs to VGGish. | 
Pretrained Models
| Pre-trained VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models. |
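The two VGGish components listed under Interface (an input processor that turns raw waveforms into batches of examples, and the model itself) would usually be chained. The sketch below shows that chaining under assumptions: the accessor names `get_input_processor()` and `get_model()` do not appear on this page and should be verified against the installed torchaudio version, and the helper name `embed` is hypothetical.

```python
def embed(bundle, waveform):
    """Run a waveform through a VGGish-style bundle.

    Assumptions (not stated on this page):
      - `bundle` exposes `get_input_processor()` and `get_model()`.
      - `waveform` is mono and already at the bundle's sample rate.
    """
    # Assumed accessor: returns a callable mapping waveform -> batch of examples.
    input_processor = bundle.get_input_processor()
    # Assumed accessor: returns a callable mapping batch of examples -> model output.
    model = bundle.get_model()

    examples = input_processor(waveform)
    return model(examples)
```

Keeping the preprocessing and the model as separate steps makes it possible to inspect or cache the intermediate batch of examples before inference.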