torchaudio
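This library is part of the PyTorch project. PyTorch is an open source machine learning framework.
Features described in this documentation are classified by release status: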
Stable: These features will be maintained long-term and there should generally be no major performance limitations or gaps in documentation. We also expect to maintain backwards compatibility (although breaking changes can happen and notice will be given one release ahead of time).
Beta: Features are tagged as Beta because the API may change based on user feedback, because the performance needs to improve, or because coverage across operators is not yet complete. For Beta features, we are committing to seeing the feature through to the Stable classification. We are not, however, committing to backwards compatibility.
Prototype: These features are typically not available as part of binary distributions like PyPI or Conda, except sometimes behind run-time flags, and are at an early stage for feedback and testing.
The torchaudio package consists of I/O, popular datasets, and common audio transformations.
Package Reference
- torchaudio
- torchaudio.backend
- torchaudio.functional
  - spectrogram
  - amplitude_to_DB
  - create_fb_matrix
  - create_dct
  - mu_law_encoding
  - mu_law_decoding
  - complex_norm
  - angle
  - magphase
  - phase_vocoder
  - lfilter
  - biquad
  - lowpass_biquad
  - highpass_biquad
  - allpass_biquad
  - equalizer_biquad
  - bandpass_biquad
  - bandreject_biquad
  - band_biquad
  - treble_biquad
  - bass_biquad
  - deemph_biquad
  - riaa_biquad
  - contrast
  - dcshift
  - overdrive
  - phaser
  - flanger
  - mask_along_axis
  - mask_along_axis_iid
  - compute_deltas
  - detect_pitch_frequency
  - sliding_window_cmn
  - vad
- torchaudio.transforms
- torchaudio.datasets
- torchaudio.models
- torchaudio.sox_effects
- torchaudio.compliance.kaldi
- torchaudio.kaldi_io
- torchaudio.utils
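
As a rough illustration of one of the common audio transformations listed above, the sketch below shows the standard mu-law companding formula that `mu_law_encoding` and `mu_law_decoding` are built around. It uses plain Python on a single sample rather than torchaudio itself, and the function names `mu_law_encode` and `mu_law_decode` are hypothetical, chosen only for this example. It assumes input samples normalized to [-1.0, 1.0] and 256 quantization channels; the library operates on tensors and may differ in details.

```python
import math


def mu_law_encode(x: float, quantization_channels: int = 256) -> int:
    """Map a sample in [-1.0, 1.0] to an integer code in [0, quantization_channels - 1]."""
    mu = quantization_channels - 1
    # Compress: logarithmic companding keeps more resolution near zero.
    compressed = math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)
    # Quantize: shift [-1, 1] to [0, mu] and round to the nearest integer.
    return int((compressed + 1) / 2 * mu + 0.5)


def mu_law_decode(code: int, quantization_channels: int = 256) -> float:
    """Map an integer code back to an approximate sample in [-1.0, 1.0]."""
    mu = quantization_channels - 1
    # Undo the shift: [0, mu] back to [-1, 1].
    compressed = code / mu * 2 - 1
    # Expand: invert the logarithmic companding.
    return math.copysign(math.expm1(abs(compressed) * math.log1p(mu)) / mu, compressed)


if __name__ == "__main__":
    for sample in (-1.0, -0.5, 0.0, 0.5, 1.0):
        code = mu_law_encode(sample)
        print(sample, code, round(mu_law_decode(code), 4))
```

Decoding an encoded sample recovers it only approximately, because the encoding quantizes to a fixed number of levels; the error is smallest for quiet samples and grows toward full scale.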