音频处理文档¶
Torchaudio 是一个用于音频和信号处理的 PyTorch 库。 它提供了输入/输出、信号和数据处理功能、数据集、 模型实现以及应用程序组件。
本文档中描述的功能按发布状态进行分类:
Stable: These features will be maintained long-term and there should generally be no major performance limitations or gaps in documentation. We also expect to maintain backwards compatibility (although breaking changes can happen and notice will be given one release ahead of time).
Beta: Features are tagged as Beta because the API may change based on user feedback, because the performance needs to improve, or because coverage across operators is not yet complete. For Beta features, we are committing to seeing the feature through to the Stable classification. We are not, however, committing to backwards compatibility.
Prototype: These features are typically not available as part of binary distributions like PyPI or Conda, except sometimes behind run-time flags, and are at an early stage for feedback and testing.
API 参考¶
高级用法¶
引用 torchaudio¶
如果你发现 torchaudio 有用,请引用以下论文:
杨毅远,Hira, M., 倪正,Chourdia, A., Astafurov, A., 陈超,叶承甫,Puhrsch, C., Pollack, D., Genzel, D., Greenberg, D., 杨恩泽,连杰,Mahadeokar, J., Hwang, J., 陈杰,Goldsborough, P., Roy, P., Narenthiran, S., 渡边伸,Chintala, S., Quenneville-Bélair, V, 史岩. (2021). TorchAudio: 音频和语音处理的构建模块。arXiv 预印本 arXiv:2110.15018.
BibTeX 格式:
@article{yang2021torchaudio,
title={TorchAudio: Building Blocks for Audio and Speech Processing},
author={Yao-Yuan Yang and Moto Hira and Zhaoheng Ni and
Anjali Chourdia and Artyom Astafurov and Caroline Chen and
Ching-Feng Yeh and Christian Puhrsch and David Pollack and
Dmitriy Genzel and Donny Greenberg and Edward Z. Yang and
Jason Lian and Jay Mahadeokar and Jeff Hwang and Ji Chen and
Peter Goldsborough and Prabhat Roy and Sean Narenthiran and
Shinji Watanabe and Soumith Chintala and
Vincent Quenneville-Bélair and Yangyang Shi},
journal={arXiv preprint arXiv:2110.15018},
year={2021}
}