使用 TensorDict 简化 PyTorch 内存管理¶

在本教程中，您将学习如何控制 a 的内容在内存中的存储位置，方法是将这些内容发送到设备或者利用内存映射。TensorDict

设备¶

创建时，可以使用 keyword argument 指定设备。如果设置了，则的所有条目都将放置在该设备上。如果未设置，则不要求 the 中的条目必须位于同一装置。TensorDictdevicedeviceTensorDictdeviceTensorDict

在此示例中，我们使用实例化 a 。什么时候我们打印可以看到它们已移动到设备上的内容。TensorDictdevice="cuda:0"

>>> import torch
>>> from tensordict import TensorDict
>>> tensordict = TensorDict({"a": torch.rand(10)}, [10], device="cuda:0")
>>> print(tensordict)
TensorDict(
    fields={
        a: Tensor(shape=torch.Size([10]), device=cuda:0, dtype=torch.float32, is_shared=True)},
    batch_size=torch.Size([10]),
    device=cuda:0,
    is_shared=True)

如果的设备不是，则还会移动新条目拖动到设备上。TensorDictNone

>>> tensordict["b"] = torch.rand(10, 10)
>>> print(tensordict)
TensorDict(
    fields={
        a: Tensor(shape=torch.Size([10]), device=cuda:0, dtype=torch.float32, is_shared=True),
        b: Tensor(shape=torch.Size([10, 10]), device=cuda:0, dtype=torch.float32, is_shared=True)},
    batch_size=torch.Size([10]),
    device=cuda:0,
    is_shared=True)

您可以使用该属性查看的当前设备。TensorDictdevice

>>> print(tensordict.device)
cuda:0

的内容可以发送到 PyTorch 张量等设备跟TensorDictTensorDict.cuda()或TensorDict.device(device)成为所需的设备。device

>>> tensordict.to(torch.device("cpu"))
>>> print(tensordict)
TensorDict(
    fields={
        a: Tensor(shape=torch.Size([10]), device=cpu, dtype=torch.float32, is_shared=False),
        b: Tensor(shape=torch.Size([10, 10]), device=cpu, dtype=torch.float32, is_shared=False)},
    batch_size=torch.Size([10]),
    device=cpu,
    is_shared=False)
>>> tensordict.cuda()
>>> print(tensordict)
TensorDict(
    fields={
        a: Tensor(shape=torch.Size([10]), device=cuda:0, dtype=torch.float32, is_shared=True),
        b: Tensor(shape=torch.Size([10, 10]), device=cuda:0, dtype=torch.float32, is_shared=True)},
    batch_size=torch.Size([10]),
    device=cuda:0,
    is_shared=True)

这TensorDict.devicemethod 需要有效的 device 作为参数传递。如果要从 to allow 值中删除 device 具有不同的设备，则应使用该方法。TensorDictTensorDict.clear_device

>>> tensordict.clear_device()
>>> print(tensordict)
TensorDict(
    fields={
        a: Tensor(shape=torch.Size([10]), device=cuda:0, dtype=torch.float32, is_shared=True),
        b: Tensor(shape=torch.Size([10, 10]), device=cuda:0, dtype=torch.float32, is_shared=True)},
    batch_size=torch.Size([10]),
    device=None,
    is_shared=False)

内存映射张量¶

tensordict提供类MemoryMappedTensor这允许我们将张量的内容存储在磁盘上，同时仍然支持快速索引和批量加载内容。请参阅 ImageNet 教程以获取实际示例。

要将转换为内存映射张量的集合，请使用TensorDictTensorDict.memmap_.

tensordict = TensorDict({"a": torch.rand(10), "b": {"c": torch.rand(10)}}, [10])
tensordict.memmap_()

print(tensordict)

TensorDict(
    fields={
        a: MemoryMappedTensor(shape=torch.Size([10]), device=cpu, dtype=torch.float32, is_shared=False),
        b: TensorDict(
            fields={
                c: MemoryMappedTensor(shape=torch.Size([10]), device=cpu, dtype=torch.float32, is_shared=False)},
            batch_size=torch.Size([10]),
            device=cpu,
            is_shared=False)},
    batch_size=torch.Size([10]),
    device=cpu,
    is_shared=False)

或者，可以使用TensorDict.memmap_like方法。这将新建TensorDict具有相同的结构MemoryMappedTensor值，但它不会复制原始张量的内容添加到内存映射张量。这允许您创建内存映射的TensorDict然后缓慢填充它，因此通常应该是首选。memmap_

tensordict = TensorDict({"a": torch.rand(10), "b": {"c": torch.rand(10)}}, [10])
mm_tensordict = tensordict.memmap_like()

print(mm_tensordict["a"].contiguous())

MemoryMappedTensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])

默认情况下，它的内容将保存到一个临时的位置，但是，如果您想控制它们的保存位置，您可以使用 keyword 参数。TensorDictprefix="/path/to/root"

的内容保存在一个目录结构中，该目录结构模拟本身的结构。张量的内容被保存在 NumPy memmap 中，以及关联的 PyTorch 保存文件中的元数据。例如以上内容保存如下：TensorDictTensorDictTensorDict

├── a.memmap
├── a.meta.pt
├── b
│ ├── c.memmap
│ ├── c.meta.pt
│ └── meta.pt
└── meta.pt

脚本总运行时间：（0 分 0.004 秒）

由 Sphinx-Gallery 生成的图库

使用 TensorDict 简化 PyTorch 内存管理¶

设备¶

内存映射张量¶

文档

教程

资源