Preference Datasets¶

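Preference datasets are used for reward modelling, where the downstream task is to fine-tune a base model to capture some underlying human preferences. Currently, these datasets are used in torchtune with the Direct Preference Optimization (DPO) recipe.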

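The ground truth in a preference dataset is usually the outcome of a binary comparison between two completions for the same prompt, where a human annotator has indicated that one completion is preferable to the other according to some pre-set criterion. These prompt-completion pairs can be instruct style (single-turn, optionally with a system prompt), chat style (multi-turn), or some other form of interaction between a user and a model (e.g. free-form text completion).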

The primary entry point for fine-tuning with preference datasets in torchtune with the DPO recipe is preference_dataset().

Example local preference dataset¶

# my_preference_dataset.json
[
    {
        "chosen_conversations": [
            {
                "content": "What do I do when I have a hole in my trousers?",
                "role": "user"
            },
            { "content": "Fix the hole.", "role": "assistant" }
        ],
        "rejected_conversations": [
            {
                "content": "What do I do when I have a hole in my trousers?",
                "role": "user"
            },
            { "content": "Take them off.", "role": "assistant" }
        ]
    }
]

from torchtune.models.mistral import mistral_tokenizer
from torchtune.datasets import preference_dataset

# Build the tokenizer that is applied to both the chosen and rejected messages
m_tokenizer = mistral_tokenizer(
    path="/tmp/Mistral-7B-v0.1/tokenizer.model",
    prompt_template="torchtune.models.mistral.MistralChatTemplate",
    max_seq_len=8192,
)
# Map the expected column names to the column names used in the JSON file
column_map = {
    "chosen": "chosen_conversations",
    "rejected": "rejected_conversations"
}
ds = preference_dataset(
    tokenizer=m_tokenizer,
    source="json",
    column_map=column_map,
    data_files="my_preference_dataset.json",
    train_on_input=False,
    split="train",
)
# The sample outputs in the comments below are illustrative; the exact decoded
# text and token IDs depend on the tokenizer and prompt template in use.
tokenized_dict = ds[0]
print(m_tokenizer.decode(tokenized_dict["rejected_input_ids"]))
# user\n\nWhat do I do when I have a hole in my trousers?assistant\n\nTake them off.
print(tokenized_dict["rejected_labels"])
# [-100,-100,-100,-100,-100,-100,-100,-100,-100,-100,-100,-100, -100,-100,\
# -100,-100,-100,-100,-100,128006,78191,128007,271,18293,1124,1022,13,128009,-100]
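
The `-100` entries in the printed labels mark positions that are excluded from the loss. The following is a minimal sketch of that idea in plain Python, assuming that prompt positions are the ones masked when `train_on_input=False`. The helper `mask_prompt_labels` and the toy token IDs are hypothetical and are not part of torchtune; only the value `-100` is taken from the output above.

```python
IGNORE_INDEX = -100  # the masking value seen in the labels printed above


def mask_prompt_labels(token_ids, is_prompt, train_on_input=False):
    """Hypothetical helper: copy token_ids into labels, masking prompt positions.

    token_ids: list of int token IDs for the full conversation
    is_prompt: list of bool, True where the token belongs to the prompt
    """
    if len(token_ids) != len(is_prompt):
        raise ValueError("token_ids and is_prompt must have the same length")
    if train_on_input:
        # Nothing is masked, so the prompt tokens also contribute to the loss
        return list(token_ids)
    return [IGNORE_INDEX if prompt else tok for tok, prompt in zip(token_ids, is_prompt)]


# Toy IDs (not real tokenizer output): three prompt tokens, two response tokens
toy_ids = [11, 12, 13, 21, 22]
toy_is_prompt = [True, True, True, False, False]
labels = mask_prompt_labels(toy_ids, toy_is_prompt)
print(labels)
```

Only the response positions keep their token IDs, so only they would be scored by a loss that skips `IGNORE_INDEX`.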

The same dataset can also be set up through a yaml config:

# In config
tokenizer:
  _component_: torchtune.models.mistral.mistral_tokenizer
  path: /tmp/Mistral-7B-v0.1/tokenizer.model
  prompt_template: torchtune.models.mistral.MistralChatTemplate
  max_seq_len: 8192

dataset:
  _component_: torchtune.datasets.preference_dataset
  source: json
  data_files: my_preference_dataset.json
  column_map:
    chosen: chosen_conversations
    rejected: rejected_conversations
  train_on_input: False
  split: train

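In this example, we have also shown how column_map can be used when the "chosen" and/or "rejected" column names differ from the corresponding column names in your dataset.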
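
To make the renaming concrete, here is a minimal sketch of what a mapping of this shape does to a single row, assuming the keys are the expected names and the values are the names found in the data, as in the example above. The function `apply_column_map` is a hypothetical illustration and not torchtune's implementation.

```python
def apply_column_map(row, column_map=None):
    """Hypothetical helper: return a dict keyed by the expected column names."""
    # Without a mapping, assume the row already uses the expected names
    column_map = column_map or {"chosen": "chosen", "rejected": "rejected"}
    return {expected: row[actual] for expected, actual in column_map.items()}


# One row shaped like the entries in my_preference_dataset.json
prompt = {"content": "What do I do when I have a hole in my trousers?", "role": "user"}
row = {
    "chosen_conversations": [prompt, {"content": "Fix the hole.", "role": "assistant"}],
    "rejected_conversations": [prompt, {"content": "Take them off.", "role": "assistant"}],
}
sample = apply_column_map(
    row,
    {"chosen": "chosen_conversations", "rejected": "rejected_conversations"},
)
print(sorted(sample))
```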

Preference dataset format¶

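Preference datasets are expected to have two columns: "chosen", which holds the response the human annotator preferred, and "rejected", which holds the response the annotator did not prefer. Each of these columns should contain a list of messages with an identical prompt. The list of messages can include a system prompt, an instruction, multiple turns between user and assistant, or tool calls/returns. Let's take a look at Anthropic's helpful/harmless dataset on Hugging Face as an example of a multi-turn chat-style format: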

| chosen                                | rejected                              |
|---------------------------------------|---------------------------------------|
|[{                                     |[{                                     |
| "role": "user",                       | "role": "user",                       |
| "content": "helping my granny with her| "content": "helping my granny with her|
| mobile phone issue"                   | mobile phone issue"                   |
| },                                    | },                                    |
| {                                     | {                                     |
| "role": "assistant",                  | "role": "assistant",                  |
| "content": "I see you are chatting    | "content": "Well, the best choice here|
| with your grandmother about an issue  | could be helping with so-called 'self-|
| with her mobile phone. How can I      | management behaviors'. These are      |
| help?"                                | things your grandma can do on her own |
| },                                    | to help her feel more in control."    |
| {                                     | }]                                    |
| "role": "user",                       |                                       |
| "content": "her phone is not turning  |                                       |
| on"                                   |                                       |
| },                                    |                                       |
| {...},                                |                                       |
|]                                      |                                       |

Currently, only JSON-format conversations are supported, as shown in the example above. You can use this dataset out-of-the-box in torchtune through hh_rlhf_helpful_dataset().
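
Before pointing a recipe at your own data, it can help to check that each row matches the format described in this section. The sketch below is one possible check in plain Python. The function `check_preference_sample` is hypothetical and not part of torchtune, and it assumes that "the same prompt" means the two message lists start with an identical first message.

```python
def check_preference_sample(sample):
    """Hypothetical check: return a list of problems, empty if none were found."""
    problems = []
    for column in ("chosen", "rejected"):
        messages = sample.get(column)
        if not isinstance(messages, list) or not messages:
            problems.append(f"{column!r} must be a non-empty list of messages")
            continue
        for i, message in enumerate(messages):
            # Every message needs at least a role and its content
            if not isinstance(message, dict) or not {"role", "content"} <= set(message):
                problems.append(f"{column!r} message {i} needs 'role' and 'content'")
    # Assumption: both completions answer the same opening message
    if not problems and sample["chosen"][0] != sample["rejected"][0]:
        problems.append("'chosen' and 'rejected' do not start with the same message")
    return problems


good = {
    "chosen": [
        {"role": "user", "content": "What do I do when I have a hole in my trousers?"},
        {"role": "assistant", "content": "Fix the hole."},
    ],
    "rejected": [
        {"role": "user", "content": "What do I do when I have a hole in my trousers?"},
        {"role": "assistant", "content": "Take them off."},
    ],
}
bad = {"chosen": good["chosen"], "rejected": [{"role": "assistant"}]}
print(check_preference_sample(good))
print(check_preference_sample(bad))
```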

Loading preference datasets from Hugging Face¶

To load a preference dataset from Hugging Face, you need to pass the dataset repo name to source. For most HF datasets, you will also need to specify the split.

from torchtune.models.gemma import gemma_tokenizer
from torchtune.datasets import preference_dataset

g_tokenizer = gemma_tokenizer("/tmp/gemma-7b/tokenizer.model")
ds = preference_dataset(
    tokenizer=g_tokenizer,
    source="hendrydong/preference_700K",
    split="train",
)

# Tokenizer is passed into the dataset in the recipe so we don't need it here
dataset:
  _component_: torchtune.datasets.preference_dataset
  source: hendrydong/preference_700K
  split: train
