Helix the Robot
Helix Helix
arrow_backAll datasets

Oasst1 Pairwise Rlhf Reward

Hugging Face

Dataset Card for "oasst1_pairwise_rlhf_reward" OASST1 dataset preprocessed for reward modeling: import pandas as pd from datasets import load_dataset,concatenate_datasets, Dataset, DatasetDict import numpy as np dataset = load_dataset("OpenAssistant/oasst1") df=concatenate_datasets(list(dataset.values())).to_pandas() m2t=df.set_index("message_id")['text'].to_dict() m2r=df.set_index("message_id")['role'].to_dict() m2p=df.set_index('message_id')['parent_id'].to_dict()… See the full description on the dataset page: https://huggingface.co/datasets/tasksource/oasst1_pairwise_rlhf_reward.

descriptiontasksource--oasst1-pairwise-rlhf-reward.parquet view_list500 rows cloud_downloadtasksource/oasst1_pairwise_rlhf_reward
boltOpen in Helix

Interesting queries to try

Columns

  • lang categorical
  • parent_id text
  • prompt text
  • chosen text
  • rejected text

Login to Helix

Don't have an account? Sign up here

Sign Up for Helix

Already have an account? Login here