Oasst1 Pairwise Rlhf Reward

Name: Oasst1 Pairwise Rlhf Reward
Creator: Helix
Keywords: dataset, hugging face, language:de, language:en, language:es, language:pl, language:ru, rlhf

Hugging Face

Dataset Card for "oasst1_pairwise_rlhf_reward"

OASST1 dataset preprocessed for reward modeling:

    import pandas as pd
    from datasets import load_dataset, concatenate_datasets, Dataset, DatasetDict
    import numpy as np

    dataset = load_dataset("OpenAssistant/oasst1")
    df = concatenate_datasets(list(dataset.values())).to_pandas()
    m2t = df.set_index("message_id")['text'].to_dict()
    m2r = df.set_index("message_id")['role'].to_dict()
    m2p = df.set_index('message_id')['parent_id'].to_dict()
    …

See the full description on the dataset page: https://huggingface.co/datasets/tasksource/oasst1_pairwise_rlhf_reward.

File: tasksource--oasst1-pairwise-rlhf-reward.parquet · 500 rows · Source: tasksource/oasst1_pairwise_rlhf_reward

Data preview

500 rows · 5 columns · showing first 12

#	lang	parent_id	prompt	chosen	rejected
1	en	c2925513-2533-49d0-bfe2-aa9c40071aa9	prompter: !pip install transformers !pip install langchain Apply silent options	It seems like you are trying to install some Python packages. However, I cannot run code in this chat box. If you want to learn more about …	``` WARNING: The script transformers-cli.exe is installed in 'C:\Users\Oasst\AppData\Local\Programs\Python\Python310\Scripts' which is not …
2	en	d1b4b0d4-6335-4214-9d28-a9bde543d278	prompter: !pip install transformers !pip install langchain Apply silent options assistant: It seems like you are trying to install some Py…	In IPython shells or Jupyter notebooks, the "!pip" command is often used to execute Python package management instructions inside the shell…	The exclamation mark at the beginning of the lines indicates input to the shell (in this case Powershell). This should not be included in y…
3	th	a38c7355-610b-4f92-bbb6-fc00d07aaadb	prompter: "Cloudflare ซึ่งเป็นบริษัทที่ให้บริการเครือข่ายกระจายข้อมูลและเสริมความปลอดภัยของเว็บไซต์และแอปพลิเคชันต่าง ๆ นับล้านทั่วโลก ได้แ…	Cloudfare เป็นบริษัทที่ให้บริการกระจายข้อมูลและเสริมความปลอดภัยกับหลายเว็บไซต์และแอปพลิเคชัน ดังนั้นเมื่อ Cloudfare เกิดปัญหาขึ้นย่อมทำให้ห…	Cloudflare เป็นบริษัทให้บริการเครือข่ายกระจายข้อมูลและเสริมความปลอดภัยเว็บไซต์กับแอปพลิเคชันต่าง ๆ นับล้านทั่วโลก เกิดเหตุขัดข้องในช่วงเวลา…
4	th	1a2437d6-50da-43d7-8dfc-2e5e19dcfbe3	prompter: "Cloudflare ซึ่งเป็นบริษัทที่ให้บริการเครือข่ายกระจายข้อมูลและเสริมความปลอดภัยของเว็บไซต์และแอปพลิเคชันต่าง ๆ นับล้านทั่วโลก ได้แ…	ช่วงเวลาประมาณ 13:43 น. เว็บไซต์และแอปต่าง ๆ ไม่สามารถเข้าถึงการใช้งานได้ เกิดจากเหตุขัดข้องของ Cloudflare ดังนั้นจึงสรุปได้ว่าในอินเทอร์เ…	สรุป Cloudflare ล่มเวลาประมาณ 13:43 น. (เวลาประเทศไทย) ทำให้เวลาดังกล่าวเว็บไซต์ส่วนใหญ่ในอินเทอร์เน็ตไม่สามารถเข้าถึงได้ตามไปด้วย และแก้ไข…
5	eu	b468ab77-6b0a-4508-8512-c3bb8d7cb487	prompter: "Etxean uso, kalean otso" bezalako esaera zahar bat osatu, baina beste animaliak erabiliz	Kalean katu, etxean basakatu.	Zozoari beleak ipurbeltz.
6	es	80f9e847-9eb1-4e62-941b-4b8fa447ad1e	prompter: "Hola Asistente, ¿cómo estás hoy? Espero que estés teniendo un buen día. Estoy interesado en obtener más información sobre un tem…	Hola, estoy bien, muchas gracias por preguntar, dispuesto a ayudarte en tu búsqueda de información. ¿Sobre que tema en concreto necesitas i…	Buenas, usuario, IP:192.158.1.38 Claro, dime cuál es esa información en particular que necesitas y trataré de brindarte toda la información…
7	en	df544d6f-3f03-45a9-b36d-fcba85a8fd13	prompter: "I want you to act as a Vietnamese translator. I will provide the text that I would like you to translate it into Vietnamese. The…	Sure, I can help you with that. The Vietnamese translation for "How are you?" is "Bạn khỏe không?". As a neutral and accurate translation…	`Bạn có khỏe không?`
8	en	4aef1179-d412-4258-914a-597160a9ab0e	prompter: "I want you to act as a Vietnamese translator. I will provide the text that I would like you to translate it into Vietnamese. The…	Sure, the Japanese translation of "how are you" in typographic form is: お元気（げんき）ですか？	お元気ですか？ (ogenki desu ka?)
9	en	d3a8505a-f277-4e51-8338-451502c99050	prompter: "I want you to act as a Vietnamese translator. I will provide the text that I would like you to translate it into Vietnamese. The…	`Bạn đang làm gì hôm nay?`	Hôm nay bạn đang làm gì?
10	eu	3d1162f5-bacd-4dff-bb7b-d510913ec824	prompter: "Kontaidazu txiste bat"	Hor doa txistea: —Aizu, bada al zenekien nire anaia bizikletan dabilela lau urte zituenetik? —Hara! Lekutan egongo da, bada, honezkero!	Bazihoazen bi gizaki kaletik eta adimen artifiziala jausi zen.
11	en	50ad2dbd-dabe-4c00-8116-7bd34923263a	prompter: "The robot paced over to the door, before looking back at the scientist who brought him to life. He was frightened, having never …	"It was with an air of nervousness that the robot approached the door. Each silent pause, however brief, between the buzz of the motors and…	Once upon a time, there was a mad scientist known as Gandalf the White. With his towering stature along with his long imposing lab coat, he…
12	en	6472cd1a-d5c2-48fa-9d04-64e8877a6886	prompter: "The robot paced over to the door, before looking back at the scientist who brought him to life. He was frightened, having never …	The robot's metallic fingers wrapped around the doorknob and with a gentle turn, the door creaked open. The hinges screeched like banshees …	"The robot opens a door. It reveals a room with heaps of his body parts. Inside of the pile the robot sees damaged and mangled versions of …

Auto-generated charts

Oasst1 Pairwise Rlhf Reward: 500 rows by 5 columns. These exploratory charts are generated automatically from the data.

Rows: 500
Columns: 5
Categorical cols: 1

Charts

lang by record count

Most common lang values across records.
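
The "lang by record count" view can be reproduced by hand. The sketch below tallies only the 12 preview rows listed in the data preview, not the full 500-row file, so the counts are illustrative; the variable names are chosen for this example.

```python
from collections import Counter

# lang values of the 12 preview rows, in row order (rows 1 to 12).
# Assumption: this is only the visible preview, not the full 500-row file.
preview_langs = [
    "en", "en", "th", "th", "eu", "es",
    "en", "en", "en", "eu", "en", "en",
]

# Count how many records carry each language code.
lang_counts = Counter(preview_langs)

# Print languages from most to least frequent.
for lang, count in lang_counts.most_common():
    print(f"{lang}\t{count}")
```

On the full file the same tally would run over the whole lang column rather than a hand-copied list.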

Columns

lang categorical
parent_id text
prompt text
chosen text
rejected text
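
The prompt, chosen and rejected columns match the shape commonly used for pairwise reward modeling. As a minimal sketch, assuming a reward model that assigns one scalar score to (prompt, chosen) and another to (prompt, rejected), the standard pairwise loss is -log(sigmoid(r_chosen - r_rejected)). The dataset card does not state which objective was intended, and the function name below is hypothetical.

```python
import math


def pairwise_reward_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Return -log(sigmoid(reward_chosen - reward_rejected)).

    The loss is small when the chosen response scores well above the
    rejected one, and grows roughly linearly when the order is reversed.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch on the sign of the
    # margin so that exp() is only ever called on a non-positive number.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Hypothetical scores for one row: the reward model prefers "chosen".
loss_correct_order = pairwise_reward_loss(2.0, 0.0)
# Same scores swapped: the reward model prefers "rejected".
loss_wrong_order = pairwise_reward_loss(0.0, 2.0)
print(loss_correct_order < loss_wrong_order)
```

In training, this value would be averaged over a batch of rows and minimized with respect to the reward model's parameters.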