Dataset research report
Lambada research report
A reproducible data report with schema notes, generated chart evidence, suggested follow-up questions, and export-ready Helix queries.
Executive Summary
Dataset Card for LAMBADA Dataset Summary The LAMBADA evaluates the capabilities of computational models for text understanding by means of a word prediction task. LAMBADA is a collection of narrative passages sharing the characteristic that human subjects are able to guess their last word if they are exposed to the whole passage, but not if they only see the last sentence preceding the target word. To succeed on LAMBADA, computational models cannot simply rely on local… See the full description on the dataset page: https://huggingface.co/datasets/cimec/lambada.
Research Context
Lambada: 500 rows by 2 columns. These exploratory charts are generated automatically from the data - open the dataset in Helix to ask your own questions.
Data Profile
Chart Evidence
These views are generated from the dataset profile. Each chart is paired with a Helix query so it can be opened, adjusted, and exported.
Follow-Up Queries
Preview Rows
| # | texttext | domaintext |
|---|---|---|
| 1 | prism story of ci prism -lrb- story of ci -rrb- copyright 2010 rachel moschell published by rachel moschell at smashwords third edition the… | Adventure |
| 2 | skip episode one by perrin briar contents page prologue chapter one chapter two chapter three chapter four chapter five chapter six chapter… | Adventure |
| 3 | amanda martin two-hundred steps home volume five amanda martin was born in hertfordshire in 1976 . after graduating with first class hono… | Adventure |
| 4 | spear bearer book one stephen clary published by stephen clary at smashwords ebook edition copyright 2010 stephen clary discover other titl… | Adventure |
| 5 | the seven by victoria panovska published by victoria panovska at smashwords copyright 2013 victoria panovska smashwords edition , license n… | Adventure |
| 6 | the seventh age of man part 1 : regeneration by kevin gordon copyright 2011 by kevin gordon smashwords edition chapter 1 on every radio , t… | Adventure |
Data Dictionary
- text text
- domain categorical
Method And Limits
- Load the catalog entry and preview rows from the processed dataset file.
- Infer numeric, categorical, time, and location fields from real columns.
- Generate a small set of defensive Plotly chart specifications from that profile.
- Expose each chart idea as a query link so the report can be rerun or exported in Helix.
This report is intentionally reproducible. It uses the local catalog metadata and generated chart specifications rather than claiming external conclusions beyond the dataset.