Helix the Robot
Helix Helix
arrow_backAll datasets

Dbpedia 14

Hugging Face

Dataset Card for DBpedia14 Dataset Summary The DBpedia ontology classification dataset is constructed by picking 14 non-overlapping classes from DBpedia 2014. They are listed in classes.txt. From each of thse 14 ontology classes, we randomly choose 40,000 training samples and 5,000 testing samples. Therefore, the total size of the training dataset is 560,000 and testing dataset 70,000. There are 3 columns in the dataset (same for train and test splits), corresponding to… See the full description on the dataset page: https://huggingface.co/datasets/fancyzhx/dbpedia_14.

descriptiondbpedia-14.parquet view_list500 rows cloud_downloaddbpedia_14
boltOpen in Helix

Interesting queries to try

Columns

  • label numeric
  • title text
  • content text

Login to Helix

Don't have an account? Sign up here

Sign Up for Helix

Already have an account? Login here