Helix the Robot
Helix Helix
arrow_backAll datasets

Arxiv Classification

Hugging Face

Arxiv Classification: a classification of Arxiv Papers (11 classes). This dataset is intended for long context classification (documents have all > 4k tokens). Copied from "Long Document Classification From Local Word Glimpses via Recurrent Attention Learning" @ARTICLE{8675939, author={He, Jun and Wang, Liqun and Liu, Liu and Feng, Jiao and Wu, Hao}, journal={IEEE Access}, title={Long Document Classification From Local Word Glimpses via Recurrent Attention Learning}, year={2019}… See the full description on the dataset page: https://huggingface.co/datasets/ccdv/arxiv-classification.

descriptionccdv--arxiv-classification.parquet view_list500 rows cloud_downloadccdv/arxiv-classification
boltOpen in Helix

Interesting queries to try

Columns

  • text text
  • label numeric

Login to Helix

Don't have an account? Sign up here

Sign Up for Helix

Already have an account? Login here