Helix the Robot
Helix
arrow_backAll datasets

Breast Cancer Wisconsin

Hugging Face

Breast Cancer Wisconsin Diagnostic Dataset Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.

descriptionscikit-learn--breast-cancer-wisconsin.parquet view_list500 rows cloud_downloadscikit-learn/breast-cancer-wisconsin
boltOpen in Helix

Ask a question about this data

Type any question in plain English — Helix builds the chart with AI. Sign in to run it and save your charts.

auto_awesome

Data preview

500 rows · 32 columns · showing first 12
# id integer diagnosis text radius_mean float texture_mean float perimeter_mean float area_mean float smoothness_mean float compactness_mean float concavity_mean float concave points_mean float symmetry_mean float fractal_dimension_mean float radius_se float texture_se float perimeter_se float area_se float smoothness_se float compactness_se float concavity_se float concave points_se float symmetry_se float fractal_dimension_se float radius_worst float texture_worst float perimeter_worst float area_worst float smoothness_worst float compactness_worst float concavity_worst float concave points_worst float symmetry_worst float fractal_dimension_worst float
1 842302M17.9910.38122.810010.11840.27760.30010.14710.24190.078711.0950.90538.589153.40.0063990.049040.053730.015870.030030.00619325.3817.33184.620190.16220.66560.71190.26540.46010.1189
2 842517M20.5717.77132.913260.084740.078640.08690.070170.18120.056670.54350.73393.39874.080.0052250.013080.01860.01340.013890.00353224.9923.41158.819560.12380.18660.24160.1860.2750.08902
3 84300903M19.6921.2513012030.10960.15990.19740.12790.20690.059990.74560.78694.58594.030.006150.040060.038320.020580.02250.00457123.5725.53152.517090.14440.42450.45040.2430.36130.08758
4 84348301M11.4220.3877.58386.10.14250.28390.24140.10520.25970.097440.49561.1563.44527.230.009110.074580.056610.018670.059630.00920814.9126.598.87567.70.20980.86630.68690.25750.66380.173
5 84358402M20.2914.34135.112970.10030.13280.1980.10430.18090.058830.75720.78135.43894.440.011490.024610.056880.018850.017560.00511522.5416.67152.215750.13740.2050.40.16250.23640.07678
6 843786M12.4515.782.57477.10.12780.170.15780.080890.20870.076130.33450.89022.21727.190.007510.033450.036720.011370.021650.00508215.4723.75103.4741.60.17910.52490.53550.17410.39850.1244
7 844359M18.2519.98119.610400.094630.1090.11270.0740.17940.057420.44670.77323.1853.910.0043140.013820.022540.010390.013690.00217922.8827.66153.216060.14420.25760.37840.19320.30630.08368
8 84458202M13.7120.8390.2577.90.11890.16450.093660.059850.21960.074510.58351.3773.85650.960.0088050.030290.024880.014480.014860.00541217.0628.14110.68970.16540.36820.26780.15560.31960.1151
9 844981M1321.8287.5519.80.12730.19320.18590.093530.2350.073890.30631.0022.40624.320.0057310.035020.035530.012260.021430.00374915.4930.73106.2739.30.17030.54010.5390.2060.43780.1072
10 84501001M12.4624.0483.97475.90.11860.23960.22730.085430.2030.082430.29761.5992.03923.940.0071490.072170.077430.014320.017890.0100815.0940.6897.65711.40.18531.0581.1050.2210.43660.2075
11 845636M16.0223.24102.7797.80.082060.066690.032990.033230.15280.056970.37951.1872.46640.510.0040290.0092690.011010.0075910.01460.00304219.1933.88123.811500.11810.15510.14590.099750.29480.08452
12 84610002M15.7817.89103.67810.09710.12920.099540.066060.18420.060820.50580.98493.56454.160.0057710.040610.027910.012820.020080.00414420.4227.28136.512990.13960.56090.39650.1810.37920.1048

Auto-generated charts

Breast Cancer Wisconsin: 500 rows by 32 columns. These exploratory charts are generated automatically from the data - open the dataset in Helix to ask your own questions.

Rows500
Columns32
Numeric cols30
Categorical cols1

Charts

Total radius_mean by diagnosis

Top diagnosis values ranked by summed radius_mean.

radius_mean vs texture_mean

radius_mean vs texture_mean, coloured by diagnosis.

Distribution of radius_mean

Histogram of radius_mean values.

Correlation of numeric columns

Pearson correlation between numeric columns.

radius_mean by diagnosis

Spread of radius_mean across diagnosis groups.

Interesting queries to try

Columns

  • id numeric
  • diagnosis categorical
  • radius_mean numeric
  • texture_mean numeric
  • perimeter_mean numeric
  • area_mean numeric
  • smoothness_mean numeric
  • compactness_mean numeric
  • concavity_mean numeric
  • concave points_mean numeric
  • symmetry_mean numeric
  • fractal_dimension_mean numeric
  • radius_se numeric
  • texture_se numeric
  • perimeter_se numeric
  • area_se numeric
  • smoothness_se numeric
  • compactness_se numeric
  • concavity_se numeric
  • concave points_se numeric
  • symmetry_se numeric
  • fractal_dimension_se numeric
  • radius_worst numeric
  • texture_worst numeric
  • perimeter_worst numeric
  • area_worst numeric
  • smoothness_worst numeric
  • compactness_worst numeric
  • concavity_worst numeric
  • concave points_worst numeric
  • symmetry_worst numeric
  • fractal_dimension_worst numeric
  • Unnamed: 32 unknown

Login to Helix

Don't have an account? Sign up here

Sign Up for Helix

Already have an account? Login here