Spambase
Source: Hugging Face

The Spambase dataset from the UCI ML repository. Is the given mail spam?

Configurations and tasks

| Configuration | Task | Description |
|---|---|---|
| spambase | Binary classification | Is the mail spam? |

Usage

    from datasets import load_dataset

    dataset = load_dataset("mstz/spambase")["train"]
Data preview
500 rows · 58 columns · showing first 12

| # | word_freq_make float | word_freq_address float | word_freq_all float | word_freq_3d float | word_freq_our float | word_freq_over float | word_freq_remove float | word_freq_internet float | word_freq_order float | word_freq_mail float | word_freq_receive float | word_freq_will float | word_freq_people float | word_freq_report float | word_freq_addresses float | word_freq_free float | word_freq_business float | word_freq_email float | word_freq_you float | word_freq_credit float | word_freq_your float | word_freq_font float | word_freq_000 float | word_freq_money float | word_freq_hp float | word_freq_hpl float | word_freq_george float | word_freq_650 float | word_freq_lab float | word_freq_labs float | word_freq_telnet float | word_freq_857 float | word_freq_data float | word_freq_415 float | word_freq_85 float | word_freq_technology float | word_freq_1999 float | word_freq_parts float | word_freq_pm float | word_freq_direct float | word_freq_cs float | word_freq_meeting float | word_freq_original float | word_freq_project float | word_freq_re float | word_freq_edu float | word_freq_table float | word_freq_conference float | char_freq_; float | char_freq_( float | char_freq_[ float | char_freq_! float | char_freq_$ float | char_freq_# float | capital_run_length_average float | capital_run_length_longest float | capital_run_length_total float | is_spam integer |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0 | 0.64 | 0.64 | 0 | 0.32 | 0 | 0 | 0 | 0 | 0 | 0 | 0.64 | 0 | 0 | 0 | 0.32 | 0 | 1.29 | 1.93 | 0 | 0.96 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.778 | 0 | 0 | 3.756 | 61 | 278 | 1 |
| 2 | 0.21 | 0.28 | 0.5 | 0 | 0.14 | 0.28 | 0.21 | 0.07 | 0 | 0.94 | 0.21 | 0.79 | 0.65 | 0.21 | 0.14 | 0.14 | 0.07 | 0.28 | 3.47 | 0 | 1.59 | 0 | 0.43 | 0.43 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.07 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.132 | 0 | 0.372 | 0.18 | 0.048 | 5.114 | 101 | 1028 | 1 |
| 3 | 0.06 | 0 | 0.71 | 0 | 1.23 | 0.19 | 0.19 | 0.12 | 0.64 | 0.25 | 0.38 | 0.45 | 0.12 | 0 | 1.75 | 0.06 | 0.06 | 1.03 | 1.36 | 0.32 | 0.51 | 0 | 1.16 | 0.06 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.06 | 0 | 0 | 0.12 | 0 | 0.06 | 0.06 | 0 | 0 | 0.01 | 0.143 | 0 | 0.276 | 0.184 | 0.01 | 9.821 | 485 | 2259 | 1 |
| 4 | 0 | 0 | 0 | 0 | 0.63 | 0 | 0.31 | 0.63 | 0.31 | 0.63 | 0.31 | 0.31 | 0.31 | 0 | 0 | 0.31 | 0 | 0 | 3.18 | 0 | 0.31 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.137 | 0 | 0.137 | 0 | 0 | 3.537 | 40 | 191 | 1 |
| 5 | 0 | 0 | 0 | 0 | 0.63 | 0 | 0.31 | 0.63 | 0.31 | 0.63 | 0.31 | 0.31 | 0.31 | 0 | 0 | 0.31 | 0 | 0 | 3.18 | 0 | 0.31 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.135 | 0 | 0.135 | 0 | 0 | 3.537 | 40 | 191 | 1 |
| 6 | 0 | 0 | 0 | 0 | 1.85 | 0 | 0 | 1.85 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.223 | 0 | 0 | 0 | 0 | 3 | 15 | 54 | 1 |
| 7 | 0 | 0 | 0 | 0 | 1.92 | 0 | 0 | 0 | 0 | 0.64 | 0.96 | 1.28 | 0 | 0 | 0 | 0.96 | 0 | 0.32 | 3.85 | 0 | 0.64 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.054 | 0 | 0.164 | 0.054 | 0 | 1.671 | 4 | 112 | 1 |
| 8 | 0 | 0 | 0 | 0 | 1.88 | 0 | 0 | 1.88 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.206 | 0 | 0 | 0 | 0 | 2.45 | 11 | 49 | 1 |
| 9 | 0.15 | 0 | 0.46 | 0 | 0.61 | 0 | 0.3 | 0 | 0.92 | 0.76 | 0.76 | 0.92 | 0 | 0 | 0 | 0 | 0 | 0.15 | 1.23 | 3.53 | 2 | 0 | 0 | 0.15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.3 | 0 | 0 | 0 | 0 | 0 | 0 | 0.271 | 0 | 0.181 | 0.203 | 0.022 | 9.744 | 445 | 1257 | 1 |
| 10 | 0.06 | 0.12 | 0.77 | 0 | 0.19 | 0.32 | 0.38 | 0 | 0.06 | 0 | 0 | 0.64 | 0.25 | 0 | 0.12 | 0 | 0 | 0.12 | 1.67 | 0.06 | 0.71 | 0 | 0.19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.06 | 0 | 0 | 0 | 0 | 0.04 | 0.03 | 0 | 0.244 | 0.081 | 0 | 1.729 | 43 | 749 | 1 |
| 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0.96 | 0 | 0 | 1.92 | 0.96 | 0 | 0 | 0 | 0 | 0 | 0 | 0.96 | 3.84 | 0 | 0.96 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.96 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.462 | 0 | 0 | 1.312 | 6 | 21 | 1 |
| 12 | 0 | 0 | 0.25 | 0 | 0.38 | 0.25 | 0.25 | 0 | 0 | 0 | 0.12 | 0.12 | 0.12 | 0 | 0 | 0 | 0 | 0 | 1.16 | 0 | 0.77 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.022 | 0.044 | 0 | 0.663 | 0 | 0 | 1.243 | 11 | 184 | 1 |
Auto-generated charts
Spambase: 500 rows by 58 columns. These exploratory charts are generated automatically from the data - open the dataset in Helix to ask your own questions.
Rows: 500
Columns: 58
Numeric cols: 58
Charts
- word_freq_make vs word_freq_address: relationship between word_freq_make and word_freq_address.
- Distribution of word_freq_make: histogram of word_freq_make values.
- Correlation of numeric columns: Pearson correlation between numeric columns.
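
For readers who want to reproduce the correlation chart by hand, the following is a minimal sketch of the Pearson coefficient for one pair of columns. It uses only the 12 preview rows of word_freq_make and word_freq_address shown in the data preview, so the value it prints describes that small sample, not the full 500 rows. The function name `pearson` is a hypothetical helper, not part of Helix or the `datasets` library.

```python
import math

# Values copied from the first 12 rows of the data preview (not the full dataset).
word_freq_make = [0, 0.21, 0.06, 0, 0, 0, 0, 0, 0.15, 0.06, 0, 0]
word_freq_address = [0.64, 0.28, 0, 0, 0, 0, 0, 0, 0, 0.12, 0, 0]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-deviations and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # the coefficient is undefined for a constant column
    return cov / math.sqrt(var_x * var_y)


r = pearson(word_freq_make, word_freq_address)
print(f"Pearson r on the 12 preview rows: {r:.3f}")
```

Many Spambase columns are mostly zeros, so a column can be constant within a small sample; the sketch returns NaN in that case instead of dividing by zero.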
Columns
- word_freq_make numeric
- word_freq_address numeric
- word_freq_all numeric
- word_freq_3d numeric
- word_freq_our numeric
- word_freq_over numeric
- word_freq_remove numeric
- word_freq_internet numeric
- word_freq_order numeric
- word_freq_mail numeric
- word_freq_receive numeric
- word_freq_will numeric
- word_freq_people numeric
- word_freq_report numeric
- word_freq_addresses numeric
- word_freq_free numeric
- word_freq_business numeric
- word_freq_email numeric
- word_freq_you numeric
- word_freq_credit numeric
- word_freq_your numeric
- word_freq_font numeric
- word_freq_000 numeric
- word_freq_money numeric
- word_freq_hp numeric
- word_freq_hpl numeric
- word_freq_george numeric
- word_freq_650 numeric
- word_freq_lab numeric
- word_freq_labs numeric
- word_freq_telnet numeric
- word_freq_857 numeric
- word_freq_data numeric
- word_freq_415 numeric
- word_freq_85 numeric
- word_freq_technology numeric
- word_freq_1999 numeric
- word_freq_parts numeric
- word_freq_pm numeric
- word_freq_direct numeric
- word_freq_cs numeric
- word_freq_meeting numeric
- word_freq_original numeric
- word_freq_project numeric
- word_freq_re numeric
- word_freq_edu numeric
- word_freq_table numeric
- word_freq_conference numeric
- char_freq_; numeric
- char_freq_( numeric
- char_freq_[ numeric
- char_freq_! numeric
- char_freq_$ numeric
- char_freq_# numeric
- capital_run_length_average numeric
- capital_run_length_longest numeric
- capital_run_length_total numeric
- is_spam numeric
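
Since the task is binary classification on is_spam, the remaining 57 columns act as features. The sketch below shows one way to split dict-like rows into feature vectors and labels. It assumes each row is a mapping from column name to value, which is how rows of a Hugging Face `Dataset` appear when iterated; `split_features_label` is a hypothetical helper, and the two sample rows are abbreviated to three feature columns taken from the data preview.

```python
def split_features_label(rows, label="is_spam"):
    """Split dict-like rows into (feature_names, X, y).

    X is a list of float feature vectors; y is a list of int labels.
    Feature order follows the key order of the first row.
    """
    if not rows:
        return [], [], []
    feature_names = [name for name in rows[0] if name != label]
    X = [[float(row[name]) for name in feature_names] for row in rows]
    y = [int(row[label]) for row in rows]
    return feature_names, X, y


# Abbreviated rows for illustration only; real rows carry all 58 columns.
sample = [
    {"word_freq_make": 0.0, "word_freq_address": 0.64,
     "capital_run_length_total": 278, "is_spam": 1},
    {"word_freq_make": 0.21, "word_freq_address": 0.28,
     "capital_run_length_total": 1028, "is_spam": 1},
]

names, X, y = split_features_label(sample)
print(names)  # feature column names, label excluded
print(X[0], y[0])
```

With the real data, the same function could be applied to `list(dataset)` after the `load_dataset` call from the Usage section, assuming the dataset fits in memory.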