{ "cells": [ { "cell_type": "markdown", "id": "b33a2141-9cee-4595-811a-cb25e36c56c8", "metadata": {}, "source": [ "# Text Feature Selection" ] }, { "cell_type": "markdown", "id": "1f871ae0-1a1b-4927-93a4-16556e69dcc6", "metadata": {}, "source": [ "As discussed [here](featureselection.ipynb), we must perform feature selection on the text features first, because their massive file size (10 GB) causes a `MemoryError`. Since we cannot use the `fraudulent` column yet, we will use column means to select features, based on several assumptions. Once we have a smaller version of `text_features_train`, we will combine it with the `fraudulent` column and perform supervised feature selection using the chi-square statistic." ] }, { "cell_type": "markdown", "id": "4f9f3725-c151-43ea-af1b-e4069d7d393a", "metadata": {}, "source": [ "### Feature Selection Using Column Mean" ] }, { "cell_type": "code", "execution_count": 1, "id": "d5643cb1-502b-484d-a009-1ef6acdb048b", "metadata": { "tags": [ "hide-output" ] }, "outputs": [], "source": [ "import pandas as pd \n", "import joblib\n", "text_features_train = joblib.load('./data/text_features_train_jlib')" ] }, { "cell_type": "code", "execution_count": 2, "id": "ef0d544b-1234-454d-940a-5fd177ea517a", "metadata": { "tags": [] }, "outputs": [ { "data": { "text/html": [ "
| | aa_desc | aaa_desc | aaab_desc | aab_desc | aabc_desc | aabd_desc | aabf_desc | aac_desc | aaccd_desc | aachen_desc | ... | zodat_benefits | zollman_benefits | zombi_benefits | zone_benefits | zoo_benefits | zowel_benefits | zu_benefits | zult_benefits | zutrifft_benefits | zweig_benefits |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.165596 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 2 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 3 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 4 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |

5 rows × 89527 columns

| | abil_desc | abl_desc | accept_desc | access_desc | accord_desc | account_desc | accur_desc | achiev_desc | acquisit_desc | across_desc | ... | without_desc | word_desc | work_desc | world_desc | would_desc | write_desc | written_desc | year_desc | york_desc | young_desc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.043316 | 0.04441 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.060186 | 0.0 | 0.021943 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.038188 | 0.0 | 0.0 |
| 1 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 |
| 2 | 0.050161 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.000000 | 0.0 | 0.025411 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.044223 | 0.0 | 0.0 |
| 3 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.052053 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 |
| 4 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.036083 | ... | 0.000000 | 0.0 | 0.000000 | 0.069462 | 0.0 | 0.0 | 0.0 | 0.092432 | 0.0 | 0.0 |

5 rows × 812 columns

| | abil_req | abl_req | account_req | across_req | activ_req | adapt_req | addit_req | administr_req | adob_req | advanc_req | ... | willing_req | window_req | within_req | without_req | word_req | work_req | would_req | write_req | written_req | year_req |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.081650 | 0.105596 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.000000 | 0.000000 | 0.000000 | 0.0 | 0.064795 | 0.0 | 0.0 | 0.000000 | 0.068928 |
| 1 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.000000 | 0.061549 | 0.151344 | 0.0 | 0.063024 | 0.0 | 0.0 | 0.000000 | 0.033522 |
| 2 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.000000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.000000 |
| 3 | 0.047567 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.087269 | 0.000000 | 0.000000 | 0.0 | 0.075495 | 0.0 | 0.0 | 0.054646 | 0.040155 |
| 4 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.000000 | 0.000000 | 0.000000 | 0.0 | 0.072623 | 0.0 | 0.0 | 0.000000 | 0.038628 |

5 rows × 350 columns

| | abroad_title | account_title | admin_title | administr_title | agent_title | analyst_title | android_title | applic_title | apprenticeship_title | architect_title | ... | system_title | teacher_title | team_title | technic_title | technician_title | time_title | ui_title | ux_title | web_title | year_title |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.440388 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |

5 rows × 86 columns

| | advanc_benefits | also_benefits | appli_benefits | applic_benefits | avail_benefits | base_benefits | benefit_benefits | best_benefits | bonu_benefits | bonus_benefits | ... | us_benefits | vacat_benefits | vision_benefits | want_benefits | week_benefits | well_benefits | within_benefits | work_benefits | world_benefits | year_benefits |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.074620 | 0.000000 | 0.0 | 0.142779 | ... | 0.0 | 0.204903 | 0.000000 | 0.0 | 0.000000 | 0.348124 | 0.0 | 0.0 | 0.0 | 0.112979 |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | ... | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 |
| 2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | ... | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 |
| 3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.109283 | 0.000000 | 0.0 | 0.000000 | ... | 0.0 | 0.150045 | 0.150367 | 0.0 | 0.183764 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.165463 |
| 4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.042046 | 0.131633 | 0.0 | 0.000000 | ... | 0.0 | 0.057729 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.063661 |

5 rows × 143 columns

| | abil_desc | abl_desc | accept_desc | access_desc | accord_desc | account_desc | accur_desc | achiev_desc | acquisit_desc | across_desc | ... | us_benefits | vacat_benefits | vision_benefits | want_benefits | week_benefits | well_benefits | within_benefits | work_benefits | world_benefits | year_benefits |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.043316 | 0.04441 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.204903 | 0.000000 | 0.0 | 0.000000 | 0.348124 | 0.0 | 0.000000 | 0.0 | 0.112979 |
| 1 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.000000 |
| 2 | 0.050161 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.000000 |
| 3 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.052053 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.150045 | 0.150367 | 0.0 | 0.183764 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.165463 |
| 4 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.036083 | ... | 0.00000 | 0.057729 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.063661 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 14299 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.230658 | 0.0 | 0.000000 |
| 14300 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.09475 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.000000 |
| 14301 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.000000 |
| 14302 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.000000 |
| 14303 | 0.000000 | 0.00000 | 0.0 | 0.0 | 0.0 | 0.077250 | 0.0 | 0.0 | 0.0 | 0.000000 | ... | 0.00000 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.0 | 0.000000 |

14304 rows × 1391 columns

| | administr_desc | answer_desc | asia_desc | assist_desc | bill_desc | call_desc | cash_desc | desir_desc | duti_desc | earn_desc | ... | life_benefits | need_benefits | per_benefits | posit_benefits | prospect_benefits | see_benefits | share_benefits | skill_benefits | start_benefits | train_benefits |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.092456 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.103912 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1 | 0.045662 | 0.0 | 0.0 | 0.034465 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 2 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 3 | 0.000000 | 0.0 | 0.0 | 0.047975 | 0.0 | 0.000000 | 0.085044 | 0.0 | 0.051481 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 4 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.053905 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 14299 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 14300 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 14301 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 14302 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.120147 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 14303 | 0.000000 | 0.0 | 0.0 | 0.000000 | 0.0 | 0.000000 | 0.000000 | 0.0 | 0.000000 | 0.000000 | ... | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |

14304 rows × 100 columns
\n", "