{ "cells": [ { "cell_type": "markdown", "id": "eb9109f0-7eeb-47b7-95ef-8a4905a079c5", "metadata": {}, "source": [ "# Summary \n", "\n", "This is a summary of how we will preprocess each column in the dataset. You can find the complete code for this preprocessing [here](Pipeline.ipynb). " ] }, { "cell_type": "code", "execution_count": 4, "id": "34a1e503-7b9f-4827-9b4d-8137801babc1", "metadata": { "tags": [ "hide-output" ] }, "outputs": [], "source": [ "import pandas as pd " ] }, { "cell_type": "code", "execution_count": 5, "id": "a883baed-f347-41b0-8fe7-0c214a609647", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
| | Unnamed: 0.1 | Unnamed: 0 | job_id | title | location | department | salary_range | company_profile | description | requirements | benefits | telecommuting | has_company_logo | has_questions | employment_type | required_experience | required_education | industry | function | fraudulent |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 7530 | 7531 | Contact Center Representatives | US, VA, Virginia Beach | NaN | NaN | Tidewater Finance Co. was established in 1992 ... | tidewat financ compani locat virginia beach va... | The position requires the following qualificat... | Our company offers a competitive salary plus B... | 0 | 1 | 0 | Full-time | Entry level | Unspecified | Financial Services | Customer Service | 0 |
| 1 | 1 | 129 | 130 | Customer Service Associate | US, TX, Dallas | NaN | NaN | Novitex Enterprise Solutions, formerly Pitney ... | custom servic associ base dalla tx right candi... | QualificationsMinimum of 1 year customer servi... | NaN | 0 | 1 | 0 | Full-time | Entry level | High School or equivalent | Telecommunications | Customer Service | 0 |
| 2 | 2 | 4640 | 4641 | Automated Test Analyst | NZ, , Auckland | Permanent | NaN | SilverStripe CMS & Framework is an open so... | look dedic passion softwar test analyst team p... | NaN | NaN | 0 | 1 | 1 | Full-time | Mid-Senior level | NaN | Information Technology and Services | NaN | 0 |
| 3 | 3 | 402 | 403 | Inside Sales Professional-Omaha | US, NE, Omaha | NaN | NaN | ABC Supply Co., Inc. is the nation’s largest w... | sale repres provid assist custom purchas mater... | As a Sales Representative, you must have the a... | Your benefits package as a Sales Representativ... | 0 | 1 | 0 | Full-time | NaN | NaN | Building Materials | Sales | 0 |
| 4 | 4 | 13218 | 13219 | Content Marketing/SEO Manager | US, CA, Los Angeles | Marketing | NaN | MeUndies is a lifestyle brand that is transfor... | meundi lifestyl brand transform way peopl perc... | REQUIREMENTS/QUALIFICATIONS/PERSONAL ATTRIBUTE... | WHY MEUNDIES?We're a fast-growing, VC-backed c... | 0 | 1 | 0 | Full-time | Mid-Senior level | Bachelor's Degree | Internet | Marketing | 0 |