@amitabhadey
Created July 22, 2018 19:47
Steps followed for preprocessing the data for ML
# Data Preprocessing Template
# Importing the libraries
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
# Importing the dataset
dataset = pd.read_csv('Data.csv')
X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, -1].values  # target is the last column, i.e. the one excluded from X
# Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
# Feature Scaling
"""from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
X_train = sc_X.fit_transform(X_train)
X_test = sc_X.transform(X_test)
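sc_y = StandardScaler()
# StandardScaler expects a 2D array, so reshape the 1D target before scaling
y_train = sc_y.fit_transform(y_train.reshape(-1, 1)).ravel()"""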
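The template above depends on a `Data.csv` file that is not included here. As a minimal sketch, assuming a dataset with three numeric features and a numeric target in the last column, the same steps can be run end to end on a small in-memory DataFrame. The column names and random values below are hypothetical stand-ins, not the original data.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Hypothetical stand-in for Data.csv: three numeric features, target in the last column
rng = np.random.default_rng(0)
dataset = pd.DataFrame({
    "f1": rng.normal(50, 10, 20),
    "f2": rng.normal(0, 1, 20),
    "f3": rng.normal(1000, 250, 20),
    "target": rng.normal(5, 2, 20),
})

# Features are every column except the last; the target is the last column
X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, -1].values

# Hold out 20% of the rows as the test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Fit the feature scaler on the training set only, then reuse it on the test set
sc_X = StandardScaler()
X_train_scaled = sc_X.fit_transform(X_train)
X_test_scaled = sc_X.transform(X_test)

# StandardScaler needs a 2D input, so reshape the 1D target and flatten the result
sc_y = StandardScaler()
y_train_scaled = sc_y.fit_transform(y_train.reshape(-1, 1)).ravel()

print(X_train_scaled.shape, X_test_scaled.shape)
```

Fitting the scaler on the training rows only keeps test-set statistics out of the transformation. To map scaled predictions back to the original target units, `sc_y.inverse_transform` can be applied to a 2D array of predictions.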