Skip to content

Instantly share code, notes, and snippets.

@mepsrajput
Last active September 13, 2020 08:30
Show Gist options
  • Save mepsrajput/e542eeafc67715a3821ae0fdcb9c609b to your computer and use it in GitHub Desktop.
Save mepsrajput/e542eeafc67715a3821ae0fdcb9c609b to your computer and use it in GitHub Desktop.
Data Pre-processing Master

Data Preprocessing

In any ML process, Data Preprocessing is that step in which the data gets transformed, or Encoded, to bring it to such a state that now the machine can easily parse it.

Feature

A feature is an individual measurable property or characteristic of a phenomenon being observed. alt text

Types of Features

Categorical: Features whose values are taken from a defined set of values. For instance, days in a week : {Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday} is a category because its value is always taken from this set. Another example could be the Boolean set : {True, False} Numerical: Features whose values are continuous or integer-valued. They are represented by numbers and possess most of the properties of numbers. For instance, number of steps you walk in a day, or the speed at which you are driving your car at.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment