@tkhan0
Last active October 21, 2019 23:58
Multiple Linear Regression
{
"nbformat": 4,
"nbformat_minor": 0,
"metadata": {
"_draft": {
"nbviewer_url": "https://gist.github.com/f5aa427cb52d033cf1204d9687d087bd"
},
"gist": {
"data": {
"description": "GRE School.ipynb",
"public": true
},
"id": "f5aa427cb52d033cf1204d9687d087bd"
},
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.7.0"
},
"colab": {
"name": "Multiple Linear Regression",
"provenance": [],
"collapsed_sections": [],
"include_colab_link": true
}
},
"cells": [
{
"cell_type": "markdown",
"metadata": {
"id": "view-in-github",
"colab_type": "text"
},
"source": [
"<a href=\"https://colab.research.google.com/gist/tkhan0/8e4ee93feed1228d0101b23a67e8ee1c/notebook.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "hEUktiS4vmVh",
"colab_type": "text"
},
"source": [
"### 1. Import the data to a DataFrame using Pandas"
]
},
{
"cell_type": "code",
"metadata": {
"id": "gAjjeQNWvmVl",
"colab_type": "code",
"colab": {}
},
"source": [
"import pandas as pd"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "vHCnZiDovmVr",
"colab_type": "code",
"colab": {}
},
"source": [
"df = pd.read_csv('/Admission_Predict_Ver1.2.csv',encoding = 'utf-8')"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "IeO3_B9EvmVu",
"colab_type": "code",
"colab": {},
"outputId": "019769de-01fe-4ffc-87c1-d2cbe15370ee"
},
"source": [
"df.head()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>Serial No.</th>\n",
" <th>GRE Score</th>\n",
" <th>TOEFL Score</th>\n",
" <th>University Rating</th>\n",
" <th>SOP</th>\n",
" <th>LOR</th>\n",
" <th>CGPA</th>\n",
" <th>Research</th>\n",
" <th>Admit</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>1</td>\n",
" <td>337</td>\n",
" <td>118</td>\n",
" <td>4</td>\n",
" <td>4.5</td>\n",
" <td>4.5</td>\n",
" <td>9.65</td>\n",
" <td>1</td>\n",
" <td>0.92</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>2</td>\n",
" <td>324</td>\n",
" <td>107</td>\n",
" <td>4</td>\n",
" <td>4.0</td>\n",
" <td>4.5</td>\n",
" <td>8.87</td>\n",
" <td>1</td>\n",
" <td>0.76</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>3</td>\n",
" <td>316</td>\n",
" <td>104</td>\n",
" <td>3</td>\n",
" <td>3.0</td>\n",
" <td>3.5</td>\n",
" <td>8.00</td>\n",
" <td>1</td>\n",
" <td>0.72</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>4</td>\n",
" <td>322</td>\n",
" <td>110</td>\n",
" <td>3</td>\n",
" <td>3.5</td>\n",
" <td>2.5</td>\n",
" <td>8.67</td>\n",
" <td>1</td>\n",
" <td>0.80</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>5</td>\n",
" <td>314</td>\n",
" <td>103</td>\n",
" <td>2</td>\n",
" <td>2.0</td>\n",
" <td>3.0</td>\n",
" <td>8.21</td>\n",
" <td>0</td>\n",
" <td>0.65</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" Serial No. GRE Score TOEFL Score University Rating SOP LOR CGPA \\\n",
"0 1 337 118 4 4.5 4.5 9.65 \n",
"1 2 324 107 4 4.0 4.5 8.87 \n",
"2 3 316 104 3 3.0 3.5 8.00 \n",
"3 4 322 110 3 3.5 2.5 8.67 \n",
"4 5 314 103 2 2.0 3.0 8.21 \n",
"\n",
" Research Admit \n",
"0 1 0.92 \n",
"1 1 0.76 \n",
"2 1 0.72 \n",
"3 1 0.80 \n",
"4 0 0.65 "
]
},
"metadata": {
"tags": []
},
"execution_count": 5
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "eVHQuxSKvmVy",
"colab_type": "code",
"colab": {},
"outputId": "cde6c237-ba6a-472b-c82c-ddaae4681c26"
},
"source": [
"df.info()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "stream",
"text": [
"<class 'pandas.core.frame.DataFrame'>\n",
"RangeIndex: 500 entries, 0 to 499\n",
"Data columns (total 9 columns):\n",
"Serial No. 500 non-null int64\n",
"GRE Score 500 non-null int64\n",
"TOEFL Score 500 non-null int64\n",
"University Rating 500 non-null int64\n",
"SOP 500 non-null float64\n",
"LOR 500 non-null float64\n",
"CGPA 500 non-null float64\n",
"Research 500 non-null int64\n",
"Admit 500 non-null float64\n",
"dtypes: float64(4), int64(5)\n",
"memory usage: 35.2 KB\n"
],
"name": "stdout"
}
]
},
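{
"cell_type": "markdown",
"metadata": {
"colab_type": "text"
},
"source": [
"A possible sanity check at this point, not part of the original notebook: the outputs further down (`Name: Admit , dtype: float64`) suggest that some column names in this CSV carry trailing spaces. The sketch below lists such columns so that names like `'Admit '` and `'LOR '` can be typed exactly; the variable name `padded_columns` is only illustrative."
]
},
{
"cell_type": "code",
"metadata": {
"colab_type": "code",
"colab": {}
},
"source": [
"# Illustrative check (an addition, not from the original notebook):\n",
"# list column names that have leading or trailing whitespace.\n",
"padded_columns = [col for col in df.columns if col != col.strip()]\n",
"print(padded_columns)"
],
"execution_count": 0,
"outputs": []
},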
{
"cell_type": "markdown",
"metadata": {
"id": "QgMo-0afvmV2",
"colab_type": "text"
},
"source": [
"### 2. It is good practice to shuffle the data to remove any order effects."
]
},
{
"cell_type": "code",
"metadata": {
"id": "pAh8I46BvmV3",
"colab_type": "code",
"colab": {}
},
"source": [
"from sklearn.utils import shuffle"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "Oj54Q6d-vmV8",
"colab_type": "code",
"colab": {}
},
"source": [
"df_shuffled = shuffle(df,random_state = 42)"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "txYrKmfxvmV_",
"colab_type": "code",
"colab": {},
"outputId": "23a631bb-29b7-446e-c3e0-25e0d033bab6"
},
"source": [
"df_shuffled.head()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>Serial No.</th>\n",
" <th>GRE Score</th>\n",
" <th>TOEFL Score</th>\n",
" <th>University Rating</th>\n",
" <th>SOP</th>\n",
" <th>LOR</th>\n",
" <th>CGPA</th>\n",
" <th>Research</th>\n",
" <th>Admit</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>361</th>\n",
" <td>362</td>\n",
" <td>334</td>\n",
" <td>116</td>\n",
" <td>4</td>\n",
" <td>4.0</td>\n",
" <td>3.5</td>\n",
" <td>9.54</td>\n",
" <td>1</td>\n",
" <td>0.93</td>\n",
" </tr>\n",
" <tr>\n",
" <th>73</th>\n",
" <td>74</td>\n",
" <td>314</td>\n",
" <td>108</td>\n",
" <td>4</td>\n",
" <td>4.5</td>\n",
" <td>4.0</td>\n",
" <td>9.04</td>\n",
" <td>1</td>\n",
" <td>0.84</td>\n",
" </tr>\n",
" <tr>\n",
" <th>374</th>\n",
" <td>375</td>\n",
" <td>315</td>\n",
" <td>105</td>\n",
" <td>2</td>\n",
" <td>2.0</td>\n",
" <td>2.5</td>\n",
" <td>7.65</td>\n",
" <td>0</td>\n",
" <td>0.39</td>\n",
" </tr>\n",
" <tr>\n",
" <th>155</th>\n",
" <td>156</td>\n",
" <td>312</td>\n",
" <td>109</td>\n",
" <td>3</td>\n",
" <td>3.0</td>\n",
" <td>3.0</td>\n",
" <td>8.69</td>\n",
" <td>0</td>\n",
" <td>0.77</td>\n",
" </tr>\n",
" <tr>\n",
" <th>104</th>\n",
" <td>105</td>\n",
" <td>326</td>\n",
" <td>112</td>\n",
" <td>3</td>\n",
" <td>3.5</td>\n",
" <td>3.0</td>\n",
" <td>9.05</td>\n",
" <td>1</td>\n",
" <td>0.74</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" Serial No. GRE Score TOEFL Score University Rating SOP LOR CGPA \\\n",
"361 362 334 116 4 4.0 3.5 9.54 \n",
"73 74 314 108 4 4.5 4.0 9.04 \n",
"374 375 315 105 2 2.0 2.5 7.65 \n",
"155 156 312 109 3 3.0 3.0 8.69 \n",
"104 105 326 112 3 3.5 3.0 9.05 \n",
"\n",
" Research Admit \n",
"361 1 0.93 \n",
"73 1 0.84 \n",
"374 0 0.39 \n",
"155 0 0.77 \n",
"104 1 0.74 "
]
},
"metadata": {
"tags": []
},
"execution_count": 9
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "5tde403PvmWC",
"colab_type": "code",
"colab": {}
},
"source": [
"DV = 'Admit '"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "markdown",
"metadata": {
"id": "bQ_sTkwwvmWH",
"colab_type": "text"
},
"source": [
"### 3. Splitting the DataFrame df_shuffled into feature variables (X) and the dependent variable (y)"
]
},
{
"cell_type": "code",
"metadata": {
"id": "PeVf53xvvmWI",
"colab_type": "code",
"colab": {}
},
"source": [
"X = df_shuffled.drop([DV, 'Serial No.'], axis=1)"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "-5SrRxYuvmWM",
"colab_type": "code",
"colab": {},
"outputId": "4a627e24-719b-4adc-e2b3-61fb9c40108b"
},
"source": [
"X.head()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>GRE Score</th>\n",
" <th>TOEFL Score</th>\n",
" <th>University Rating</th>\n",
" <th>SOP</th>\n",
" <th>LOR</th>\n",
" <th>CGPA</th>\n",
" <th>Research</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>361</th>\n",
" <td>334</td>\n",
" <td>116</td>\n",
" <td>4</td>\n",
" <td>4.0</td>\n",
" <td>3.5</td>\n",
" <td>9.54</td>\n",
" <td>1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>73</th>\n",
" <td>314</td>\n",
" <td>108</td>\n",
" <td>4</td>\n",
" <td>4.5</td>\n",
" <td>4.0</td>\n",
" <td>9.04</td>\n",
" <td>1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>374</th>\n",
" <td>315</td>\n",
" <td>105</td>\n",
" <td>2</td>\n",
" <td>2.0</td>\n",
" <td>2.5</td>\n",
" <td>7.65</td>\n",
" <td>0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>155</th>\n",
" <td>312</td>\n",
" <td>109</td>\n",
" <td>3</td>\n",
" <td>3.0</td>\n",
" <td>3.0</td>\n",
" <td>8.69</td>\n",
" <td>0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>104</th>\n",
" <td>326</td>\n",
" <td>112</td>\n",
" <td>3</td>\n",
" <td>3.5</td>\n",
" <td>3.0</td>\n",
" <td>9.05</td>\n",
" <td>1</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" GRE Score TOEFL Score University Rating SOP LOR CGPA Research\n",
"361 334 116 4 4.0 3.5 9.54 1\n",
"73 314 108 4 4.5 4.0 9.04 1\n",
"374 315 105 2 2.0 2.5 7.65 0\n",
"155 312 109 3 3.0 3.0 8.69 0\n",
"104 326 112 3 3.5 3.0 9.05 1"
]
},
"metadata": {
"tags": []
},
"execution_count": 12
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "TfTMlE94vmWP",
"colab_type": "code",
"colab": {}
},
"source": [
"y = df_shuffled[DV]"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "Ng5YUz-ivmWS",
"colab_type": "code",
"colab": {},
"outputId": "05aa3ed6-4028-4827-e14c-76897d37055c"
},
"source": [
"y.head()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"361 0.93\n",
"73 0.84\n",
"374 0.39\n",
"155 0.77\n",
"104 0.74\n",
"Name: Admit , dtype: float64"
]
},
"metadata": {
"tags": []
},
"execution_count": 14
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "r7BMeZEgvmWW",
"colab_type": "text"
},
"source": [
"### 4. Split X and y into training and testing sets"
]
},
{
"cell_type": "code",
"metadata": {
"id": "X5E-NfiGvmWY",
"colab_type": "code",
"colab": {}
},
"source": [
"from sklearn.model_selection import train_test_split"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "FbFyMhp6vmWb",
"colab_type": "code",
"colab": {}
},
"source": [
"X_train, X_test, y_train, y_test = train_test_split(X,y,test_size = 0.33, random_state = 42)"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "VLr_dlWBvmWe",
"colab_type": "code",
"colab": {},
"outputId": "1c65a109-7d9b-451f-9179-64f7bf200182"
},
"source": [
"X_train.head()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>GRE Score</th>\n",
" <th>TOEFL Score</th>\n",
" <th>University Rating</th>\n",
" <th>SOP</th>\n",
" <th>LOR</th>\n",
" <th>CGPA</th>\n",
" <th>Research</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>443</th>\n",
" <td>321</td>\n",
" <td>114</td>\n",
" <td>5</td>\n",
" <td>4.5</td>\n",
" <td>4.5</td>\n",
" <td>9.16</td>\n",
" <td>1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>497</th>\n",
" <td>330</td>\n",
" <td>120</td>\n",
" <td>5</td>\n",
" <td>4.5</td>\n",
" <td>5.0</td>\n",
" <td>9.56</td>\n",
" <td>1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>124</th>\n",
" <td>301</td>\n",
" <td>106</td>\n",
" <td>4</td>\n",
" <td>2.5</td>\n",
" <td>3.0</td>\n",
" <td>8.47</td>\n",
" <td>0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>50</th>\n",
" <td>313</td>\n",
" <td>98</td>\n",
" <td>3</td>\n",
" <td>2.5</td>\n",
" <td>4.5</td>\n",
" <td>8.30</td>\n",
" <td>1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>331</th>\n",
" <td>311</td>\n",
" <td>105</td>\n",
" <td>2</td>\n",
" <td>3.0</td>\n",
" <td>2.0</td>\n",
" <td>8.12</td>\n",
" <td>1</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" GRE Score TOEFL Score University Rating SOP LOR CGPA Research\n",
"443 321 114 5 4.5 4.5 9.16 1\n",
"497 330 120 5 4.5 5.0 9.56 1\n",
"124 301 106 4 2.5 3.0 8.47 0\n",
"50 313 98 3 2.5 4.5 8.30 1\n",
"331 311 105 2 3.0 2.0 8.12 1"
]
},
"metadata": {
"tags": []
},
"execution_count": 17
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "WFnm3NkJvmWi",
"colab_type": "code",
"colab": {},
"outputId": "2abcc47c-d361-4dc0-ce21-26d0720c44c3"
},
"source": [
"X_train.shape"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"(335, 7)"
]
},
"metadata": {
"tags": []
},
"execution_count": 18
}
]
},
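{
"cell_type": "markdown",
"metadata": {
"colab_type": "text"
},
"source": [
"As a quick cross-check (an addition to the original notebook), the shapes of all four splits can be printed together. With 500 rows and `test_size = 0.33`, the outputs in this notebook show 335 training rows and 165 test rows, so the two `X` shapes should add up to 500 rows with 7 columns each."
]
},
{
"cell_type": "code",
"metadata": {
"colab_type": "code",
"colab": {}
},
"source": [
"# Added for illustration: print the shape of every split side by side.\n",
"for name, part in [('X_train', X_train), ('X_test', X_test),\n",
"                   ('y_train', y_train), ('y_test', y_test)]:\n",
"    print(name, part.shape)"
],
"execution_count": 0,
"outputs": []
},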
{
"cell_type": "markdown",
"metadata": {
"id": "3wVHmZrVvmYC",
"colab_type": "text"
},
"source": [
"### 5. Instantiating the Multiple Linear Regression model and fitting the model"
]
},
{
"cell_type": "code",
"metadata": {
"id": "YtdpglWavmYD",
"colab_type": "code",
"colab": {}
},
"source": [
"from sklearn.linear_model import LinearRegression"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "mSTkqjl0vmYF",
"colab_type": "code",
"colab": {}
},
"source": [
"model = LinearRegression()"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "-Y35KHX-vmYI",
"colab_type": "code",
"colab": {},
"outputId": "03536f2e-63db-4627-eec7-cec21b0edd48"
},
"source": [
"model.fit(X_train,y_train)"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"LinearRegression(copy_X=True, fit_intercept=True, n_jobs=1, normalize=False)"
]
},
"metadata": {
"tags": []
},
"execution_count": 41
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "ivVjryWMvmYP",
"colab_type": "text"
},
"source": [
"### 6. Get the model intercept and regression coefficients"
]
},
{
"cell_type": "code",
"metadata": {
"id": "XW8BHdTzvmYR",
"colab_type": "code",
"colab": {}
},
"source": [
"intercept = model.intercept_"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "AD8Fgi9LvmYV",
"colab_type": "code",
"colab": {},
"outputId": "a5c551ce-d255-4e5b-a69f-d47a3568bc2c"
},
"source": [
"intercept"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"-1.4242541443027852"
]
},
"metadata": {
"tags": []
},
"execution_count": 43
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "78T_tGgFvmYX",
"colab_type": "code",
"colab": {}
},
"source": [
"coefficients = model.coef_"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "YpyXPIhlvmYa",
"colab_type": "code",
"colab": {},
"outputId": "70bd1908-9c0e-4fee-de25-057e575706b4"
},
"source": [
"coefficients"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"array([0.00217167, 0.00294465, 0.00431416, 0.00161238, 0.01659515,\n",
" 0.12281766, 0.02050198])"
]
},
"metadata": {
"tags": []
},
"execution_count": 45
}
]
},
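{
"cell_type": "markdown",
"metadata": {
"colab_type": "text"
},
"source": [
"`model.coef_` is a plain array, so it is easy to lose track of which value belongs to which feature. A small sketch (not in the original notebook; `coef_table` is an illustrative name) that labels each coefficient with its column name, relying on the coefficients following the column order of `X_train`:"
]
},
{
"cell_type": "code",
"metadata": {
"colab_type": "code",
"colab": {}
},
"source": [
"# Added for illustration: label each coefficient with its feature name.\n",
"# coef_ follows the column order of the training DataFrame.\n",
"coef_table = pd.Series(coefficients, index=X_train.columns)\n",
"print(coef_table.sort_values(ascending=False))"
],
"execution_count": 0,
"outputs": []
},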
{
"cell_type": "markdown",
"metadata": {
"id": "YcY8RQNDvmYh",
"colab_type": "text"
},
"source": [
"### 7. Printing the equation using the coefficients we got above"
]
},
{
"cell_type": "code",
"metadata": {
"id": "c5yXmO6lvmYi",
"colab_type": "code",
"colab": {},
"outputId": "4c9833e9-bb9e-4280-9a25-3508cb72f466"
},
"source": [
"print('Admit_Predict = {0:0.4f} + ({1:0.4f} x GRE Score) + ({2:0.4f} x TOEFL Score) + ({3:0.4f} x University Rating) + ({4:0.4f} x SOP) +({5:0.4f} x LOR) + ({6:0.4f} x CGPA) +({7:0.4f} x Research)'.format(intercept, \n",
" coefficients[0], \n",
" coefficients[1], \n",
" coefficients[2], \n",
" coefficients[3], \n",
" coefficients[4], \n",
" coefficients[5], \n",
" coefficients[6]))"
],
"execution_count": 0,
"outputs": [
{
"output_type": "stream",
"text": [
"Admit_Predict = -1.4243 + (0.0022 x GRE Score) + (0.0029 x TOEFL Score) + (0.0043 x University Rating) + (0.0016 x SOP) +(0.0166 x LOR) + (0.1228 x CGPA) +(0.0205 x Research)\n"
],
"name": "stdout"
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "hmBanVw5vmYk",
"colab_type": "text"
},
"source": [
"### 8. Implementing the above equation and predicting the Admit Scores"
]
},
{
"cell_type": "code",
"metadata": {
"id": "lOHmHOzsvmYl",
"colab_type": "code",
"colab": {}
},
"source": [
"Admit_Predict = -1.4243 + (0.0022 * X_train['GRE Score']) + (0.0029 * X_train['TOEFL Score']) + (0.0043 * X_train['University Rating']) + (0.0016 * X_train['SOP']) +(0.0166 * X_train['LOR ']) + (0.1228 * X_train['CGPA']) +(0.0205 * X_train['Research'])"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "CaBu5Z7UvmYn",
"colab_type": "code",
"colab": {},
"outputId": "f255f8f1-fae1-49ae-e54f-952f17d18e75"
},
"source": [
"Admit_Predict.head()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"443 0.861248\n",
"497 0.955868\n",
"124 0.656416\n",
"50 0.679840\n",
"331 0.628636\n",
"dtype: float64"
]
},
"metadata": {
"tags": []
},
"execution_count": 118
}
]
},
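{
"cell_type": "markdown",
"metadata": {
"colab_type": "text"
},
"source": [
"The hand-written equation above uses coefficients rounded to four decimals, so its results can differ slightly from what the fitted model returns. As a sketch (an addition, with `manual_pred` as an illustrative name), the same linear combination can be computed from the unrounded `intercept` and `coefficients` and compared against `model.predict`:"
]
},
{
"cell_type": "code",
"metadata": {
"colab_type": "code",
"colab": {}
},
"source": [
"import numpy as np\n",
"\n",
"# Added for illustration: intercept + sum(coefficient * feature) per row,\n",
"# using the unrounded values taken from the fitted model.\n",
"manual_pred = intercept + X_train.dot(coefficients)\n",
"\n",
"# Compare the manual formula with the model's own predictions on the same rows.\n",
"print(np.allclose(manual_pred, model.predict(X_train)))"
],
"execution_count": 0,
"outputs": []
},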
{
"cell_type": "code",
"metadata": {
"id": "a4blZ0glvmYq",
"colab_type": "code",
"colab": {}
},
"source": [
"predictions = model.predict(X_test)"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "m51zeDZivmYv",
"colab_type": "code",
"colab": {},
"outputId": "c0765445-2d37-44f9-b792-fc5c4eea1c7a"
},
"source": [
"predictions.shape"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"(165,)"
]
},
"metadata": {
"tags": []
},
"execution_count": 120
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "dBSkxNsHvmYx",
"colab_type": "code",
"colab": {},
"outputId": "039ec1b2-a507-4d78-ed2a-01e6d6e2989f"
},
"source": [
"predictions"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"array([0.54539918, 0.55363752, 0.78000432, 0.597551 , 0.64566267,\n",
" 0.68833295, 0.82664645, 0.68428919, 0.65744214, 0.81637454,\n",
" 0.80592038, 0.69639124, 0.73953556, 0.72629543, 0.91058275,\n",
" 0.65096547, 0.86424101, 0.55253423, 0.56833614, 0.69443069,\n",
" 0.93981603, 0.82809052, 0.7057749 , 0.68265075, 0.84866986,\n",
" 0.41677175, 0.45426869, 0.78282754, 0.74439537, 0.62941571,\n",
" 0.60821993, 0.71991221, 0.65369032, 0.47574244, 0.56190713,\n",
" 0.41525819, 0.70733081, 0.61018855, 0.5510159 , 0.59136738,\n",
" 0.58473005, 0.64840503, 0.66023105, 0.8882523 , 0.86243497,\n",
" 0.86309401, 0.91663565, 0.72721531, 0.69744991, 0.71971577,\n",
" 0.68922378, 0.56502979, 0.71907273, 0.48716266, 0.46391183,\n",
" 0.58724119, 0.63811268, 0.52989804, 0.69304485, 0.52932209,\n",
" 0.96703953, 0.50234456, 0.760185 , 0.80683706, 0.6320383 ,\n",
" 0.70421576, 0.65733996, 0.95995125, 0.90553398, 0.81891241,\n",
" 0.63092162, 0.51734089, 0.81072184, 0.64351458, 0.93423053,\n",
" 0.68180096, 0.59762578, 0.70679619, 0.81197215, 0.70090992,\n",
" 0.59868914, 0.61671976, 0.75784717, 0.69352287, 0.91258705,\n",
" 0.73889149, 0.62336392, 0.84694287, 0.78719785, 0.95083272,\n",
" 0.58962115, 0.83550833, 0.90836346, 0.78785687, 0.53057778,\n",
" 0.81483149, 0.6040547 , 0.64881483, 0.635637 , 0.50640703,\n",
" 0.63748851, 0.63647359, 0.56320363, 0.88665176, 0.85762558,\n",
" 0.9594266 , 0.59228023, 0.6218097 , 0.85438238, 0.84801528,\n",
" 0.96605969, 0.75338374, 0.89920837, 0.59894699, 0.85988345,\n",
" 0.68324326, 0.50029916, 0.60548738, 0.88691487, 0.66408934,\n",
" 0.83845589, 0.73074958, 0.78803455, 0.71693481, 0.7619414 ,\n",
" 0.6623658 , 0.7788172 , 0.79581755, 0.56030919, 0.61204924,\n",
" 0.73125575, 0.97052884, 0.90841142, 1.00860231, 0.79695337,\n",
" 0.7402222 , 0.60401769, 0.77472074, 0.59145471, 0.52849543,\n",
" 0.86786918, 0.72693087, 0.66230199, 0.7716777 , 0.68760897,\n",
" 0.9152402 , 0.78942901, 0.65033575, 0.64887332, 0.50990954,\n",
" 0.86208432, 0.66961578, 0.57715668, 0.83828783, 0.80576419,\n",
" 0.89029024, 0.70912519, 0.591096 , 0.65530893, 0.62941468,\n",
" 0.63998591, 0.61842808, 0.7481594 , 0.74215304, 0.61427862])"
]
},
"metadata": {
"tags": []
},
"execution_count": 121
}
]
},
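{
"cell_type": "markdown",
"metadata": {
"colab_type": "text"
},
"source": [
"The test-set predictions can also be summarised numerically. This is a sketch that is not part of the original notebook; it uses `r2_score` and `mean_squared_error` from `sklearn.metrics`, and the names `r2` and `rmse` are illustrative."
]
},
{
"cell_type": "code",
"metadata": {
"colab_type": "code",
"colab": {}
},
"source": [
"import numpy as np\n",
"from sklearn.metrics import mean_squared_error, r2_score\n",
"\n",
"# Added for illustration: score the predictions on the held-out test set.\n",
"r2 = r2_score(y_test, predictions)\n",
"rmse = np.sqrt(mean_squared_error(y_test, predictions))\n",
"print('R^2 = {0:0.3f}, RMSE = {1:0.3f}'.format(r2, rmse))"
],
"execution_count": 0,
"outputs": []
},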
{
"cell_type": "markdown",
"metadata": {
"id": "VWNf65nwvmY1",
"colab_type": "text"
},
"source": [
"### 9. Plot the predicted versus actual values on a scatterplot"
]
},
{
"cell_type": "code",
"metadata": {
"id": "kN9QkCPUvmY1",
"colab_type": "code",
"colab": {}
},
"source": [
"import matplotlib.pyplot as plt"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "nIpQVolwvmY4",
"colab_type": "code",
"colab": {}
},
"source": [
"from scipy.stats import pearsonr"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "68AsL_LevmY7",
"colab_type": "code",
"colab": {},
"outputId": "669ebc2b-52d7-4097-fefe-741b35990777"
},
"source": [
"plt.scatter(y_test,predictions)\n",
"plt.xlabel('Y test(True values)')\n",
"plt.ylabel('Predicted Values')\n",
"plt.title('Predicted vs Actual value(r = {0:0.2f})'.format(pearsonr(y_test,predictions)[0]))\n",
"plt.show()"
],
"execution_count": 0,
"outputs": [
{
"output_type": "display_data",
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAYUAAAEWCAYAAACJ0YulAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAIABJREFUeJzt3XucXHV9//HXO8sCCwKBJraw5AIaoVwbWLn80ipouZcQxQvxgvFGbaX2hxobLIWA+CAVqdVKq4jUC0gUYmMQNLYGpKIgG0PAcOkvcksWrMGwKBDJZvP5/XHOTM7OnjPzndk5O2d2Ps/HYx+ZOXPmzPfMbs73fD/f7/fzlZnhnHPOAUxqdQGcc84Vh1cKzjnnyrxScM45V+aVgnPOuTKvFJxzzpV5peCcc67MKwUHgKSZkkzSTvHz70l61zh87mJJ1+f9OUUg6QRJG3M47gJJP27i8W6UNK9Zx2s1SUdI+kmry9EuvFJoI5Iel7RF0vOS/lfSv0t6WR6fZWanmdlXA8v053mUodkk3SHpWUm7BO4/oqLsBJKOAI4EvjMOnyVJ/yjpN/HPpySpyr5/L+lJSb+VtFTSnonXd5F0XfzaryR9uPSamd0PDEo6M+9zmgi8Umg/Z5rZy4CjgFcDF1XuEP8H8t9tgqSZwJ8BBsxtaWGK7S+BGyxjVmuTK8jzgHlEldARwF/En5/mXOCdwBxgP6AH+JfE64uBWcAM4ETgY5JOTbx+Q5VjuwS/cLQpMxsAvgccBuW74E9Kugt4EThQ0l6SvizpaUkDki6X1BXv3yXp05KekfQocEby+PHx3pd4/n5JD0n6naQHJR0l6evAdOCWuPXysXjf4yT9RNKgpLWSTkgc5wBJP4qP85/AlKxzjD/vLxLPd4rLe5SkXSVdH99hDkq6V9IfVvnKzgXuBr4CjAiLSeqRdJWkJyQ9J+nHknqAO+NdBuPzO74y3JUSdnt34nt6VFLQhUjSFyR9umLbd0p3vJIWSfpl4vt/Q8ZxRrVuUn6X74nL+KyklZJmJA5xGvCjxL4LJN0l6TOSNhNdfJvlXcBVZrYx/nu+CliQse+ZwJfNbIOZPQ/8I/BWSbvFr58LfMLMnjWzh4AvVRzrDuD1oa3ETuaVQpuSNA04HViT2PxOoruvPYAngK8C24BXArOBk4HSxeH9RHdms4E+4E1VPuvNRBeDc4E9ie60f2Nm7wSeJG69mNmnJPUCtwKXA/sAHwWWSZoaH+4bwGqiyuATVFygK9wIzE88PwV4xsx+Hr9vL2Aa8AfAB4AtVY51LtHd4g3AKRUVyKeBo4H/E5f5Y8B24DXx65Pj8/tpleOX/Jroe90TeDfwGUlHBbzvG0QXOQFI2pvo97U0fv2XRC2dvYBLgesl7Rtw3BEU9RV8HHgjMBX4b6LvGUm7AwcAj1S87VjgUeDlwCdTjvm2uGLO+pmeUZxDgbWJ52vjbalFj3+Sz3cBZsXf1X7VjhVXOkPAQRnHdyVm5j9t8gM8DjwPDBJd9P8V6IlfuwO4LLHvHwIvlV6Pt80Hbo8frwI+kHjtZKLQyk6J470vfrwS+NsqZfrzxPO/A75esc9Koov4dKJKavfEa98Ars849iuB3wG7xc9vAC6OH78H+AlwRMD39qdEF4Qp8fOHgQvix5OIKpMjU943M/mdxNsWJ8ubtk/FMZaXvjvgBGBjxn4iqmBfEz9/P7CqyjndB5wVP14A/LhKmZO/y+8B7028NomoZTkD6I3fu2vi9QXAkzn9PQ8DByeez4o/Xyn7vg/4n/j89gJWxPseT3RjUFnuk4DHK44xUPp+/Sf7x1sK7WeemU02sxlm9tdmlrw73pB4PAPoBp4u3bEBXyS624Poziq5/xNVPnMa0Z1qiBnAm5N3ikQX5X3jz3zWzF4I+VwzWw88BJwZhwnmElUiAF8nqmyWSnpKUSdld8ah3gX8wMyeiZ9/gx0tlCnArn
WcX1WSTpN0t6TN8bmfTpUQWYlFV62l7GgZvY2oEiwd91xJ9yW+08NCjptiBvDZxHE2E1VIvUQ3GxC1NJM2kI/niVpUJXsCz8ffRaXriFo0dwDrgNvj7Rvj45TenzzW7yqOsQc7ztFl8EphYkn+Z9pA1FKYElcik81sTzMrNamfJrrYl2Q18UvHekXAZ5b2/XriMyeb2e5mtiT+zL3jMEXI58KOENJZwINxRYGZDZnZpWZ2CFHY5y+IQkQjxH0DbwFeq2hUyq+AC4AjJR0JPAP8PuP80i5OLwC7JZ7/UeKzdgGWEYWj/tDMJgO3MTLsUetc3xTH+I+Nj0X8/EvA+cAfxMf9RcZxSxVuahmJfj9/WfH76TGzn8SV9S+BV1Ucs2oqZUlvj/tcsn6yfsfriDqZS46Mt41iZtvN7BIzm2lm+8f7DQADZvYs0d9W5rEk7QfszOjQmKvglcIEZWZPAz8ArpK0p6RJkl4h6bXxLt8CPiRp/zgmu6jK4a4FPirpaEVemeic/F/gwMS+1xPd2Z+iqDN7V0Xj8/c3syeAfuBSSTtL+lOiDsRqlhKFtv6KHa0EJJ0o6XBFHee/JQoPDae8f168/RDgT+KfPyaKpZ9rZtuJ7kL/SdJ+cZmPjy/wm4j6FpLndx/wGknTJe0FXJh4bWeiOPcmYJuk0+KyBzGzNfF7rwVWmlnprnZ3ogvzpvjc3008wCDlGJuILpbviM/lPYys8L4AXCjp0PhYe8V9RiW3Aa+lDmZ2g0V9Llk/T2a89WvAhyX1xhftjxANBBhF0j7x368kHQL8E1G4dHviWBdJ2lvSwUTht+SxTiAKx71Uz7l1pFbHr/wn/IeK+H3Fa3cQx40T2/YC/o2oif0cUaf0OfFrOwGfAX4DPAZ8kIw+hfj5B4jusp4nukudHW8/iygWPgh8NN52LNEIls1EF7JbgenxawcSXZCfB/4T+DwZfQqJz/4hUV/EHyW2zY/L8wJRxfQ5UuL6wPeJRrhUbn8L8Kv4e+gB/pnoYvoc0aijUl/NZfE5DALHxduujp+vJ7r4JL+3D8blGSQKcS0FLo9fO4GMPoVEuf4hPt6bK7Z/Mv4+nyG6IP6IHf0EC4j7FOLnp8W/00GiET3lfePX3wk8QFSZbgCuS7x2GNEdttKO3eS/ZwGfis9rc/xYidefB/4sfvyq+Pf9IlHI8cMVx9qFqHL/bfz9V75+KzC31f+H2+Gn9It3zjkAJH0D+JaZLW91WZpB0uHANWZ2fKvL0g68UnDOOVfmfQrOOefKvFJwzjlX5pWCc865srbL/jhlyhSbOXNmq4vhnHNtZfXq1c+Y2dRa+7VdpTBz5kz6+/tbXQznnGsrkqplLSjz8JFzzrkyrxScc86VeaXgnHOuzCsF55xzZV4pOOecK/NKwTnnXJlXCs4558q8UnDOOVfmlYJzzrkyrxScc86VtV2aC+ecm8iWrxngypWP8NTgFvab3MPCUw5i3uzecft8rxScc64glq8Z4MJvP8CWoWi58YHBLVz47QcAxq1iyC18JOk6Sb+W9IuM1yXpc5LWS7pf0lF5lcU559rBlSsfKVcIJVuGhrly5SPjVoY8+xS+Apxa5fXTgFnxz3lEC8w751zHempwS13b85BbpWBmdwKbq+xyFvA1i9wNTJa0b17lcc65ottvck9d2/PQytFHvcCGxPON8bZRJJ0nqV9S/6ZNm8alcM45N94WnnIQPd1dI7b1dHex8JSDxq0MrawUlLLN0nY0s2vMrM/M+qZOrblwkHPOtaV5s3u54o2H0zu5BwG9k3u44o2Hd8zoo43AtMTz/YGnWlQW55wrhHmze8e1EqjUypbCCuDceBTSccBzZvZ0C8vjnHMdL7eWgqQbgROAKZI2ApcA3QBm9gXgNuB0YD3wIvDuvMrinHMuTG6VgpnNr/G6AR/M6/Odc87Vz3MfOeecK/M0F845l4NW5zBqlFcKzjnXZKE5jNIqDsAT4jnn3ERSLYdR6QKfVn
EsvGktCIaGrbxtvBPieaXgnHN1CAkLheQwSqs4hraPnr9bWZnkzTuanXMuUOnufmBwC8aOO/nlawZG7BeSw2igjiR3EyIhnnPOTTShqa1Dchh1KS3TT7pOSYjnnHNtJTS19bzZvZx9dG/5wt8lcfbRI9NXDFtqqrdROikhnnPOtZXQ1NbL1wywbPVA+cI/bMay1QMjwky9Gcea3NPdsQnxnHOurSw85aARI4Yg/U4+ZPRR1rEWzz20pfMZvFJwzrlApYt1M0YfhR5rvHml4JxzdQhJbb3f5J7U0UWVYaZWp8lO430KzjmXYfmaAeYsWcUBi25lzpJVo4aeZjnx4PTFwLK2F4m3FJxzLkVoqoo0tz+cvmxw1vYi8ZaCc86lCJ2TkCZ06GoReaXgnHMpxnJhDx26WkReKTjnXIqxXNhDZjQXlVcKzjmXYiwX9nmze7nijYe3dBJao7yj2TnnUmTNIwCYs2RV0BoI7VAJVJIF5t8oir6+Puvv7291MZxzHahyRBJAd5cYHja2J/brniSufPORhaoUJK02s75a+3lLwTnnMlSunfDi1m2j10AYHn1jPbTdWLxiXaEqhVBeKTjnOlKtxXLS5inUY3DLUFPLO168UnDOdZyQiWlp8xQ6Qa6jjySdKukRSeslLUp5fYakH0q6X9IdkvbPszzOOQdhE9PaYaJZHnKrFCR1AVcDpwGHAPMlHVKx26eBr5nZEcBlwBV5lcc550pCJqa1w0SzPOTZUjgGWG9mj5rZVmApcFbFPocAP4wf357yunPONV3IxLS0eQrhC2jWt9xmkeRZKfQCGxLPN8bbktYCZ8eP3wDsIekPKg8k6TxJ/ZL6N20qfkIp51yxhUxMS5uAVs8A/vnHThu1rdGsq+Mpz47mtGqy8jv9KPB5SQuAO4EBYNuoN5ldA1wD0TyF5hbTOdeuao0gypI2Me3Eg6dy5cpHuOCb942YlJa0927dPPvi6FFFu+/cxe+HtjNsRpfE/GOncfm8w0eVtdGsq+Mpt8lrko4HFpvZKfHzCwHMLLXfQNLLgIfNrGpns09ec85B+kSynu6uhtJJpE5KmyTQyHkIadtCP3POklWpw1p7J/dw16LX1VXeRoROXsszfHQvMEvSAZJ2Bs4BViR3kDRFUqkMFwLX5Vge59wEMpbU1iHHGtpuoyamDW03dt95p4ZyGrVLOu3cwkdmtk3S+cBKoAu4zszWSboM6DezFcAJwBWSjCh89MG8yuOcm1iaeZGt5z3PbRnivktOrvszQpfobLVc5ymY2W1m9ioze4WZfTLednFcIWBmN5vZrHif95nZS3mWxzk3cTRzzYJ63tPoRbxd0ml76mznXFtq5kU27Vjdk0R318jxMmO5iLdLOm1Pc+Gca0tZqa1DL7KVI5fOPrqX2x/eFJQSeyxlLlolUMlTZzvnOk4zRy61C0+d7ZxrikbnArRKSHmrjVwq8rmNB68UnHOZ2mXCVUlWefuf2DwiNJSVBrve9NgTkXc0O+cyNXMuwHjIKu/1dz/JwOAWDL/w1+KVgnMuU7tMuCoparnaiVcKzrlMe/V017W91cY6EaxdM5s2k1cKzrlMWdfIZl47m5k5dOEpB0X5iRq0y04qdAbT8eAdzc65TIMpGUGrba9XLh3ZgXVC9yTYbmLYDEV57nhxaHvzytGmvKXgnMvUzFQSaZrdkX3lykdGJbFL0z1JXPnmP+GXV5zO40vOYL+9ethe8bYid6jnySsF51ymPPL1JMNFWSOBGu0wrva+ZHqJK9985IgWQLt1qOfJw0fOuUxjTSVRKW0mcZpGWyJ79XQzuGV0aGtyT3fVNQvaJYPpePBKwTlXVTPz9aSFiyqNpSUyNLy9ru0lC085KDXtRdEymI6HmpWCpDnAfWb2gqR3AEcBnzWzJ3IvnXNuQqkWjhGMSEQ3Z8mqulsnL2xNr3Cytpc0u0XUzkJaCv8GHCnpSOBjwJeBrwGvzbNgzrn2EpJzKCtMk1ySslWpNd
ohg+l4COlo3mZRKtWziFoInwX2yLdYzrl2UrqQJ1NJXPjtB0aN9Q/puB7LiKTJGZPqsra70UIqhd9JuhB4J3CrpC7Av2HnXFnWhXzxinUjJqYBNReaGctIoMVzDx01ea17klg899A6z6hzhYSP3gq8DXiPmf1K0nTgynyL5ZwbD5UhnxMPnjpqoZmxLEo/uGWoPBqo1Hq44o2H5zYSyPsGxq5mS8HMfgUsA3aJNz0D/EeehXLO5S8t5FOZTTQtBJQmdOhmSBioXdYynqhqVgqS3g/cDHwx3tQLLM+zUM65/IUMDw2N5addyLMMDG6pmutoLGsZh/ZtuGwh4aMPAscA9wCY2f+T9PJcS+Wcy13obN20/dJGGl3xxsNHbHtx6zaezciRVAoPDQxu4cPfug8YObKo0ZFAvqLa2IV0NL9kZltLTyTtBLTXws7OuVFCQz6V+2XdjQPcteh1PLbkDO5a9DouOfPQoNbDdoOPf/v+usufxtNVjF1IpfAjSR8HeiSdBNwE3BJycEmnSnpE0npJi1Jeny7pdklrJN0v6fT6iu+ca1RIyCctlh86ZDQtDJSllJ10rPJO4NcJQsJHi4D3Ag8AfwncBlxb603x0NWrgZOAjcC9klaY2YOJ3S4CvmVm/ybpkPjYM+s6A+cytNuC8+MtbaRO2ugjGDm7uJ4kdpVhoJmLbs3hTHbwdBVjV7NSMLPtwJfin3ocA6w3s0cBJC0lmgCXrBQM2DN+vBfwVJ2f4VyqdltwvlVqxe7TvkeRHj8OuRuXwFLe3KxFe3xI6tiF5D56jJS/ATM7sMZbe4ENiecbgWMr9lkM/EDS3wC7A3+eUYbzgPMApk+fXqvIznmHY5OkfY9pFUL3JAXdjb/92Olcf/eTqdubxdNVjE1I+Kgv8XhX4M3APgHvS6v7K/+e5gNfMbOrJB0PfF3SYXHrZMebzK4BrgHo6+vzTu42NN6hHO9wbI7Q7yu0R+DyeYcDcOM9Gxg2o0ti/rHTyttd64WEj35TsemfJf0YuLjGWzcC0xLP92d0eOi9wKnx5/xU0q7AFODXtcrl2kcrQjmeH785Ju/WnTmsNGl4u3HpLeuCfp+XzzvcK4ECCwkfHZV4Oomo5RCSEO9eYJakA4AB4ByidBlJTwKvB74i6Y+JWiKbAo7t2kgrQjkTocOxCB3lafH/LCGVBxTjvFy2kPDRVYnH24DHgbfUepOZbZN0PrAS6AKuM7N1ki4D+s1sBfAR4EuSLiAKLS2IM7K6CaQVoZyidzjWujAWpaP8uZRVzMaiKOflsoWEj05s9OBmdhvRMNPktosTjx8E5jR6fNceWhXKKWqHY8iFcaytq2bdjVcbglopJD21DwAovsxKQdKHq73RzP6p+cVxE9FECOU0U8iFsZ7WVVqm02WrB5pyN572u+ueJIbN2J5o04emp/YBAMVXraXgC+m4pih6KGe8hVwYQ1tXaa2OG+5+ctQwv0bvxrN+d2nbQo7tAwCKL7NSMLNLx7MgbmIraiinFUIujKGtq9B5BND43XjW766R36e3GosvZPTRrkRDRw8lGh0EgJm9J8dyOTdhhVwYQ1tX9Vzoi3A37q3G4gsZffR14GHgFOAy4O3AQ3kWyrmJLPTCGNK6qqcj+MSDp9bcJ62DOqSs9fBWY7Gp1ghQSWvMbLak+83sCEndwEozy15PL0d9fX3W39/fio92rnAq+xSAzNxEvZN7qi6DmXas7kkCwdDwjiP2dHcFL3rjikPSajPrq7VfSOrs0kDlQUmHESWumzmGsjnnmiQtPXWjfQpp/RND221EhQDhq7G59hQSPrpG0t5Eaa5XAC8D/iHXUjnnglWGY+YsWdXQCJ96+idCQ1au/WS2FCT9IYCZXWtmz5rZnWZ2oJm93My+mPU+51z9lq8ZqLpucT0aXfi+no7ormblunaFU62lsFbSA8CNwDIze26cyuRcR8ma4dz/xOZRC96kxfFD1ksO6RxOGxWVZdiz0UxY1SqFXq
\n",
"text/plain": [
"<Figure size 432x288 with 1 Axes>"
]
},
"metadata": {
"tags": [],
"needs_background": "light"
}
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "3wAfW6aSvmY-",
"colab_type": "text"
},
"source": [
"|Strength of Association|Positive Coefficient, r|Negative Coefficient, r|\n",
"|-------|-----------|------------|\n",
"|Small|0.1 to 0.3|-0.1 to -0.3|\n",
"|Medium|0.3 to 0.5|-0.3 to -0.5|\n",
"|Large|0.5 to 1.0|-0.5 to -1.0|\n"
]
},
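{
"cell_type": "markdown",
"metadata": {},
"source": [
"A minimal sketch of how the thresholds in the table above could be applied in code. The function name `association_strength`, the `negligible` label for values below 0.1, and the choice to assign boundary values (0.3, 0.5) to the stronger category are assumptions made for illustration; the table itself does not specify them.\n",
"\n",
"```python\n",
"def association_strength(r):\n",
"    # Label a correlation coefficient r using the thresholds from the table.\n",
"    size = abs(r)\n",
"    if size > 1:\n",
"        raise ValueError('r must lie between -1 and 1')\n",
"    if size >= 0.5:\n",
"        label = 'large'\n",
"    elif size >= 0.3:\n",
"        label = 'medium'\n",
"    elif size >= 0.1:\n",
"        label = 'small'\n",
"    else:\n",
"        # Assumption: values below 0.1 are not covered by the table\n",
"        return 'negligible'\n",
"    direction = 'positive' if r > 0 else 'negative'\n",
"    return '{0} {1}'.format(label, direction)\n",
"```\n",
"\n",
"For example, `association_strength(0.45)` returns `'medium positive'`."
]
},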
{
"cell_type": "markdown",
"metadata": {
"id": "qjBa-jdpvmY-",
"colab_type": "text"
},
"source": [
"### 10. Plotting the residuals\n",
"\n",
"As discussed in the Introduction section, a **residual** is simply the difference between the actual and predicted values of the dependent variable (actual minus predicted)."
]
},
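{
"cell_type": "markdown",
"metadata": {},
"source": [
"To make the definition concrete, here is a small sketch on made-up numbers. The arrays `y_actual` and `y_predicted` are hypothetical and stand in for `y_test` and `predictions`.\n",
"\n",
"```python\n",
"import numpy as np\n",
"\n",
"# Hypothetical actual and predicted values, for illustration only\n",
"y_actual = np.array([0.70, 0.85, 0.60])\n",
"y_predicted = np.array([0.75, 0.80, 0.65])\n",
"\n",
"# Residual = actual - predicted, computed element-wise\n",
"residuals = y_actual - y_predicted\n",
"print(np.round(residuals, 2))\n",
"```\n",
"\n",
"A negative residual means the model predicted too high for that observation; a positive residual means it predicted too low."
]
},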
{
"cell_type": "code",
"metadata": {
"id": "Nv3crpJNvmY_",
"colab_type": "code",
"colab": {}
},
"source": [
"import seaborn as sns\n",
"from scipy.stats import shapiro"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "qVDERqMHvmZB",
"colab_type": "code",
"colab": {},
"outputId": "cb303a10-76f7-40fc-abc0-9c3dec00ba6b"
},
"source": [
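"residuals = y_test - predictions\n",
"# Shapiro-Wilk test: element 1 of the result is the p-value\n",
"shapiro_p = shapiro(residuals)[1]\n",
"# Note: distplot is deprecated in seaborn >= 0.11; histplot(residuals, kde=True, stat='density') is the newer equivalent\n",
"sns.distplot(residuals, bins=50)\n",
"plt.xlabel('Residuals')\n",
"plt.ylabel('Density')\n",
"plt.title('Histogram of residuals (Shapiro W p-value = {0:.3f})'.format(shapiro_p))\n",
"plt.show()"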
],
"execution_count": 0,
"outputs": [
{
"output_type": "display_data",
"data": {
"image/png": "\n",
"text/plain": [
"<Figure size 432x288 with 1 Axes>"
]
},
"metadata": {
"tags": [],
"needs_background": "light"
}
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "fZY7WTC-vmZF",
"colab_type": "text"
},
"source": [
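"The histogram shows that the residuals are negatively skewed (the longer tail is on the left), and the Shapiro-Wilk p-value in the title tells us to reject the hypothesis that they are normally distributed. Since roughly normal residuals are one of the usual assumptions behind linear regression, this is further evidence that our model has room for improvement."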
]
},
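{
"cell_type": "markdown",
"metadata": {},
"source": [
"The two checks described above (skewness and the Shapiro-Wilk test) can also be run numerically. The sketch below uses synthetic left-skewed data in place of the real residuals; the variable `fake_residuals`, the random seed, and the 0.05 significance level are assumptions for illustration.\n",
"\n",
"```python\n",
"import numpy as np\n",
"from scipy.stats import shapiro, skew\n",
"\n",
"rng = np.random.RandomState(0)  # fixed seed so the sketch is repeatable\n",
"# Synthetic, left-skewed stand-in for the residuals (not the notebook's data)\n",
"fake_residuals = -rng.exponential(scale=0.05, size=200)\n",
"\n",
"sample_skew = skew(fake_residuals)  # negative for a left-skewed sample\n",
"w_stat, p_value = shapiro(fake_residuals)\n",
"\n",
"alpha = 0.05  # conventional significance level, an assumption here\n",
"print('skewness = {0:.3f}'.format(sample_skew))\n",
"print('Shapiro-Wilk p-value = {0:.3g}'.format(p_value))\n",
"print('reject normality' if p_value < alpha else 'cannot reject normality')\n",
"```\n",
"\n",
"To apply the same check to the model, pass `y_test - predictions` in place of `fake_residuals`."
]
},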
{
"cell_type": "markdown",
"metadata": {
"id": "37su5EjRvmZG",
"colab_type": "text"
},
"source": [
"### 11. Computing the metrics for mean absolute error, mean squared error, root mean squared error, and R-squared to determine the model performance"
]
},
{
"cell_type": "code",
"metadata": {
"id": "h7OCFPO1vmZH",
"colab_type": "code",
"colab": {}
},
"source": [
"from sklearn import metrics"
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"code_folding": [],
"id": "QNv1NtaMvmZK",
"colab_type": "code",
"colab": {}
},
"source": [
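"import numpy as np\n",
"\n",
"# Note: explained_variance_score equals R-squared only when the residuals average to zero;\n",
"# metrics.r2_score computes R-squared directly.\n",
"metrics_df = pd.DataFrame({'Metric':['MAE','MSE','RMSE','R-Squared'],\n",
"                           'Value':[metrics.mean_absolute_error(y_test,predictions),\n",
"                                    metrics.mean_squared_error(y_test,predictions),\n",
"                                    np.sqrt(metrics.mean_squared_error(y_test,predictions)),\n",
"                                    metrics.explained_variance_score(y_test,predictions)]}).round(3)"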
],
"execution_count": 0,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "MezsHFrYvmZQ",
"colab_type": "code",
"colab": {},
"outputId": "8608ecf6-d460-4edb-efed-504c76b2647b"
},
"source": [
"metrics_df"
],
"execution_count": 0,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>Metric</th>\n",
" <th>Value</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>MAE</td>\n",
" <td>0.041</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>MSE</td>\n",
" <td>0.003</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>RMSE</td>\n",
" <td>0.058</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>R-Squared</td>\n",
" <td>0.809</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" Metric Value\n",
"0 MAE 0.041\n",
"1 MSE 0.003\n",
"2 RMSE 0.058\n",
"3 R-Squared 0.809"
]
},
"metadata": {
"tags": []
},
"execution_count": 131
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "BOPAAOICvmZT",
"colab_type": "text"
},
"source": [
"|Metric|Simple Linear Regression|Multiple Linear Regression|\n",
"|-------|-----------|------------|\n",
"|MAE|0.059|0.041|\n",
"|MSE|0.006|0.003|\n",
"|RMSE|0.080|0.058|\n",
"|R-Squared|0.629|0.809|"
]
},
{
"cell_type": "code",
"metadata": {
"id": "u6ULsdU6vmZU",
"colab_type": "code",
"colab": {}
},
"source": [
""
],
"execution_count": 0,
"outputs": []
}
]
}