Skip to content

Instantly share code, notes, and snippets.

@pfokin92
Last active December 22, 2019 06:21
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save pfokin92/5ea68906eb0f8f47473c72ceadab1f4f to your computer and use it in GitHub Desktop.
Save pfokin92/5ea68906eb0f8f47473c72ceadab1f4f to your computer and use it in GitHub Desktop.
Display the source blob
Display the rendered blob
Raw
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Домашнее задание к лекции \"Функции и работа с данными\"\n",
"\n",
"## Задание 1\n",
"\n",
"Напишите функцию, которая классифицирует фильмы из материалов занятия по следующим правилам:\n",
"- оценка 2 и меньше - низкий рейтинг \n",
"- оценка 4 и меньше - средний рейтинг \n",
"- оценка 4.5 и 5 - высокий рейтинг \n",
"\n",
"Результат классификации запишите в столбец class"
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import pandas as pd"
]
},
{
"cell_type": "code",
"execution_count": 3,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>userId</th>\n",
" <th>movieId</th>\n",
" <th>rating</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>1</td>\n",
" <td>1</td>\n",
" <td>4.0</td>\n",
" <td>964982703</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>1</td>\n",
" <td>3</td>\n",
" <td>4.0</td>\n",
" <td>964981247</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>1</td>\n",
" <td>6</td>\n",
" <td>4.0</td>\n",
" <td>964982224</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>1</td>\n",
" <td>47</td>\n",
" <td>5.0</td>\n",
" <td>964983815</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>1</td>\n",
" <td>50</td>\n",
" <td>5.0</td>\n",
" <td>964982931</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" userId movieId rating timestamp\n",
"0 1 1 4.0 964982703\n",
"1 1 3 4.0 964981247\n",
"2 1 6 4.0 964982224\n",
"3 1 47 5.0 964983815\n",
"4 1 50 5.0 964982931"
]
},
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"raiting=pd.read_csv('ratings.csv')\n",
"raiting.head()"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [],
"source": [
"def class_movies(row):\n",
" if row['rating']<=2.0:\n",
" return 'низкий рейтинг'\n",
" elif 2.0<row['rating']<=4.0:\n",
" return 'средний рейтинг'\n",
" else:\n",
" return 'высокий рейтинг'"
]
},
{
"cell_type": "code",
"execution_count": 29,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>userId</th>\n",
" <th>movieId</th>\n",
" <th>rating</th>\n",
" <th>timestamp</th>\n",
" <th>class</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>1</td>\n",
" <td>1</td>\n",
" <td>4.0</td>\n",
" <td>964982703</td>\n",
" <td>средний рейтинг</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>1</td>\n",
" <td>3</td>\n",
" <td>4.0</td>\n",
" <td>964981247</td>\n",
" <td>средний рейтинг</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>1</td>\n",
" <td>6</td>\n",
" <td>4.0</td>\n",
" <td>964982224</td>\n",
" <td>средний рейтинг</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>1</td>\n",
" <td>47</td>\n",
" <td>5.0</td>\n",
" <td>964983815</td>\n",
" <td>высокий рейтинг</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>1</td>\n",
" <td>50</td>\n",
" <td>5.0</td>\n",
" <td>964982931</td>\n",
" <td>высокий рейтинг</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" userId movieId rating timestamp class\n",
"0 1 1 4.0 964982703 средний рейтинг\n",
"1 1 3 4.0 964981247 средний рейтинг\n",
"2 1 6 4.0 964982224 средний рейтинг\n",
"3 1 47 5.0 964983815 высокий рейтинг\n",
"4 1 50 5.0 964982931 высокий рейтинг"
]
},
"execution_count": 29,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"raiting['class'] = raiting.apply(class_movies, axis=1)\n",
"raiting.head()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Задание 2\n",
"\n",
"Используем файл keywords.csv.\n",
"\n",
"Необходимо написать гео-классификатор, который каждой строке сможет выставить географическую принадлежность определенному региону. Т. е. если поисковый запрос содержит название города региона, то в столбце ‘region’ пишется название этого региона. Если поисковый запрос не содержит названия города, то ставим ‘undefined’.\n",
"\n",
"Правила распределения по регионам Центр, Северо-Запад и Дальний Восток:\n",
"\n",
"geo_data = {\n",
"\n",
"'Центр': ['москва', 'тула', 'ярославль'],\n",
"\n",
"'Северо-Запад': ['петербург', 'псков', 'мурманск'],\n",
"\n",
"'Дальний Восток': ['владивосток', 'сахалин', 'хабаровск']\n",
"}\n",
"\n",
"Результат классификации запишите в отдельный столбец region."
]
},
{
"cell_type": "code",
"execution_count": 63,
"metadata": {},
"outputs": [],
"source": [
"import pandas as pd"
]
},
{
"cell_type": "code",
"execution_count": 64,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"['москва',\n",
" 'тула',\n",
" 'ярославль',\n",
" 'петербург',\n",
" 'псков',\n",
" 'мурманск',\n",
" 'владивосток',\n",
" 'сахалин',\n",
" 'хабаровск']"
]
},
"execution_count": 64,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"geo_data = {\n",
"\n",
"'Центр':['москва', 'тула', 'ярославль'],\n",
"\n",
"'Северо-Запад':['петербург', 'псков', 'мурманск'],\n",
"\n",
"'Дальний Восток':['владивосток', 'сахалин', 'хабаровск'] \n",
"}\n",
"towns=[]\n",
"for values in geo_data.values():\n",
" for i in values:\n",
" towns.append(i)\n",
"towns"
]
},
{
"cell_type": "code",
"execution_count": 65,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Центр\n"
]
}
],
"source": [
"row='вконтакте моя страница москва'\n",
"data=row.split(' ')\n",
"def check(t):\n",
" for i in t:\n",
" if i in towns:\n",
" for items in geo_data.items():\n",
" if i in items[1]:\n",
" print(items[0])\n",
"check(data)"
]
},
{
"cell_type": "code",
"execution_count": 66,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>keyword</th>\n",
" <th>shows</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>вк</td>\n",
" <td>64292779</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>одноклассники</td>\n",
" <td>63810309</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>порно</td>\n",
" <td>41747114</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>ютуб</td>\n",
" <td>39995567</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>вконтакте</td>\n",
" <td>21014195</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" keyword shows\n",
"0 вк 64292779\n",
"1 одноклассники 63810309\n",
"2 порно 41747114\n",
"3 ютуб 39995567\n",
"4 вконтакте 21014195"
]
},
"execution_count": 66,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df=pd.read_csv('keywords.csv')\n",
"df.head()"
]
},
{
"cell_type": "code",
"execution_count": 67,
"metadata": {},
"outputs": [],
"source": [
"def region_in_keyword(row):\n",
" data=row['keyword'].split(' ')\n",
" i=0\n",
" for word in data:\n",
" if word in towns:\n",
" for items in geo_data.items():\n",
" if word in items[1]:\n",
" i+=1\n",
" return items[0]\n",
" if i==0:\n",
" return 'undefined' "
]
},
{
"cell_type": "code",
"execution_count": 68,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>keyword</th>\n",
" <th>shows</th>\n",
" <th>region</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>вк</td>\n",
" <td>64292779</td>\n",
" <td>undefined</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>одноклассники</td>\n",
" <td>63810309</td>\n",
" <td>undefined</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>порно</td>\n",
" <td>41747114</td>\n",
" <td>undefined</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>ютуб</td>\n",
" <td>39995567</td>\n",
" <td>undefined</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>вконтакте</td>\n",
" <td>21014195</td>\n",
" <td>undefined</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" keyword shows region\n",
"0 вк 64292779 undefined\n",
"1 одноклассники 63810309 undefined\n",
"2 порно 41747114 undefined\n",
"3 ютуб 39995567 undefined\n",
"4 вконтакте 21014195 undefined"
]
},
"execution_count": 68,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df['region'] = df.apply(region_in_keyword, axis=1)\n",
"df.head()"
]
},
{
"cell_type": "code",
"execution_count": 69,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>keyword</th>\n",
" <th>shows</th>\n",
" <th>region</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>127</th>\n",
" <td>авито москва</td>\n",
" <td>979292</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>370</th>\n",
" <td>авито ру санкт петербург</td>\n",
" <td>425134</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>849</th>\n",
" <td>авито ярославль</td>\n",
" <td>209581</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1063</th>\n",
" <td>фарпост владивосток</td>\n",
" <td>176951</td>\n",
" <td>Дальний Восток</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1236</th>\n",
" <td>банк санкт петербург</td>\n",
" <td>174375</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1259</th>\n",
" <td>москва</td>\n",
" <td>153050</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1346</th>\n",
" <td>санкт петербург</td>\n",
" <td>144232</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1396</th>\n",
" <td>гисметео москва</td>\n",
" <td>140279</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1482</th>\n",
" <td>госуслуги москва</td>\n",
" <td>133061</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1579</th>\n",
" <td>авито тула</td>\n",
" <td>126168</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1682</th>\n",
" <td>погода москва</td>\n",
" <td>151701</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2153</th>\n",
" <td>авито псков</td>\n",
" <td>98931</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2175</th>\n",
" <td>дром хабаровск</td>\n",
" <td>98361</td>\n",
" <td>Дальний Восток</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2398</th>\n",
" <td>авито мурманск</td>\n",
" <td>89787</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2550</th>\n",
" <td>банк санкт петербург интернет банк</td>\n",
" <td>89123</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2806</th>\n",
" <td>циан недвижимость москва</td>\n",
" <td>78729</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2974</th>\n",
" <td>авто ру санкт петербург</td>\n",
" <td>74918</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3206</th>\n",
" <td>дром владивосток</td>\n",
" <td>73084</td>\n",
" <td>Дальний Восток</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3275</th>\n",
" <td>100 сп владивосток</td>\n",
" <td>76418</td>\n",
" <td>Дальний Восток</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3687</th>\n",
" <td>авито хабаровск</td>\n",
" <td>63736</td>\n",
" <td>Дальний Восток</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3786</th>\n",
" <td>метро москва</td>\n",
" <td>68024</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4888</th>\n",
" <td>перекресток акции москва каталог сегодня</td>\n",
" <td>48619</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5013</th>\n",
" <td>фк спартак москва</td>\n",
" <td>49699</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5434</th>\n",
" <td>авто ру москва</td>\n",
" <td>46991</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5512</th>\n",
" <td>схема метро москва</td>\n",
" <td>62915</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5642</th>\n",
" <td>погода псков</td>\n",
" <td>47395</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5671</th>\n",
" <td>avito санкт петербург</td>\n",
" <td>44214</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5700</th>\n",
" <td>холодильник ру интернет магазин москва</td>\n",
" <td>44385</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5768</th>\n",
" <td>спартак москва</td>\n",
" <td>43656</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5825</th>\n",
" <td>петрович санкт петербург каталог товаров</td>\n",
" <td>43280</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96233</th>\n",
" <td>автосуши тула</td>\n",
" <td>3812</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96245</th>\n",
" <td>вакансии псков</td>\n",
" <td>4085</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96422</th>\n",
" <td>наш город москва портал gorod mos ru</td>\n",
" <td>3757</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96427</th>\n",
" <td>хостел москва</td>\n",
" <td>4193</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96681</th>\n",
" <td>налоговая инспекция по адресу проживания москва</td>\n",
" <td>3737</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96746</th>\n",
" <td>афиша ярославль</td>\n",
" <td>4033</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96820</th>\n",
" <td>рп5 тула</td>\n",
" <td>3750</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>96988</th>\n",
" <td>спартак москва футбол</td>\n",
" <td>5273</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97115</th>\n",
" <td>авиабилеты москва сочи</td>\n",
" <td>13146</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97199</th>\n",
" <td>запись на прием к врачу москва</td>\n",
" <td>3756</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97349</th>\n",
" <td>спорт и отдых мурманск</td>\n",
" <td>3791</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97382</th>\n",
" <td>комсомольская правда москва</td>\n",
" <td>3708</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97413</th>\n",
" <td>kassir ru санкт петербург</td>\n",
" <td>3723</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97514</th>\n",
" <td>расписание электричек владимир москва</td>\n",
" <td>8480</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97606</th>\n",
" <td>торпедо москва</td>\n",
" <td>3729</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>97613</th>\n",
" <td>би порт мурманск</td>\n",
" <td>3723</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98183</th>\n",
" <td>москва октябрьская это какой вокзал</td>\n",
" <td>3696</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98272</th>\n",
" <td>управляющая компания жкх по адресу москва</td>\n",
" <td>3745</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98388</th>\n",
" <td>пилюли ру интернет аптека москва</td>\n",
" <td>3690</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98542</th>\n",
" <td>спортмастер ярославль каталог товаров</td>\n",
" <td>3734</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98679</th>\n",
" <td>геи москва</td>\n",
" <td>4205</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98726</th>\n",
" <td>ледовый дворец санкт петербург</td>\n",
" <td>3772</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>98943</th>\n",
" <td>вакансии на авито москва от прямых работодателей</td>\n",
" <td>3738</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99084</th>\n",
" <td>122 медсанчасть санкт петербург официальный сайт</td>\n",
" <td>3679</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99204</th>\n",
" <td>bileter ru санкт петербург</td>\n",
" <td>3661</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99292</th>\n",
" <td>афиша кино москва на сегодня</td>\n",
" <td>3704</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99590</th>\n",
" <td>авито ярославль автомобили с пробегом</td>\n",
" <td>3673</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99808</th>\n",
" <td>полармед мурманск запись на прием</td>\n",
" <td>3719</td>\n",
" <td>Северо-Запад</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99890</th>\n",
" <td>яндекс метро москва</td>\n",
" <td>4548</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" <tr>\n",
" <th>99922</th>\n",
" <td>авто ру ярославль</td>\n",
" <td>3712</td>\n",
" <td>Центр</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>594 rows × 3 columns</p>\n",
"</div>"
],
"text/plain": [
" keyword shows \\\n",
"127 авито москва 979292 \n",
"370 авито ру санкт петербург 425134 \n",
"849 авито ярославль 209581 \n",
"1063 фарпост владивосток 176951 \n",
"1236 банк санкт петербург 174375 \n",
"1259 москва 153050 \n",
"1346 санкт петербург 144232 \n",
"1396 гисметео москва 140279 \n",
"1482 госуслуги москва 133061 \n",
"1579 авито тула 126168 \n",
"1682 погода москва 151701 \n",
"2153 авито псков 98931 \n",
"2175 дром хабаровск 98361 \n",
"2398 авито мурманск 89787 \n",
"2550 банк санкт петербург интернет банк 89123 \n",
"2806 циан недвижимость москва 78729 \n",
"2974 авто ру санкт петербург 74918 \n",
"3206 дром владивосток 73084 \n",
"3275 100 сп владивосток 76418 \n",
"3687 авито хабаровск 63736 \n",
"3786 метро москва 68024 \n",
"4888 перекресток акции москва каталог сегодня 48619 \n",
"5013 фк спартак москва 49699 \n",
"5434 авто ру москва 46991 \n",
"5512 схема метро москва 62915 \n",
"5642 погода псков 47395 \n",
"5671 avito санкт петербург 44214 \n",
"5700 холодильник ру интернет магазин москва 44385 \n",
"5768 спартак москва 43656 \n",
"5825 петрович санкт петербург каталог товаров 43280 \n",
"... ... ... \n",
"96233 автосуши тула 3812 \n",
"96245 вакансии псков 4085 \n",
"96422 наш город москва портал gorod mos ru 3757 \n",
"96427 хостел москва 4193 \n",
"96681 налоговая инспекция по адресу проживания москва 3737 \n",
"96746 афиша ярославль 4033 \n",
"96820 рп5 тула 3750 \n",
"96988 спартак москва футбол 5273 \n",
"97115 авиабилеты москва сочи 13146 \n",
"97199 запись на прием к врачу москва 3756 \n",
"97349 спорт и отдых мурманск 3791 \n",
"97382 комсомольская правда москва 3708 \n",
"97413 kassir ru санкт петербург 3723 \n",
"97514 расписание электричек владимир москва 8480 \n",
"97606 торпедо москва 3729 \n",
"97613 би порт мурманск 3723 \n",
"98183 москва октябрьская это какой вокзал 3696 \n",
"98272 управляющая компания жкх по адресу москва 3745 \n",
"98388 пилюли ру интернет аптека москва 3690 \n",
"98542 спортмастер ярославль каталог товаров 3734 \n",
"98679 геи москва 4205 \n",
"98726 ледовый дворец санкт петербург 3772 \n",
"98943 вакансии на авито москва от прямых работодателей 3738 \n",
"99084 122 медсанчасть санкт петербург официальный сайт 3679 \n",
"99204 bileter ru санкт петербург 3661 \n",
"99292 афиша кино москва на сегодня 3704 \n",
"99590 авито ярославль автомобили с пробегом 3673 \n",
"99808 полармед мурманск запись на прием 3719 \n",
"99890 яндекс метро москва 4548 \n",
"99922 авто ру ярославль 3712 \n",
"\n",
" region \n",
"127 Центр \n",
"370 Северо-Запад \n",
"849 Центр \n",
"1063 Дальний Восток \n",
"1236 Северо-Запад \n",
"1259 Центр \n",
"1346 Северо-Запад \n",
"1396 Центр \n",
"1482 Центр \n",
"1579 Центр \n",
"1682 Центр \n",
"2153 Северо-Запад \n",
"2175 Дальний Восток \n",
"2398 Северо-Запад \n",
"2550 Северо-Запад \n",
"2806 Центр \n",
"2974 Северо-Запад \n",
"3206 Дальний Восток \n",
"3275 Дальний Восток \n",
"3687 Дальний Восток \n",
"3786 Центр \n",
"4888 Центр \n",
"5013 Центр \n",
"5434 Центр \n",
"5512 Центр \n",
"5642 Северо-Запад \n",
"5671 Северо-Запад \n",
"5700 Центр \n",
"5768 Центр \n",
"5825 Северо-Запад \n",
"... ... \n",
"96233 Центр \n",
"96245 Северо-Запад \n",
"96422 Центр \n",
"96427 Центр \n",
"96681 Центр \n",
"96746 Центр \n",
"96820 Центр \n",
"96988 Центр \n",
"97115 Центр \n",
"97199 Центр \n",
"97349 Северо-Запад \n",
"97382 Центр \n",
"97413 Северо-Запад \n",
"97514 Центр \n",
"97606 Центр \n",
"97613 Северо-Запад \n",
"98183 Центр \n",
"98272 Центр \n",
"98388 Центр \n",
"98542 Центр \n",
"98679 Центр \n",
"98726 Северо-Запад \n",
"98943 Центр \n",
"99084 Северо-Запад \n",
"99204 Северо-Запад \n",
"99292 Центр \n",
"99590 Центр \n",
"99808 Северо-Запад \n",
"99890 Центр \n",
"99922 Центр \n",
"\n",
"[594 rows x 3 columns]"
]
},
"execution_count": 69,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df[(df['region']!='undefined')]"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.7.3"
}
},
"nbformat": 4,
"nbformat_minor": 2
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment