production
https://apis.pasarpolis.com| # ## Agenda | |
| # | |
| # 1. Reading in the Kaggle data and adding features | |
| # 2. Using a **`Pipeline`** for proper cross-validation | |
| # 3. Combining **`GridSearchCV`** with **`Pipeline`** | |
| # 4. Efficiently searching for tuning parameters using **`RandomizedSearchCV`** | |
| # 5. Adding features to a document-term matrix (using SciPy) | |
| # 6. Adding features to a document-term matrix (using **`FeatureUnion`**) | |
| # 7. Ensembling models | |
| # 8. Locating groups of similar cuisines | 
| import csv, json, os, os.path, re, string, urllib2 | |
| from bs4 import BeautifulSoup | |
| # config and params | |
| protocol = 'https://www.' | |
| domain = 'instagram.com/' | |
| username = 'selenagomez' | |
| site = protocol + domain + username | |
| # tag identifier of which content will be graped |