Skip to content

Instantly share code, notes, and snippets.

@ravy101
Created April 19, 2019 01:55
Show Gist options
  • Save ravy101/8f741e45fe4d6446a7c8b3e377cf1f1e to your computer and use it in GitHub Desktop.
Save ravy101/8f741e45fe4d6446a7c8b3e377cf1f1e to your computer and use it in GitHub Desktop.
Created on Cognitive Class Labs
Display the source blob
Display the rendered blob
Raw
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Data\n",
"Our primary source of data for this project will be the Foursquare API, this will provide us with a representative (but not comprehensive) list of restaurants in Sydney. We are mostly interested in the category (restaurant type) and location of each venue, and the subset of venues will be those belonging to the 'food' category. This data will be pulled from the API in JSON format then converted to a dataframe. Due to the large number of venues we want to consider and the limitations of the API, results may be temporarily stored as a CSV in our workspace. We will request the venues at each postcode, then mark that postcode as 'updated' and save both the venues and the updated postcodes as a CSV. This way we can build a more complete set of venues over time without duplicates.\n",
"\n",
"https://api.foursquare.com/v2/venues/\n",
"\n",
"We will need a directory of postcodes for the greater Sydney area that include longitude and latitude. We will use a list of all Australian postcodes as a CSV and filter by state and then by coordinates.\n",
"\n",
"http://www.corra.com.au/downloads/Australian_Post_Codes_Lat_Lon.zip\n",
"\n",
"We will also consider data publicly available from the Australian Tax Office (ATO) via data.gov.au as a CSV file to get an indication of disposable income and potential customers in each area. The ATO publishes median incomes and number of tax payers by postcode, which will will subset to the Sydney area and join to our postcode data.\n",
"\n",
"https://data.gov.au/data/dataset/taxation-statistics-2016-17/resource/d2804aa6-4c32-453c-9d60-d957e9028952\n",
"\n",
"Supplimental data will be used from the Australian Beaurau of Statistics including counts of businesses registered by statistical area. This information will be mapped to postcode using a conversion table and then used to identify commercial areas vs residential areas. \n",
"\n",
"http://www.abs.gov.au/AUSSTATS/abs@.nsf/DetailsPage/8165.0June%202014%20to%20June%202018?OpenDocument\n",
"\n"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.6.8"
}
},
"nbformat": 4,
"nbformat_minor": 2
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment