Skip to content

Instantly share code, notes, and snippets.

@1ambda
Created December 23, 2021 23:09
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save 1ambda/25fc2e7bf7deedaa1a0aaeef0cf7e13c to your computer and use it in GitHub Desktop.
Save 1ambda/25fc2e7bf7deedaa1a0aaeef0cf7e13c to your computer and use it in GitHub Desktop.
from pyspark.sql.types import *
from pyspark.sql.functions import *
from pyspark.sql.window import Window
dfListing = spark.read.load("./airbnb_listings.csv",
format="csv", inferSchema=True, header=True,
quote='"', escape='"', sep=',', multiline=True)
dfListing.printSchema()
dfListing.count()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment