Skip to content

Instantly share code, notes, and snippets.

@matheus-rossi
Created February 21, 2024 18:26
Show Gist options
  • Save matheus-rossi/b70b6d55c206a76c847ffcb414e592a7 to your computer and use it in GitHub Desktop.
Save matheus-rossi/b70b6d55c206a76c847ffcb414e592a7 to your computer and use it in GitHub Desktop.
spark_tips_01
from pyspark.sql import SparkSession
spark = (
SparkSession
.builder
.appName("spark_parameterized_queries")
.getOrCreate()
)
##### Criando dois datasets de teste #####
dataset1 = [("Alice", 25), ("Bob", 30), ("Charlie", 35)]
df1 = spark.createDataFrame(dataset1, ["Name", "Age"])
##### createOrReplaceTempView #####
df1.createOrReplaceTempView("people")
spark.sql("SELECT Name, Age FROM people WHERE Age > 30").show()
##### Parametrized Queries #####
spark.sql(" SELECT Name, Age FROM {dataset} WHERE Age > {age}",
dataset=df1,
age=30
).show()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment