Skip to content

Instantly share code, notes, and snippets.

@dvainrub
Last active October 8, 2022 08:11
Show Gist options
  • Star 4 You must be signed in to star a gist
  • Fork 2 You must be signed in to fork a gist
  • Save dvainrub/b6178dc0e976e56abe9caa9b72f73d4a to your computer and use it in GitHub Desktop.
Save dvainrub/b6178dc0e976e56abe9caa9b72f73d4a to your computer and use it in GitHub Desktop.
from pyspark.sql import SparkSession
def init_spark():
spark = SparkSession.builder.appName("HelloWorld").getOrCreate()
sc = spark.sparkContext
return spark,sc
def main():
spark,sc = init_spark()
nums = sc.parallelize([1,2,3,4])
print(nums.map(lambda x: x*x).collect())
if __name__ == '__main__':
main()
@chaitanya94prasad
Copy link

How to run this file. I am using python 3.9 and the latest version of spark. while running it I am getting errors. Only difference is that all the spark related activities are done in another file which is imported in main.py
below are the error
Exception in thread "main" java.lang.ExceptionInInitializerError
at org.apache.spark.unsafe.array.ByteArrayMethods.(ByteArrayMethods.java:54)
at org.apache.spark.internal.config.package$.(package.scala:1095)
at org.apache.spark.internal.config.package$.(package.scala)
at org.apache.spark.deploy.SparkSubmitArguments.$anonfun$loadEnvironmentArguments$3(SparkSubmitArguments.scala:157)
at scala.Option.orElse(Option.scala:447)
at org.apache.spark.deploy.SparkSubmitArguments.loadEnvironmentArguments(SparkSubmitArguments.scala:157)
at org.apache.spark.deploy.SparkSubmitArguments.(SparkSubmitArguments.scala:115)
at org.apache.spark.deploy.SparkSubmit$$anon$2$$anon$3.(SparkSubmit.scala:1013)
at org.apache.spark.deploy.SparkSubmit$$anon$2.parseArguments(SparkSubmit.scala:1013)
at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:85)
at org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:1030)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:1039)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Caused by: java.lang.reflect.InaccessibleObjectException: Unable to make private java.nio.DirectByteBuffer(long,int) accessible: module java.base does not "opens java.nio" to unnamed module @4ccc0db7
at java.base/java.lang.reflect.AccessibleObject.checkCanSetAccessible(AccessibleObject.java:357)
at java.base/java.lang.reflect.AccessibleObject.checkCanSetAccessible(AccessibleObject.java:297)
at java.base/java.lang.reflect.Constructor.checkCanSetAccessible(Constructor.java:188)
at java.base/java.lang.reflect.Constructor.setAccessible(Constructor.java:181)
at org.apache.spark.unsafe.Platform.(Platform.java:56)
... 13 more
Traceback (most recent call last):
File "/Users/chprasad/Desktop/chaitanya personal/study/tutorials/python/RddTutorial/main.py", line 17, in
main()
File "/Users/chprasad/Desktop/chaitanya personal/study/tutorials/python/RddTutorial/main.py", line 13, in main
sc = RDD1.init_spark()
File "/Users/chprasad/Desktop/chaitanya personal/study/tutorials/python/RddTutorial/RDD1.py", line 15, in init_spark
sc = SparkContext(conf=con)
File "/Users/chprasad/Desktop/chaitanya personal/study/tutorials/python/RddTutorial/venv/lib/python3.9/site-packages/pyspark/context.py", line 144, in init
SparkContext._ensure_initialized(self, gateway=gateway, conf=conf)
File "/Users/chprasad/Desktop/chaitanya personal/study/tutorials/python/RddTutorial/venv/lib/python3.9/site-packages/pyspark/context.py", line 331, in _ensure_initialized
SparkContext._gateway = gateway or launch_gateway(conf)
File "/Users/chprasad/Desktop/chaitanya personal/study/tutorials/python/RddTutorial/venv/lib/python3.9/site-packages/pyspark/java_gateway.py", line 108, in launch_gateway
raise Exception("Java gateway process exited before sending its port number")
Exception: Java gateway process exited before sending its port number

@EMCP
Copy link

EMCP commented Apr 23, 2022

same issue for me as @chaitanya94prasad

@VijaybabuNakkonda
Copy link

I faced the same issue. But with Python3 the code is working fine. Not sure how to manage. Please let me know if you found a solution

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment