Skip to content

Instantly share code, notes, and snippets.

Bulat Yaminov byaminov

  • Utrecht
View GitHub Profile
byaminov /
Created Apr 11, 2019
Running Spark benchmarks to compare its string operations with Vaex and Pandas
Benchark ran on my laptop:
spark-submit --master local[*] benchmarks/
To run it:
* Download and install Spark 2.4.0 (
* Run the Vaex & Pandas benchmark (,
the test.parquet file will be created
* Set `args_n` constant in this script to the same value you used for `n` variable,
e.g. `python -n8`.
You can’t perform that action at this time.