Skip to content

Instantly share code, notes, and snippets.

View ramleethal's full-sized avatar
🎯
Focusing

sergey ramleethal

🎯
Focusing
View GitHub Profile
from pyspark.sql import SparkSession
import argparse
import os
import random
import string
def generate_random_data(size_mb):
chars_per_mb = 1024 * 1024
return ''.join(random.choice(string.ascii_letters) for _ in range(int(size_mb * chars_per_mb)))