Vijay Anand Pandian vijayanandrp

## system_design_interview_notes_with_python.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              2 stars
            
          
                vijayanandrp
                / system_design_interview_notes_with_python.md
            
            
              Created
              June 17, 2023 14:16
            
              
                System Design - All in One Interview - Reading notes with python examples
              
          
    Complete System Design Series

With examples and intelligible explanations…


Pic credits : Github
Welcome back peeps. We are now starting System Design Series ( over weekends) where we will cover how to design large ( and great) systems, the techniques, tip/tricks that you can refer to in order to scale these systems. As a senior software engineer it’s expected that you know not just the breadth but also depth of the system design concepts.

  
## kconnect.py
# credits source : https://gist.github.com/rueedlinger/76af36d04a0798a8e1f43ed16595bd97

import sys
import os
import json
import argparse
from base64 import b64encode

PYTHON_MAJOR_VERSION = sys.version_info.major
DEFAULT_HOST = 'localhost'

## pyspark_explode_null.py
from pyspark.sql.functions import *

def flatten_df(nested_df):
    flat_cols = [c[0] for c in nested_df.dtypes if c[1][:6] != 'struct']
    nested_cols = [c[0] for c in nested_df.dtypes if c[1][:6] == 'struct']
    flat_df = nested_df.select(flat_cols +
                               [col(nc + '.' + c).alias(nc + '_' + c)
                                for nc in nested_cols
                                for c in nested_df.select(nc + '.*').columns])
    print("flatten_df_count :", flat_df.count())

## mongodb_quest.py
# *-* coding: utf-8 *-*

import requests

try:
    from pymongo import MongoClient
except ImportError:
    raise ImportError('PyMongo is not installed')

try:

## Bigquery_util.py
#!/usr/bin/env python

# Copyright 2016 Google Inc. All Rights Reserved.
import os
import sys
import time
os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = 'BigQuery.json'

from google.cloud import bigquery
from google.cloud.bigquery.job import DestinationFormat, ExtractJobConfig, Compression

## bigquery_poc.py
from google.cloud import bigquery

import os

os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = 'Google Analytics POC.json'

client = bigquery.Client()

query_job = client.query("""
    SELECT

## lib_config_manager.py
#!/usr/bin/env python3.5
# encoding: utf-8

import configparser

config = configparser.ConfigParser()

# I believe this config parser should use the perl autovivification method to create dynamic objects
config['DEFAULT'] = {
                        'Name': 'Vijay Anand',

## example.ini
[DEFAULT]
married = False
sex = M
name = Vijay Anand
age = 26
nationality = Indian

[www.facebook.com]
user_name = VjyAnnd

## Yelp_Predictions.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                vijayanandrp
                / Yelp_Predictions.md
            
            
              Created
              December 19, 2017 18:56
            
              
                https://informationcorners.com/yelp-reviews-star-rating-prediction/
              
          
    Tutorial Exercise: Yelp reviews (Solution)

Introduction

This exercise uses a small subset of the data from Kaggle's Yelp Business Rating Prediction competition.
Description of the data:

yelp.csv contains the dataset. It is stored in the repository (in the data directory), so there is no need to download anything from the Kaggle website.


## spam_predict_1.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                vijayanandrp
                / spam_predict_1.md
            
            
              Last active
              December 15, 2017 17:38
            
              
                https://informationcorners.com/text-sms-spam-classifier/
              
          
    Text SMS - Spam Classification Model

The base requirement of this project is to analyse the SMS dataset and come up with a machine learning models to predict or claissify the sms text. For getting my latest code and datasets please do visit my github.com account.
The following are the list of actions that we gonna do to solve this problem approach


Reading a text-based dataset into pandas
Vectorizing our dataset
Building and evaluating a model
	# credits source : https://gist.github.com/rueedlinger/76af36d04a0798a8e1f43ed16595bd97

	import sys
	import os
	import json
	import argparse
	from base64 import b64encode

	PYTHON_MAJOR_VERSION = sys.version_info.major
	DEFAULT_HOST = 'localhost'
	from pyspark.sql.functions import *

	def flatten_df(nested_df):
	flat_cols = [c[0] for c in nested_df.dtypes if c[1][:6] != 'struct']
	nested_cols = [c[0] for c in nested_df.dtypes if c[1][:6] == 'struct']
	flat_df = nested_df.select(flat_cols +
	[col(nc + '.' + c).alias(nc + '_' + c)
	for nc in nested_cols
	for c in nested_df.select(nc + '.*').columns])
	print("flatten_df_count :", flat_df.count())
	# - coding: utf-8 -

	import requests

	try:
	from pymongo import MongoClient
	except ImportError:
	raise ImportError('PyMongo is not installed')

	try:
	#!/usr/bin/env python

	# Copyright 2016 Google Inc. All Rights Reserved.
	import os
	import sys
	import time
	os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = 'BigQuery.json'

	from google.cloud import bigquery
	from google.cloud.bigquery.job import DestinationFormat, ExtractJobConfig, Compression
	#!/usr/bin/env python3.5
	# encoding: utf-8

	import configparser

	config = configparser.ConfigParser()

	# I believe this config parser should use the perl autovivification method to create dynamic objects
	config['DEFAULT'] = {
	'Name': 'Vijay Anand',
	[DEFAULT]
	married = False
	sex = M
	name = Vijay Anand
	age = 26
	nationality = Indian

	[www.facebook.com]
	user_name = VjyAnnd