Tanveer Ahmad tahashmi

## ILP.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              2 stars
            
          
                linuxster
                / ILP.ipynb
            
            
              Last active
              August 9, 2023 16:43
            
              
                ILP example in python
              
          
      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## echo.py
import socket
import sys
import time
import struct

host = 'localhost'
port = 8888
buffersize = 1024
N = 1000000
server_address = (host, port)

## how-to-install-latest-gcc-on-ubuntu-lts.txt
These commands are based on a askubuntu answer http://askubuntu.com/a/581497
To install gcc-6 (gcc-6.1.1), I had to do more stuff as shown below.
USE THOSE COMMANDS AT YOUR OWN RISK. I SHALL NOT BE RESPONSIBLE FOR ANYTHING.
ABSOLUTELY NO WARRANTY.

If you are still reading let's carry on with the code.

sudo apt-get update && \
sudo apt-get install build-essential software-properties-common -y && \
sudo add-apt-repository ppa:ubuntu-toolchain-r/test -y && \

## parquet-benchmark-20170210.py
import gc
import os
import time

import numpy as np
import pandas as pd
from pyarrow.compat import guid
import pyarrow as pa
import pyarrow.parquet as pq
import snappy

## dask-xgboost-airlines.ipynb

      
              1 file
            
          
              10 forks
            
          
              4 comments
            
          
              10 stars
            
          
                mrocklin
                / dask-xgboost-airlines.ipynb
            
            
              Created
              February 21, 2017 00:34
            
          
      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## PySpark DataFrame from many small pandas DataFrames.ipynb

      
              2 files
            
          
              4 forks
            
          
              9 comments
            
          
              4 stars
            
          
                linar-jether
                / PySpark DataFrame from many small pandas DataFrames.ipynb
            
            
              Created
              July 8, 2018 10:15
            
              
                Convert a RDD of pandas DataFrames to a single Spark DataFrame using Arrow and without collecting all data in the driver.
              
          
      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## cuda_installation_on_ubuntu_18.04
#!/bin/bash
## This gist contains step by step instructions to install cuda v9.0 and cudnn 7.3 in ubuntu 18.04

### steps ####
# verify the system has a cuda-capable gpu
# download and install the nvidia cuda toolkit and cudnn
# setup environmental variables
# verify the installation
###

## arrow-build.sh
#!/usr/bin/env bash

set -eu

PWD="$( cd "$( dirname "${BASH_SOURCE[0]}" )" >/dev/null 2>&1 && pwd )"
SRC_DIR=$(realpath "${PWD}/..")
CXX_SRC=${SRC_DIR}/cpp

# The following can be set
: "${CMAKE:=cmake}"

## run_spark_cluster.sh
#!/bin/bash
#SBATCH --job-name spark-cluster
#SBATCH --account=qh82
#SBATCH --time=02:00:00
# --- Master resources ---
#SBATCH --nodes=1
#SBATCH --mem-per-cpu=1G
#SBATCH --cpus-per-task=1
#SBATCH --ntasks-per-node=1
# --- Worker resources ---

## cuml-kmeans-mnmg-api.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              3 stars
            
          
                cjnolet
                / cuml-kmeans-mnmg-api.md
            
            
              Last active
              August 17, 2022 05:35
            
              
                Simple example of cuML's K-Means Single-GPU (SG) and Multi-Node Multi-GPU (MNMG) APIs compared to Scikit-learn and Dask-ML
              
          
    Comparing cuML K-Means API Against Scikit-learn & Dask-ML

First, a quick code example of K-Means in Scikit-learn
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans

n_centers = 5

X, _ = make_blobs(n_samples=10000, n_centers=n_centers)
	import socket
	import sys
	import time
	import struct

	host = 'localhost'
	port = 8888
	buffersize = 1024
	N = 1000000
	server_address = (host, port)
	These commands are based on a askubuntu answer http://askubuntu.com/a/581497
	To install gcc-6 (gcc-6.1.1), I had to do more stuff as shown below.
	USE THOSE COMMANDS AT YOUR OWN RISK. I SHALL NOT BE RESPONSIBLE FOR ANYTHING.
	ABSOLUTELY NO WARRANTY.

	If you are still reading let's carry on with the code.

	sudo apt-get update && \
	sudo apt-get install build-essential software-properties-common -y && \
	sudo add-apt-repository ppa:ubuntu-toolchain-r/test -y && \
	import gc
	import os
	import time

	import numpy as np
	import pandas as pd
	from pyarrow.compat import guid
	import pyarrow as pa
	import pyarrow.parquet as pq
	import snappy
	#!/bin/bash
	## This gist contains step by step instructions to install cuda v9.0 and cudnn 7.3 in ubuntu 18.04

	### steps ####
	# verify the system has a cuda-capable gpu
	# download and install the nvidia cuda toolkit and cudnn
	# setup environmental variables
	# verify the installation
	###
	#!/usr/bin/env bash

	set -eu

	PWD="$( cd "$( dirname "${BASH_SOURCE[0]}" )" >/dev/null 2>&1 && pwd )"
	SRC_DIR=$(realpath "${PWD}/..")
	CXX_SRC=${SRC_DIR}/cpp

	# The following can be set
	: "${CMAKE:=cmake}"
	#!/bin/bash
	#SBATCH --job-name spark-cluster
	#SBATCH --account=qh82
	#SBATCH --time=02:00:00
	# --- Master resources ---
	#SBATCH --nodes=1
	#SBATCH --mem-per-cpu=1G
	#SBATCH --cpus-per-task=1
	#SBATCH --ntasks-per-node=1
	# --- Worker resources ---