Emaad Manzoor emaadmanzoor

## amsterdam-london-sp.cypher
MATCH (source:Place {id: "Amsterdam"}), (destination:Place {id: "London"})
CALL gds.shortestPath.dijkstra.stream({
  nodeProjection: 'Place',
  relationshipProjection: {
    ROAD: {
      type: 'EROAD',
      properties: 'distance',
      orientation: 'UNDIRECTED'
    }
  },

## 02-kafka.service
[Unit]
Requires=zookeeper.service
After=zookeeper.service

[Service]
Type=simple
User=gb760
ExecStart=/bin/sh -c '/home/gb760/kafka/bin/kafka-server-start.sh /home/gb760/kafka/config/server.properties > /home/gb760/kafka/kafka.log 2>&1'
ExecStop=/home/gb760/kafka/bin/kafka-server-stop.sh
Restart=on-abnormal

## Word embeddings via PMI-matrix factorization.ipynb

      
              1 file
            
          
              3 forks
            
          
              0 comments
            
          
              8 stars
            
          
                emaadmanzoor
                / Word embeddings via PMI-matrix factorization.ipynb
            
            
              Created
              January 26, 2018 16:24
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## 95-865-Model_Evaluation_Demo.md

      
              2 files
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                emaadmanzoor
                / 95-865-Model_Evaluation_Demo.md
            
            
              Last active
              February 16, 2018 16:16
            
              
                95865 Model Evaluation Demo
              
          
    95865 Recitation - Model Evaluation Demo


Lecture slides: https://speakerdeck.com/emaadmanzoor/introduction-to-classification
Course website: http://www.andrew.cmu.edu/user/georgech/95-865/
Other recitations: http://www.eyeshalfclosed.com/teaching/

Contact:

Emaad Ahmed Manzoor (emaad[at]cmu.edu)
George Chen (georgechen[at]cmu.edu)


## get_spark_streaming_batch_statistics.py
#!/usr/bin/env python

# Copyright 2016 Emaad Ahmed Manzoor
# License: Apache License, Version 2.0
# http://www.eyeshalfclosed.com/blog/2016/07/22/spark-streaming-statistics/

"""
Get Spark Streaming microbatch statistics:
    - Batch start time
    - Scheduling delay (in seconds) for each microbatch

## 00-CDM_StreamSpot_Translation.md

      
              5 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                emaadmanzoor
                / 00-CDM_StreamSpot_Translation.md
            
            
              Last active
              July 4, 2016 06:25
            
              
                CDM to StreamSpot Format Translation
              
          
    Contents


README

StreamSpot format
CDM format

CDM events
CDM subjects, objects and principals
CDM simple edges


Combining CDM records into a StreamSpot edge


[Assigning a graph ID to each StreamSpot edge](https://gist.github.com/emaadmanzoor/b23d46c9d4d064746e1ef667cb8070bc#assi


## 00-StreamSpot-Bootstrap-Clusters.md

      
              4 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                emaadmanzoor
                / 00-StreamSpot-Bootstrap-Clusters.md
            
            
              Last active
              February 18, 2016 01:18
            
              
                StreamSpot Bootstrap Clusters
              
          
    StreamSpot Bootstrap Clusters

www3.cs.stonybrook.edu/~emanzoor/streamspot/
Below are the bootstrap clusters used for the experiments in the StreamSpot paper for each of following datasets:

all (01-C50_k10_all.txt): Chunk length of 50, 10 clusters.
ydc (02-C25_k5_ydc.txt): Chunk length of 25, 5 clusters.
gfc (03-C50_k5_gfc.txt): Chunk length of 50, 5 clusters.


## QuantifyingMonotonyAversion.md

      
              2 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                emaadmanzoor
                / QuantifyingMonotonyAversion.md
            
            
              Last active
              August 29, 2015 14:18
            
              
                Quantifying Monotony Aversion
              
          
    See the project website for more details.
Please report any issues to emaadahmed.manzoor@kaust.edu.sa.
Execution

Running this requires having the following files in the same directory as calculate_cluster_statistics.py:

all_links.p
all_tweets.p


## AttentionPotentialValidation.md

      
              2 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                emaadmanzoor
                / AttentionPotentialValidation.md
            
            
              Last active
              August 29, 2015 14:18
            
              
                Attention Potential Validation Code
              
          
    See the project website for more details.
Please report any issues to emaadahmed.manzoor@kaust.edu.sa.
Correlation Results

The attention potential (as estimated in section 4), when evaluated on this Twitter dataset:

Is 73.61% correlated with the retweets obtained.
Is significantly correlated (p < 0.05).


## freivald.py
import random
import operator

t = int(raw_input())
randint = random.randint

def deterministic(a,b,c,n):
    no = 0
    for p in xrange(n):
        for q in xrange(n):
	MATCH (source:Place {id: "Amsterdam"}), (destination:Place {id: "London"})
	CALL gds.shortestPath.dijkstra.stream({
	nodeProjection: 'Place',
	relationshipProjection: {
	ROAD: {
	type: 'EROAD',
	properties: 'distance',
	orientation: 'UNDIRECTED'
	}
	},
	[Unit]
	Requires=zookeeper.service
	After=zookeeper.service

	[Service]
	Type=simple
	User=gb760
	ExecStart=/bin/sh -c '/home/gb760/kafka/bin/kafka-server-start.sh /home/gb760/kafka/config/server.properties > /home/gb760/kafka/kafka.log 2>&1'
	ExecStop=/home/gb760/kafka/bin/kafka-server-stop.sh
	Restart=on-abnormal
	#!/usr/bin/env python

	# Copyright 2016 Emaad Ahmed Manzoor
	# License: Apache License, Version 2.0
	# http://www.eyeshalfclosed.com/blog/2016/07/22/spark-streaming-statistics/

	"""
	Get Spark Streaming microbatch statistics:
	- Batch start time
	- Scheduling delay (in seconds) for each microbatch
	import random
	import operator

	t = int(raw_input())
	randint = random.randint

	def deterministic(a,b,c,n):
	no = 0
	for p in xrange(n):
	for q in xrange(n):