Skip to content

Instantly share code, notes, and snippets.

#!/bin/bash
rm *.jpg
rm *.png
rm *.gif
rm *.mp4
find ../ -mindepth 1 -maxdepth 1 -mtime -7 -name "*.jpg" -exec cp -t . {} +
export MAGICK_THREAD_LIMIT=4
@selbyk
selbyk / movie.sh
Last active December 5, 2015 20:02
#!/bin/bash
# Set to number of cores your computer has or you want to use
export MAGICK_THREAD_LIMIT=8
# Deleted old files
rm *.jpg
rm *.png
rm *.gif
rm *.mp4
float catalan_nums_rec(int numTerms){
return catalan_nums_aux(numTerms, 1, 1.0);
}
float catalan_nums_aux(int numTerms, int n, float lastC){
if( n < numTerms )
return catalan_nums_aux(numTerms, n+1, 2*(2*n+1)*lastC/(n+2)));
else
return 2*(2*n+1)*lastC/(n+2);
}
var chart = new Highcharts.Chart({
'chart': {
'renderTo': 'yw0'
},
'exporting': {
'enabled': true
},
'title': {
'text': 'Past Hour'
},
@selbyk
selbyk / threaded_vectors.clj
Created April 18, 2013 20:38
10 threads manipulating one shared data structure, which consists of 100 vectors each one containing 10 (initially sequential) unique numbers. Each thread then repeatedly selects two random positions in two random vectors and swaps them. All changes to the vectors occur in transactions by making use of Clojure's software transactional memory sys…
(defn run [nvecs nitems nthreads niters]
(let [vec-refs (vec (map (comp ref vec)
(partition nitems (range (* nvecs nitems)))))
swap #(let [v1 (rand-int nvecs)
v2 (rand-int nvecs)
i1 (rand-int nitems)
i2 (rand-int nitems)]
(dosync
(let [temp (nth @(vec-refs v1) i1)]
(alter (vec-refs v1) assoc i1 (nth @(vec-refs v2) i2))
@selbyk
selbyk / php.ini
Created April 24, 2013 00:31
PHP5 Apache php.ini
[PHP]
;;;;;;;;;;;;;;;;;;;
; About php.ini ;
;;;;;;;;;;;;;;;;;;;
; PHP's initialization file, generally called php.ini, is responsible for
; configuring many of the aspects of PHP's behavior.
; PHP attempts to find and load this configuration from a number of locations.
; The following is a summary of its search order:
#include <vector>
#include <fstream>
#include <iostream>
#include <ctype.h>
#include "fann.h"
#include "fann_cpp.h"
using namespace std;
void error(const char* p, const char* p2 = ""){
Scraper/Content Extraction Training
Goal: Fetch relevant information sources, extract only appropriate content, save as documents as training data and usable by Watson
Method:
Fetch a few pages from various data sources using Phantom.js, then parse and save the website’s HTML as JSON
Iterate the text elements and extract features such as size, position, text, CSS properties, etc
Run the DBSCAN clustering algorithm over the document’s extracted feature data. Similar elements such as titles, headers, and article content should be grouped into the same clusters
Manually tag a portion of the documents to use as training data
A support vector machine (SVM) with linear kernel using a 4-fold cross validation should be capable of detecting the main content of a scraped page
@selbyk
selbyk / find_running_process.sh
Created March 17, 2016 16:13
accepts a command line argument for the process name, returns an exit code of 1 if the process is currently running
#/bin/bash
# Usage: ./find_running_process.sh <process_name>
# DEBUG=1 ./find_running_process.sh <process_name>
# Function to help with debug messages
debug_message () {
if [ $DEBUG -eq 1 ]
then
echo $1
fi
@selbyk
selbyk / process_pids.sh
Created March 17, 2016 16:34
iterate through all running Linux processes and just print the process ID
#!/bin/bash
# Usage: ./process_pids.sh
for proc in /proc/*
do
FILENAME=${proc##*/}
if [[ $FILENAME =~ ^-?[0-9]+ ]]
then
echo $FILENAME
fi