Tested with Apache Spark 1.3.1, Python 2.7.9 and Java 1.8.0_45 + workaround for Spark 1.4.x from @enahwe.
Download and install Java from oracle.com
import pandas as pd

# List unique values in a DataFrame column
pd.unique(df['column_name'])
# Convert Series datatype to numeric, turning any non-numeric values into NaN
df['col'] = pd.to_numeric(df['col'], errors='coerce')
# Grab DataFrame rows where column has certain values
value_list = ['value1', 'value2', 'value3']
df = df[df.column.isin(value_list)]
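The three pandas snippets above can be tried end to end on a small frame. This is only a minimal sketch: the sample data and the column names `city` and `price` are made up for illustration, and `pd.to_numeric(..., errors="coerce")` is used for the numeric conversion, which turns unparseable entries into NaN.

```python
import pandas as pd

# Hypothetical sample data; the column names are assumptions for illustration.
df = pd.DataFrame({
    "city": ["Oslo", "Rome", "Oslo", "Lima"],
    "price": ["10", "abc", "12.5", "7"],
})

# Unique values in a column, in order of first appearance.
unique_cities = list(pd.unique(df["city"]))

# Coerce a column to numeric; entries that cannot be parsed become NaN.
df["price"] = pd.to_numeric(df["price"], errors="coerce")

# Keep only the rows whose column value is in the given list.
value_list = ["Oslo", "Lima"]
filtered = df[df["city"].isin(value_list)]

print(unique_cities)
print(filtered)
```

If the NaN rows should be dropped rather than kept, `df.dropna(subset=["price"])` after the conversion is one way to do it.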
I hereby claim:
To claim this, I am signing this object:
$ sudo npm install -g hexo-cli
$ hexo -v
hexo-cli: 0.1.9
# prereqs
sudo apt-get -y update
sudo apt-get -y upgrade
sudo apt-get -y install build-essential
sudo apt-get -y install sudo wget vim screen git software-properties-common
# nvidia cuda drivers
cd /media
sudo wget http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1404/x86_64/cuda-repo-ubuntu1404_8.0.44-1_amd64.deb
sudo dpkg -i cuda-repo-ubuntu1404_8.0.44-1_amd64.deb