Skip to content

Instantly share code, notes, and snippets.

@icaoberg
Last active August 29, 2015 14:15
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save icaoberg/1edb8054129a8652fc9c to your computer and use it in GitHub Desktop.
Save icaoberg/1edb8054129a8652fc9c to your computer and use it in GitHub Desktop.
Helper script that downloads the 2D HeLa dataset from the Murphy Lab at Carnegie Mellon University
#!/bin/bash
echo "*********************"
echo "* LICENSE AGREEMENT *"
echo "*********************"
echo "All data downloaded from the Murphy Lab web site (http://murphylab.web.cmu.edu) remain the property of the
Murphy"
echo "Lab. You are granted a non-exclusive license to use these images for non-commercial, research purposes, with the"
echo "following conditions: (a) you agree to include a reference to"
echo ""
echo "R. F. Murphy, M. V. Boland and M. Velliste (2000). Towards a Systematics for Protein Subcellular Location: Quantitative Description of Protein Localization Patterns and Automated Analysis of Fluorescence Microscope Images. Proc Int Conf Intell Syst Mol Biol (ISMB 2000) 8: 251-259."
echo "M. V. Boland and R. F. Murphy (2001). A Neural Network Classifier Capable of Recognizing the Patterns of all Major Subcellular Structures in Fluorescence Microscope Images of HeLa Cells. Bioinformatics 17:1213-1223."
echo "E.J.S. Roques and R.F. Murphy (2002). Objective evaluation of differences in protein subcellular distribution. Traffic 3: 61-65."
echo ""
echo "when presenting or publishing your results, (b) you agree to send a copy of any paper or abstract (upon its"
echo "acceptance for publication or presentation) by FAX, email or regular mail to Dr. Murphy, and (c) you agree not to "
echo "distribute these images without inclusion of this notice."
echo "Dr. Robert F. Murphy"
echo "Professor of Biological Sciences, Biomedical Engineering and Machine Learning"
echo "Carnegie Mellon University"
echo "4400 Fifth Ave., Pittsburgh, PA 15213"
echo "FAX: 412-268-6571"
echo "email: murphy@cmu.edu"
echo ""
WEBSITE="http://murphylab.web.cmu.edu/data"
URLS=( "HeLa10Class2DImages_16bit_tiff.tgz"
"HeLa10Class2DImages_16bit_scaled.tgz"
"HeLa10Class2DImages_16bit_unscaled.tgz"
"2Dhelaimage.txt"
"HeLa10Class2DImages_16bit_dna_protein_png.tgz"
"HeLa10Class2DFeatures180.mat"
"HeLa10Class2DFeatures19990526.csv"
"HeLa10Class2DFeatures19990526.tab")
for URL in "${URLS[@]}"
do
echo Downloading "$WEBSITE"/"$URL" to `pwd`
wget --quiet -nc "$WEBSITE"/"$URL"
done
echo ""
for FILE in *.tgz
do
if [ -f "$FILE" ]; then
echo "Decompressing" "$FILE"
tar -xf $FILE
echo "Removing file" "$FILE"
rm -fv $FILE
fi
done
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment