Skip to content

Instantly share code, notes, and snippets.

@wenming
Created June 16, 2012 14:41
Show Gist options
  • Save wenming/2941512 to your computer and use it in GitHub Desktop.
Save wenming/2941512 to your computer and use it in GitHub Desktop.
Resources for Hadoop on Windows Azure
Hadooponazure.com is strictly a private CTP for microsoft's hadoop distro. It supports HIVE, PIG, a javascript console, a web portal. You can also terminal service into the actual clusters as needed. There's a lot of tutorials in the training kit, there's a deck and there's a bunch of tutorials.
You should also be able to find content on windowsazure.com
http://www.windowsazure.com/en-us/develop/net/scenarios/big-data/
http://www.windowsazure.com/en-us/develop/net/how-to-guides/hadoop/
I recommend going through at least one of these tutorials:
http://www.windowsazure.com/en-us/develop/net/tutorials/hadoop-marketplace/
and perhaps look at this deck in addition to the one included in the training kit. the http://view.officeapps.live.com/op/view.aspx?src=http%3a%2f%2fvideo.ch9.ms%2fteched%2f2012%2fna%2fAZR325.pptx
If you want to learn more about big data, these are great resources below.
Tutorials: https://www.windowsazure.com/en-us/develop/net/how-to-guides/hadoop/
Blog: http://blogs.msdn.com/hpctrekker/
Twitter: @wenmingye
Books:
Hadoop The Definitive Guide *
Mining the Social Web
Collective Intelligence *
Mahout In Action
Data Source Handbook Pete Warden
Big Data Now: Current Perspectives from O'Reilly Radar *
Million song database data cached here: AbsoluteUri http://wrfstorage2.blob.core.windows.net/dataset/train_triplets.txt.zip
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment