Where do I start with distributed computing?

Maybe you can read some papers related to MapReduce and distributed computing first, to gain a better understanding of it. Here are some I would like to recommand: MapReduce: Simplified Data Processing on Large Clusters usenix.org/events/osdi04/tech/full_paper... Bigtable: A Distributed Storage System for Structured Data usenix.org/events/osdi06/tech/chang/chan... Dryad: Distributed data-parallel programs from sequential building blocks pdos.csail.mit.edu/6.824-2007/papers/isa... The landscape of parallel computing research: A view from berkeley citeseerx.ist.psu.edu/viewdoc/download?d... On the other hand, if you want to know better of Hadoop, maybe you can start reading Hadoop MapReduce framework source code.

Maybe you can read some papers related to MapReduce and distributed computing first, to gain a better understanding of it. Here are some I would like to recommand: MapReduce: Simplified Data Processing on Large Clusters, usenix.org/events/osdi04/tech/full_paper... Bigtable: A Distributed Storage System for Structured Data, usenix.org/events/osdi06/tech/chang/chan... Dryad: Distributed data-parallel programs from sequential building blocks, pdos.csail.mit.edu/6.824-2007/papers/isa... The landscape of parallel computing research: A view from berkeley, citeseerx.ist.psu.edu/viewdoc/download?d... On the other hand, if you want to know better of Hadoop, maybe you can start reading Hadoop MapReduce framework source code.

Currently, bookwise I would check out - Hadoop A Definitive Guide. Its written by Tom White who has worked on Hadoop for a good while now, and works at Cloudera with Doug Cutting (Hadoop creator). Also on the free side, Jimmy Lin from UMD has written a book called: Data-Intensive Text Processing with MapReduce.

Here's a link to the final pre-production verison (link provided by the author on his website).

Here are some resources from Yahoo! Developer Network a tutorial: developer.yahoo.com/hadoop/tutorial/ an introductory course (requires Siverlight, sigh): yahoo.hosted.panopto.com/CourseCast/View....

The All Things Hadoop Podcast allthingshadoop.com/podcast has some good content and good guests. A lot of it is geared to getting started with Distributed Computing.

I cant really gove you an answer,but what I can give you is a way to a solution, that is you have to find the anglde that you relate to or peaks your interest. A good paper is one that people get drawn into because it reaches them ln some way.As for me WW11 to me, I think of the holocaust and the effect it had on the survivors, their families and those who stood by and did nothing until it was too late.

Related Questions