From StackOverflow
http://stackoverflow.com/questions/6385888/what-are-some-good-resources-for-studying-hadoops-source-code
Studying Hadoop or MapReduce can be a daunting task if you get your hand dirty at the start.
I followed the schedule as follows :
- Start with very basics of MR with code.google.com/edu/parallel/dsd-tutorial.html code.google.com/edu/parallel/mapreduce-tutorial.html
- Then go for the first two lectures in www.cs.washington.edu/education/courses/cse490h/08au/lectures.htm A very good course intro to MapReduce and Hadoop.
- Read the seminal paper labs.google.com/papers/mapreduce.html and its improvements in the updated version http://www.cs.washington.edu/education/courses/cse490h/08au/readings/communications200801-dl.pdf
- Then go for all the other videos in the U.Washington link given above.
- Try youtubing the terms Map reduce and hadoop to find videos by ORielly and Google RoundTable for good overview of the future of Hadoop and MapReduce
- Then off to the most important videos - Cloudera Videos www.cloudera.com/resources/?media=Video and Google MiniLecture Series code.google.com/edu/submissions/mapreduce-minilecture/listing.html
Along with all the Multimedia above we need good written material
Documents:
- Architecture diagrams at hadooper.blogspot.com are good to have on your wall
- Hadoop: The definitive guide goes more into the nuts and bolts of the whole system where as Hadoop in Action is a good read with lots of teaching examples to learn the concepts of hadoop. Pro Hadoop is not for beginners
- pdfs of the documentation from Apache Foundation hadoop.apache.org/common/docs/current/ and hadoop.apache.org/common/docs/stable/ will help you learn as to how model your problem into a MR solution in order to gain the advantages of Hadoop in total.
- HDFS paper by Yahoo! Research is also a good read in order to gain in depth knowledge of hadoop
- Subscribe to the User Mailing List of Commons, MapReduce and HDFS in order to know problems, solutions and future solutions.
- Try the http://developer.yahoo.com/hadoop/tutorial/module1.html link for beginners to expert path to Hadoop
- For Any Queries ...Contact Apache, Google, Bing, Yahoo!
link|improve this answer
answered Jul 26 '11 at 7:24
Varad Meru
361
分享到:
相关推荐
If you are a developer with some experience with Spark and want to strengthen your knowledge of how to get around in the world of Spark, then this book is ideal for you. Basic knowledge of Linux, ...
This study focus on how to use the open source Hadoop cloud computing systems to build a cloud platform, and how to use MapReduce model to achieve the improved SVM classification algorithm on the ...
Data analysis is the process of applying logical and analytical reasoning to study each component of data present in the system. Python is a multi-domain, high-level, programming language that offers ...
You will learn to use Spark as a big data operating system, understand how to implement advanced analytics on the new APIs, and explore how easy it is to use Spark in day-to-day tasks. Style and ...
You learn the key to successful database implementation as you study how to properly design databases to fit within the larger strategic data environment. Clear, straightforward writing supports an ...
If you are a developer with some experience with Spark and want to strengthen your knowledge of how to get around in the world of Spark, then this book is ideal for you. Basic knowledge of Linux, ...
- **Watermarking:** Watermarking helps handle late-arriving events by defining a threshold for how far behind the current watermark an event can be. - **Fault Tolerance:** Structured Streaming ensures...