Please help transcribe this video using our simple transcription tool. You need to be logged in to do so.


Note: This is the part-1 of the tutorial. Please see Mapreduce Part-2 for the rest of the presentation. Abstract: Apache Hadoop has become the platform of choice for developing large-scale data-intensive applications. In this tutorial, we will discuss design philosophy of Hadoop, describe how to design and develop Hadoop applications and higher-level application frameworks to crunch several terabytes of data, using anywhere from four to 4,000 computers. We will discuss solutions to common problems encountered in maximizing Hadoop application performance. We will also describe several frameworks and utilities developed using Hadoop that increase programmer-productivity and application-performance.

Questions and Answers

You need to be logged in to be able to post here.