Open Source Books: Hadoop, the Definitive Guide

“Hadoop: The Definitive Guide” is the first book covering the now famous java framework supporting data intensive distributed applications.

Doug Cutting, the project’s author now working at Cloudera, wrote that Tom White - author of the book and long time contributor to the Apache top-level project - is the most qualified person to write a book about hadoop.

The book starts with an introduction to Google’s MapReduce, than it looks in depth first at HDFS, Hadoop’s own filesystem and I/O fundamentals in Hadoop.

The guide covers also Hadoop administration, and reports a number of  case studies, introducing the user to use Pig (a high level query language for large-scale data processing), HBase (Hadoop’s database for structured and semi-structured data) and ZooKeeper, a toolkit of coordination primitives for building distributed systems.

To know more about Hadoop and MapReduce read also “Getting started with Hadoop and MapReduce“.

Share and Enjoy:
  • Digg
  • Sphinn
  • del.icio.us
  • Facebook
  • StumbleUpon
  • Google
  • Technorati
  • TwitThis

  1. No Comments
  1. 1 Open Source Books: Hadoop, the Definitive Guide | Open Hacking

Leave a Reply


About the Editor

Roberto Galoppini on Open Source Software
Roberto has over 20 years experience in the computer industry, and has spent the last 10 years working in the intersection of open source software and business development. Roberto has taken an active interest in different open source projects and organizations, he also served on some advisory boards, and helped large IT vendors, open source vendors and customers to design and deploy their open source strategies. He works at SourceForge, and opinions expressed here don't necessarily represent employer's positions, strategies, or opinion.