Always Learning

Advanced Search

Hadoop in 24 Hours, Sams Teach Yourself

Hadoop in 24 Hours, Sams Teach Yourself

Jeffrey Aven

Apr 2017, Paperback, 496 pages
ISBN13: 9780672338526
ISBN10: 0672338521
Special online offer - Save 30%
Was 36.99, Now 25.89Save: 11.10
  • Print pagePrint page
  • Email this pageEmail page
  • Share

Apache Hadoop is the technology at the heart of the Big Data revolution, and Hadoop skills are in enormous demand. Now, in just 24 lessons of one hour or less, you can learn all the skills and techniques you'll need to deploy each key component of a Hadoop platform in your local environment or in the cloud, building a fully functional Hadoop cluster and using it with real programs and datasets. Each short, easy lesson builds on all that's come before, helping you master all of Hadoop's essentials, and extend it to meet your unique challenges. Apache Hadoop in 24 Hours, Sams Teach Yourself covers all this, and much more:

  • Understanding Hadoop and the Hadoop Distributed File System (HDFS)
  • Importing data into Hadoop, and process it there
  • Mastering basic MapReduce Java programming, and using advanced MapReduce API concepts
  • Making the most of Apache Pig and Apache Hive
  • Implementing and administering YARN
  • Taking advantage of the full Hadoop ecosystem
  • Managing Hadoop clusters with Apache Ambari
  • Working with the Hadoop User Environment (HUE)
  • Scaling, securing, and troubleshooting Hadoop environments
  • Integrating Hadoop into the enterprise
  • Deploying Hadoop in the cloud
  • Getting started with Apache Spark

Step-by-step instructions walk you through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; "Did You Know?" tips offer insider advice and shortcuts; and "Watch Out!" alerts help you avoid pitfalls. By the time you're finished, you'll be comfortable using Apache Hadoop to solve a wide spectrum of Big Data problems.

Hour 1: Introduction to Hadoop
Hour 2: Understanding the Hadoop Distributed File System (HDFS)
Hour 3: Getting Data into Hadoop
Hour 4: Understanding Data Processing in Hadoop
Hour 5: MapReduce Programming in Java
Hour 6: Advanced MapReduce API Concepts
Hour 7: Introduction to Apache Pig
Hour 8: Advanced Pig Usage
Hour 9: Introduction to Apache Hive
Hour 10: Advanced Hive Usage
Hour 11: YARN Administration
Hour 12: SQL on Hadoop Overview
Hour 13: The Hadoop Ecosystem
Hour 14: Cluster Management using Apache Ambari
Hour 15: Scaling Hadoop
Hour 16: Advanced Cluster Configuration
Hour 17: The Hadoop User Environment (HUE)
Hour 18: Advanced HDFS
Hour 19: Securing Hadoop
Hour 20: Troubleshooting Hadoop
Hour 21: Integrating Hadoop into the Enterprise
Hour 22: Hadoop in the Cloud
Hour 23: Introduction to NoSQL
Hour 24: Introduction to Apache Spark

  • Covers all aspects of the Hadoop platform, its interfaces, and its key ecosystem components and associated Big Data technologies
  • Shows how to build Hadoop solutions step by step, with all samples available for download
  • Teaches through practical instructions, realistic examples, hands-on workshops, Q-and-As, quizzes, exercises, tips, and more