A List of Hadoop Books

Advanced Analytics with Spark: Patterns for Learning from Data at Scale by Sandy Ryza, Uri Laserson, Sean Owen and Josh Wills
Agile Data Science: Building Data Analytics Applications with Hadoop by Russell Jurney
Apache Drill: The SQL query engine for Hadoop and NoSQL by Ted Dunning, Ellen Friedman, Tomer Shiran and Jacques Nadeau
Apache Flume: Distributed Log Collection for Hadoop -Second Edition by Steve Hoffman
Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 (Addison-Wesley Data & Analytics) by Arun Murthy, Vinod Vavilapalli, Douglas Eadline, Joseph Niemiec and Jeff Markham
Apache Oozie: The Workflow Scheduler for Hadoop by Mohammad Kamrul Islam and Aravind Srinivasan 
Apache Solr 3 Enterprise Search Server by David Smiley and Eric Pugh
Apache Solr 3.1 Cookbook by Rafal Kuć
Apache Solr High Performance by Surendra Mohan
Apache Spark in 24 Hours, Sams Teach Yourself by Jeffrey Aven
Apache Sqoop Cookbook: Unlocking Hadoop for Your Relational Database by Kathleen Ting and Jarek Jarcec Cecho
Architecting HBase Applications: A Guidebook for Successful Development and Design by Jean-Marc Spaggiari and Kevin O'Dell
Beginning Apache Pig: Big Data Processing Made Easy by Balaswamy Vaddeman
Big Data Analytics Beyond Hadoop: Real-Time Applications with Storm, Spark, and More Hadoop Alternatives (FT Press Analytics) by Vijay Srinivas Agneeswaran
Big Data Analytics with R and Hadoop by Vignesh Prajapati
Big Data: A Revolution That Will Transform How We Live, Work, and Think by Viktor Mayer-Schönberger and Kenneth Cukier
Big Data Essentials by Anil Maheshwari
Big Data For Beginners: Understanding SMART Big Data, Data Mining & Data Analytics For improved Business Performance, Life Decisions & More! (Data ... Computer Programming, Growth Hacking, ITIL)
by Vince Reynolds
Big Data For Dummies by Judith Hurwitz, Alan Nugent, Fern Halper and Marcia Kaufman
Big Data Forensics: Learning Hadoop Investigations by Joe Sremack
Big Data Governance: Modern Data Management Principles for Hadoop, NoSQL & Big Data Analytics
by Peter Ghavami PhD
Big Data Made Easy: A Working Guide to the Complete Hadoop Toolset by Michael Frampton
Big Data, MapReduce, Hadoop, and Spark with Python: Master Big Data Analytics and Data Wrangling with MapReduce Fundamentals using Hadoop, Spark, and Python by Lazy Programmer
Big Data: Principles and best practices of scalable realtime data systems by Nathan Marz and James Warren
Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka by Raul Estrada and Isaac Ruiz
CCA175: Hadoop and Spark Developer Exam Hands-on Practice Book and Preparation: CCA175 & CCP:DE575 by HadoopExam Learning Resources
Cassandra: The Definitive Guide: Distributed Data at Web Scale by Jeff Carpenter and Eben Hewitt Cloudera Certified Developer for Apache Hadoop Last Minute Guide: CCD-410 by LMG
Data Analytics Made Accessible: 2017 edition by Anil Maheshwari
Data Analytics with Hadoop: An Introduction for Data Scientists by Benjamin Bengfort and Jenny Kim Data Algorithms: Recipes for Scaling Up with Hadoop and Spark by Mahmoud Parsian
Data Munging with Hadoop (Addison-Wesley Data & Analytics Series) by Ofer Mendelevitch and Casey Stella
Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data by EMC Education Services
Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking by Foster Provost and Tom Fawcett
Data Science from Scratch: First Principles with Python by Joel Grus
Deep Learning: A Practitioner's Approach by Josh Patterson and Adam Gibson
Deep Learning with Hadoop by Dipayan Dev
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems by Martin Kleppmann
The Enterprise Big Data Lake: Delivering on the Promise of Hadoop and Data Science in the Enterprise by Alex Gorelik
Enterprise Data Design Patterns: Best Practices for Putting Hadoop to Work by Douglas Moore and Jeffrey Breen
Enterprise Lucene and Solr by Lajos Moczar
Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS (Addison-Wesley Data & Analytics Series) by Sam R. Alapati
Field Guide to Hadoop: An Introduction to Hadoop, Its Ecosystem, and Aligned Technologies
by Kevin Sitto and Marshall Presser
Ecosystem (Addison-Wesley Data & Analytics) by Douglas Eadline
Fundamentals of Deep Learning: Designing Next-Generation Machine Intelligence Algorithms
by Nikhil Buduma (Author) and Nicholas Locascio (Contributor)
Getting Started with Impala: Interactive SQL for Apache Hadoop by John Russell
Hadoop 2.x Administration Cookbook by Gurmukh Singh
Hadoop 2 Essentials: An End-to-End Approach by Dr. Henry H Liu
Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2
HADOOP & YARN INTERVIEW QUESTION AND ANSWER: LEARN BIG DATA HADOOP & YARN IN QA - ALL INTERVIEW QUESTIONS COVERED by GICGAC ACADEMY
Hadoop and Big Data: Introduction to Basics of Big data analytics by Dr. Rajesh Pasupuleti
Hadoop Application Architectures: Designing Real-World Big Data Applications by Mark Grover, Ted Malaska, Jonathan Seidman and Gwen Shapira
Hadoop Backup and Recovery solutions by Gaurav Barot, Chintan Mehta and Amij Patel
Hadoop Beginner's Guide by Garry Turkington
Hadoop Big Data Guide for beginners by Parveen
Hadoop Big Data Interview Questions (Illustrated): Shyam Mallesh by Shyam Mallesh
Hadoop Blueprints by Anurag Shrivastava and Tanmay Deshpande
Hadoop Beginner's Guide by Garry Turkington
Hadoop Cluster Deployment by Danil Zburivsky
Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale by Tom White
Hadoop: Data Processing and Modelling by Garry Turkington, Tanmay Deshpande and Sandeep Karanth
Hadoop: The Engine That Drives Big Data (New Street Executive Summaries) by Lars Nielsen
Hadoop Essentials: A Quantitative Approach by Henry H. Liu
Hadoop Essentials - Tackling the Challenges of Big Data with Hadoop (Community Experience Distilled)
by Shiva Achari
Hadoop For Dummies by Dirk deRoos
Hadoop for Finance Essentials by Rajiv Tiwari
Hadoop in 24 Hours, Sams Teach Yourself by Jeffrey Aven
Hadoop in Action by Chuck Lam
Hadoop Interview Guide by Monika Singla, Sneha Poddar and Shivansh Kumar
Hadoop in the Enterprise: Architecture: A Guide to Successful Integration by Jan Kunigk, Lars George, Paul Wilkinson and Ian Buss
Hadoop in Practice: Includes 104 Techniques by Alex Holmes
Hadoop MapReduce v2 Cookbook Second Edition by Thilina Gunarathne
Hadoop Operations: A Guide for Developers and Administrators by Eric Sammer
Hadoop Operations and Cluster Management Cookbook by Shumin Guo
Hadoop Real World Solutions Cookbook - Second Edition by Tanmay Deshpande
HADOOP Security Best Practices by Jason Burkett
Hadoop Security: Protecting Your Big Data Platform by Ben Spivey and Joey Echeverria
Hadoop Succinctly by Elton Stoneman
HBase: The Definitive Guide: Random Access to Your Planet-Size Data by Lars George
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark by Holden Karau and Rachel Warren
Integrating Hadoop by William McKnight
Introducing Data Science: Big Data, Machine Learning, and more, using Python tools
by Davy Cielen, Arno Meysman and Mohamed Ali
Kick Start: Hadoop: Learn Hadoop in Hours! by Mario Meir-Huber
Leaders and Innovators: How Data-Driven Organizations Are Winning with Analytics (Wiley and SAS Business Series) by Tho H. Nguyen, Bill Franks and James Taylor
Learning Hadoop 2 by Garry Turkington and Gabriele Modena  
Learn Hadoop in 1 Day: Master Big Data with this complete Guide by Krishna Rungta
Learning Spark: Lightning-Fast Big Data Analysis by Holden Karau, Andy Konwinski, Patrick Wendell and Matei Zaharia
MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems by Donald Miner and Adam Shook
Mastering Hadoop by Sandeep Karanth
Moving Hadoop to the Cloud: Harnessing Cloud Features and Flexibility for Hadoop Clusters by Bill Havanki
Network Storage: Tools and Technologies for Storing Your Company's Data by James O'Reilly
NoSQL and SQL Data Modeling: Bringing Together Data, Semantics, and Software by Ted Hills
Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale (Addison-Wesley Data & Analytics) by Ofer Mendelevitch, Casey Stella and Douglas Eadline
Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools by Deepak Vohra
Practical Hadoop Migration: How to Integrate Your RDBMS with the Hadoop Ecosystem and Re-Architect Relational Applications to NoSQL by Bhushan Lakhe
Practical Hive: A Guide to Hadoop's Data Warehouse System by Scott Shaw, Andreas François Vermeulen, Ankur Gupta and David Kjerrumgaard
Practical Hadoop Security by Bhushan Lakhe
Practical Machine Learning with H2O: Powerful, Scalable Techniques for Deep Learning and AI by Darren Cook
Pro Apache Hadoop by Jason Venner, Sameer Wadkar and Madhu Siddalingaiah
Professional Hadoop by Benoy Antony, Konstantin Boudnik, Cheryl Adams, Branky Shao, Cazen Lee and Kai Sasaki
Pro Hadoop (Expert's Voice in Open Source) by Jason Venner
Pro Hadoop Data Analytics: Designing and Building Big Data Systems using the Hadoop Ecosystem by Kerry Koitzsch
Programming Hive: Data Warehouse and Query Language for Hadoop by Edward Capriolo, Dean Wampler and Jason Rutherglen
Programming Pig: Dataflow Scripting with Hadoop by Alan Gates, Daniel Dai
Pro Microsoft HDInsight: Hadoop on Windows by Debarchan Sarkar
Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython by Wes McKinney
Python Machine Learning by Sebastian Raschka
Real-World Hadoop by Ted Dunning and Ellen Friedman
SAS and Hadoop Technology: Overview by SAS Institute
Scaling Big Data with Hadoop and Solr - Second Edition by Hrishikesh Vijay Karambelkar
Securing Hadoop by Sudheesh Narayanan
Spark in Action by Petar Zecevic and Marko Bonaci
Top 50 Apache Hadoop Interview Questions and Answers by Knowledge Powerhouse
Using R to Unlock the Value of Big Data: Big Data Analytics with Oracle R Enterprise and Oracle R Connector for Hadoop by Mark Hornick and Tom Plunkett
Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture (VMware Press Technology) by George Trujillo, Charles Kim, Steve Jones, Rommel Garcia and Justin Murray

For a list of Apache Spark books, see this posting.

Leave a comment

Your email address will not be published. Required fields are marked *