Big Data Analytics with Hadoop 3 : Build highly effective analytics solutions to gain valuable insight into your big data.

Apache Hadoop is the most popular platform for big data processing to build powerful analytics solutions. This book shows you how to do just that, with the help of practical examples. You will be well-versed with the analytical capabilities of Hadoop ecosystem with Apache Spark and Apache Flink to p...

Full description

Saved in:
Bibliographic Details
Main Author: Alla, Sridhar
Format: eBook
Language:English
Published: Birmingham : Packt Publishing, 2018.
Subjects:
Online Access:Click for online access

MARC

LEADER 00000cam a2200000Mi 4500
001 on1039692448
003 OCoLC
005 20240809213013.0
006 m o d
007 cr |n|---|||||
008 180609s2018 enk o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d MERUC  |d NLE  |d OCLCQ  |d LVT  |d IDB  |d OCLCF  |d UKMGB  |d UKAHL  |d C6I  |d OCLCO  |d OCLCQ  |d UX1  |d K6U  |d OCLCO  |d OCLCQ  |d OCLCO  |d SXB 
015 |a GBB8O1553  |2 bnb 
016 7 |a 018897104  |2 Uk 
019 |a 1175631986 
020 |a 9781788624954 
020 |a 1788624955 
020 |a 1788628845 
020 |a 9781788628846 
024 3 |a 9781788628846 
035 |a (OCoLC)1039692448  |z (OCoLC)1175631986 
037 |a 9781788624954  |b Packt Publishing 
050 4 |a QA76.9.B45 .A453 2018 
072 7 |a COM  |x 089000  |2 bisacsh 
049 |a HCDD 
100 1 |a Alla, Sridhar. 
245 1 0 |a Big Data Analytics with Hadoop 3 :  |b Build highly effective analytics solutions to gain valuable insight into your big data. 
260 |a Birmingham :  |b Packt Publishing,  |c 2018. 
300 |a 1 online resource (471 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover; Title Page; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Hadoop; Hadoop Distributed File System; High availability; Intra-DataNode balancer; Erasure coding; Port numbers; MapReduce framework; Task-level native optimization; YARN; Opportunistic containers; Types of container execution ; YARN timeline service v. 2; Enhancing scalability and reliability; Usability improvements; Architecture; Other changes; Minimum required Java version ; Shell script rewrite; Shaded-client JARs; Installing Hadoop 3 ; Prerequisites; Downloading. 
505 8 |a InstallationSetup password-less ssh; Setting up the NameNode; Starting HDFS; Setting up the YARN service; Erasure Coding; Intra-DataNode balancer; Installing YARN timeline service v. 2; Setting up the HBase cluster; Simple deployment for HBase; Enabling the co-processor; Enabling timeline service v. 2; Running timeline service v. 2; Enabling MapReduce to write to timeline service v. 2; Summary; Chapter 2: Overview of Big Data Analytics; Introduction to data analytics; Inside the data analytics process; Introduction to big data; Variety of data; Velocity of data; Volume of data; Veracity of data. 
505 8 |a Variability of dataVisualization; Value; Distributed computing using Apache Hadoop; The MapReduce framework; Hive; Downloading and extracting the Hive binaries; Installing Derby; Using Hive; Creating a database; Creating a table; SELECT statement syntax; WHERE clauses; INSERT statement syntax; Primitive types; Complex types; Built-in operators and functions; Built-in operators; Built-in functions; Language capabilities; A cheat sheet on retrieving information ; Apache Spark; Visualization using Tableau; Summary; Chapter 3: Big Data Processing with MapReduce; The MapReduce framework; Dataset. 
505 8 |a Record readerMap; Combiner; Partitioner; Shuffle and sort; Reduce; Output format; MapReduce job types; Single mapper job; Single mapper reducer job; Multiple mappers reducer job; SingleMapperCombinerReducer job; Scenario; MapReduce patterns; Aggregation patterns; Average temperature by city; Record count; Min/max/count; Average/median/standard deviation; Filtering patterns; Join patterns; Inner join; Left anti join; Left outer join; Right outer join; Full outer join; Left semi join; Cross join; Summary; Chapter 4: Scientific Computing and Big Data Analysis with Python and Hadoop; Installation. 
505 8 |a Installing standard PythonInstalling Anaconda; Using Conda; Data analysis; Summary; Chapter 5: Statistical Big Data Computing with R and Hadoop; Introduction; Install R on workstations and connect to the data in Hadoop; Install R on a shared server and connect to Hadoop; Utilize Revolution R Open; Execute R inside of MapReduce using RMR2; Summary and outlook for pure open source options; Methods of integrating R and Hadoop; RHADOOP -- install R on workstations and connect to data in Hadoop; RHIPE -- execute R inside Hadoop MapReduce; R and Hadoop Streaming. 
505 8 |a RHIVE -- install R on workstations and connect to data in Hadoop. 
520 |a Apache Hadoop is the most popular platform for big data processing to build powerful analytics solutions. This book shows you how to do just that, with the help of practical examples. You will be well-versed with the analytical capabilities of Hadoop ecosystem with Apache Spark and Apache Flink to perform big data analytics by the end of this book. 
650 0 |a Big data. 
650 0 |a Cluster analysis. 
650 0 |a Electronic data processing  |x Distributed processing. 
650 7 |a Database design & theory.  |2 bicssc 
650 7 |a Data warehousing.  |2 bicssc 
650 7 |a Information architecture.  |2 bicssc 
650 7 |a Data capture & analysis.  |2 bicssc 
650 7 |a Computers  |x Database Management  |x Data Warehousing.  |2 bisacsh 
650 7 |a Computers  |x Data Modeling & Design.  |2 bisacsh 
650 7 |a Computers  |x Data Processing.  |2 bisacsh 
650 7 |a Big data  |2 fast 
650 7 |a Cluster analysis  |x Data processing  |2 fast 
650 7 |a Electronic data processing  |x Distributed processing  |2 fast 
776 0 8 |i Print version:  |a Alla, Sridhar.  |t Big Data Analytics with Hadoop 3 : Build highly effective analytics solutions to gain valuable insight into your big data.  |d Birmingham : Packt Publishing, ©2018 
856 4 0 |u https://ebookcentral.proquest.com/lib/holycrosscollege-ebooks/detail.action?docID=5405685  |y Click for online access 
903 |a EBC-AC 
994 |a 92  |b HCD