Apache Hive Essentials : Essential Techniques to Help You Process, and Get Unique Insights from, Big Data, 2nd Edition.

Apache Hive helps you deal with data summarization, queries, and analysis for huge amounts of data. This book will give you a background in big data, and familiarize you with your Hive working environment. Next you will cover advanced topics like performance and security in Hive and how to work effi...

Full description

Saved in:

Bibliographic Details
Main Author:	Du, Dayong
Format:	eBook
Language:	English
Published:	Birmingham : Packt Publishing Ltd, 2018.
Edition:	2nd ed.
Subjects:	Apache Hadoop. Apache Hadoop Electronic data processing > Distributed processing. Cloud computing. Big data. Databases-Design-Data processing. Data capture & analysis. Data warehousing. Databases. Computers > Data Processing. Computers > Database Management > Data Warehousing. Computers > Database Management > General. Big data Cloud computing Electronic data processing
Online Access:	Click for online access

MARC


LEADER	00000cam a2200000Mi 4500
001	on1044944891
003	OCoLC
005	20241006213017.0
006	m o d
007	cr \|n\|---\|\|\|\|\|
008	180721s2018 enk o 000 0 eng d
040			\|a EBLCP \|b eng \|e pn \|c EBLCP \|d MERUC \|d NLE \|d OCLCQ \|d LVT \|d C6I \|d OCLCQ \|d LOY \|d OCLCO \|d UX1 \|d K6U \|d OCLCO \|d OCLCQ \|d OCLCO \|d SXB
019			\|a 1175637308
020			\|a 9781789136517
020			\|a 1789136512
020			\|a 9781788995092
020			\|a 1788995090 \|q (Trade Paper)
024	3		\|a 9781788995092
035			\|a (OCoLC)1044944891 \|z (OCoLC)1175637308
037			\|a B10778 \|b 01201872
050		4	\|a QC100 \|b .D8 2018eb
049			\|a HCDD
100	1		\|a Du, Dayong.
245	1	0	\|a Apache Hive Essentials : \|b Essential Techniques to Help You Process, and Get Unique Insights from, Big Data, 2nd Edition.
250			\|a 2nd ed.
260			\|a Birmingham : \|b Packt Publishing Ltd, \|c 2018.
300			\|a 1 online resource (203 pages)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
588	0		\|a Print version record.
505	0		\|a Cover; Title Page; Copyright and Credits; Dedication; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Overview of Big Data and Hive; A short history; Introducing big data; The relational and NoSQL databases versus Hadoop; Batch, real-time, and stream processing; Overview of the Hadoop ecosystem; Hive overview; Summary; Chapter 2: Setting Up the Hive Environment; Installing Hive from Apache; Installing Hive from vendors; Using Hive in the cloud ; Using the Hive command; Using the Hive IDE; Summary; Chapter 3: Data Definition and Description; Understanding data types.
505	8		\|a Data type conversionsData Definition Language; Database; Tables; Table creation; Table description; Table cleaning; Table alteration; Partitions; Buckets; Views; Summary; Chapter 4: Data Correlation and Scope; Project data with SELECT; Filtering data with conditions; Linking data with JOIN; INNER JOIN; OUTER JOIN; Special joins; Combining data with UNION; Summary; Chapter 5: Data Manipulation; Data exchanging with LOAD; Data exchange with INSERT; Data exchange with [EX\|IM]PORT; Data sorting; Functions; Function tips for collections; Function tips for date and string; Virtual column functions.
505	8		\|a Transactions and locksTransactions; UPDATE statement; DELETE statement; MERGE statement; Locks; Summary; Chapter 6: Data Aggregation and Sampling; Basic aggregation ; Enhanced aggregation; Grouping sets; Rollup and Cube; Aggregation condition; Window functions; Window aggregate functions; Window sort functions; Window analytics functions; Window expression; Sampling; Random sampling; Bucket table sampling; Block sampling; Summary; Chapter 7: Performance Considerations; Performance utilities; EXPLAIN statement; ANALYZE statement; Logs; Design optimization; Partition table design.
505	8		\|a Bucket table designIndex design; Use skewed/temporary tables; Data optimization; File format; Compression; Storage optimization; Job optimization; Local mode; JVM reuse; Parallel execution; Join optimization; Common join; Map join; Bucket map join; Sort merge bucket (SMB) join; Sort merge bucket map (SMBM) join; Skew join; Job engine; Optimizer; Vectorization optimization; Cost-based optimization; Summary; Chapter 8: Extensibility Considerations; User-defined functions; UDF code template; UDAF code template; UDTF code template; Development and deployment; HPL/SQL; Streaming; SerDe; Summary.
505	8		\|a Chapter 9: Security ConsiderationsAuthentication; Metastore authentication; Hiveserver2 authentication; Authorization; Legacy mode; Storage-based mode; SQL standard-based mode; Mask and encryption; The data-hashing function; The data-masking function; The data-encryption function; Other methods; Summary; Chapter 10: Working with Other Tools; The JDBC/ODBC connector; NoSQL; The Hue/Ambari Hive view; HCatalog; Oozie; Spark; Hivemall; Summary; Other Books You May Enjoy; Index.
520			\|a Apache Hive helps you deal with data summarization, queries, and analysis for huge amounts of data. This book will give you a background in big data, and familiarize you with your Hive working environment. Next you will cover advanced topics like performance and security in Hive and how to work efficiently to find solutions to big data problems.
630	0	0	\|a Apache Hadoop.
630	0	7	\|a Apache Hadoop \|2 fast
650		0	\|a Electronic data processing \|z Distributed processing.
650		0	\|a Cloud computing.
650		0	\|a Big data.
650		0	\|a Databases-Design-Data processing.
650		7	\|a Data capture & analysis. \|2 bicssc
650		7	\|a Data warehousing. \|2 bicssc
650		7	\|a Databases. \|2 bicssc
650		7	\|a Computers \|x Data Processing. \|2 bisacsh
650		7	\|a Computers \|x Database Management \|x Data Warehousing. \|2 bisacsh
650		7	\|a Computers \|x Database Management \|x General. \|2 bisacsh
650		7	\|a Big data \|2 fast
650		7	\|a Cloud computing \|2 fast
650		7	\|a Electronic data processing \|2 fast
776	0	8	\|i Print version: \|a Du, Dayong. \|t Apache Hive Essentials : Essential Techniques to Help You Process, and Get Unique Insights from, Big Data, 2nd Edition. \|d Birmingham : Packt Publishing Ltd, ©2018 \|z 9781788995092
856	4	0	\|u https://ebookcentral.proquest.com/lib/holycrosscollege-ebooks/detail.action?docID=5446045 \|y Click for online access
903			\|a EBC-AC
994			\|a 92 \|b HCD

Apache Hive Essentials : Essential Techniques to Help You Process, and Get Unique Insights from, Big Data, 2nd Edition.

MARC

Similar Items