Beginning Apache Pig Big Data Processing Made Easy

Vaddeman, Balaswamy

Please use this identifier to cite or link to this item: http://lib.hpu.edu.vn/handle/123456789/25882

Title:	Beginning Apache Pig Big Data Processing Made Easy
Authors:	Vaddeman, Balaswamy
Keywords:	Apache Pig; Big Data
Issue Date:	2017
Abstract:	Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications. The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools. You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin, such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally, you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance. What You Will Learn: use all the features of Apache Pig; integrate Apache Pig with other tools; extend Apache Pig; optimize Pig Latin code; solve different use cases for Pig Latin. Who This Book Is For: all levels of IT professionals, including architects, big data enthusiasts, engineers, developers, and big data administrators.
URI:	https://lib.hpu.edu.vn/handle/123456789/25882
Appears in Collections:	Technology

Files in This Item:

File	Description	Size	Format
0902_Beginning_Apache_Pig_Big_Data_Processing.pdf (Restricted Access)		10.44 MB	Adobe PDF
