Programming Pig: Dataflow Scripting with Hadoop
Super Savings Item! Save 39% on the Programming Pig: Dataflow Scripting with Hadoop by O Reilly Media at Translate This Website. Hurry! Limited time offer. Offer valid only while supplies last. For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Processing and analyzing datasets with the
For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Processing and analyzing datasets with the Apache Pig scripting platform. With Pig, you can batch-process data without having to create a full-fledged application, making it easy to experiment with new datasets.
Updated with use cases and programming examples, this second edition is the ideal learning tool for new and experienced users alike. You’ll find comprehensive coverage on key features such as the Pig Latin scripting language and the Grunt shell. When you need to analyze terabytes of data, this book shows you how to do it efficiently with Pig.
- Delve into Pig’s data model, including scalar and complex data types
- Write Pig Latin scripts to sort, group, join, project, and filter your data
- Use Grunt to work with the Hadoop Distributed File System (HDFS)
- Build complex data processing pipelines with Pig’s macros and modularity features
- Embed Pig Latin in Python for iterative processing and other advanced tasks
- Use Pig with Apache Tez to build high-performance batch and interactive data processing applications
- Create your own load and store functions to handle data formats and storage mechanisms
|Brand:||O Reilly Media|
|Item Weight:||0 pounds|
|Item Size:||0.7 x 9.1 x 9.1 inches|
|Package Weight:||1.37 pounds|
|Package Size:||6.93 x 0.79 x 0.79 inches|
Have questions about this item, or would like to inquire about a custom or bulk order?
If you have any questions about this product by O Reilly Media, contact us by completing and submitting the form below. If you are looking for a specif part number, please include it with your message.
Related Best Sellers
By Packt Publishing - ebooks Account
ean: 9781788831192, isbn: 9781788831192,
Develop, deploy, and streamline your data science projects with the most popular end-to-end platform, AnacondaKey FeaturesUse Anaconda to find solutions for clustering, classification, and linear regressionAnalyze your data efficiently with the most ...
By Manning Publications
ean: 9781633430273, isbn: 9781633430273,
Summary Think Like a Data Scientist presents a step-by-step approach to data science, combining analytic, programming, and business perspectives into easy-to-digest techniques and thought processes for solving real world data-centric problems. Purcha...
ean: 9781484234730, isbn: 1484234731,
Gain the basics of Ruby’s map, reduce, and select functions and discover how to use them to solve data-processing problems. This compact hands-on book explains how you can encode certain complex programs in 10 lines of Ruby code, an astonishingly s...
By Packt Publishing - ebooks Account
mpn: black & white illustrations, ean: 9781785280429, isbn: 9781785280429,
Key FeaturesQuickly get familiar with data science using PythonSave time - and effort - with all the essential tools explainedCreate effective data science projects and avoid common pitfalls with the help of examples and hints dictated by experienceB...
mpn: 183 black & white illustrations, 98 blac, ean: 9781852332181, isbn: 1852332182,
This book presents key machine vision techniques and algorithms, along with the associated Java source code. Special features include a complete self-contained treatment of all topics and techniques essential to the understanding and implementation o...
By Technics Publications, LLC
ean: 9781935504191, isbn: 1935504193,
Here you will learn how to develop an attractive, easily readable, conceptual, business-oriented entity/relationship model, using a variation on the UML Class Model notation. This book has two audiences: Data modelers (both analysts and database de...
By Brand: Morgan Kaufmann
ean: 9781558605763, isbn: 1558605762,
SQL for Smarties was hailed as the first book devoted explicitly to the advanced techniques you need to transform yourself into an expert SQL programmer. Now, in this fully updated second edition, SQL mastermind Joe Celko keeps you moving forward, us...
By Packt Publishing
ean: 9781788293334, isbn: 1788293339,
Explore GIS processing and learn to work with various tools and libraries in Python.Key FeaturesAnalyze and process geospatial data using Python libraries such as; Anaconda, GeoPandasLeverage new ArcGIS API to process geospatial data for the cloud.Ex...
By Brand: O'Reilly Media
ean: 9780596002732, isbn: 9780596002732,
Access Database Design & Programming takes you behind the details of the Access interface, focusing on the general knowledge necessary for Access power users or developers to create effective database applications. When using software products with g...
By Chapman and Hall/CRC
ean: 9781498732161, isbn: 149873216X,
Praise for the first edition: "The well-written, comprehensive book…[is] aiming to become a de facto reference for the language and its features and capabilities. The pace is appropriate for beginners; programming concepts are introduced progressi...