Login       My Wishlist
  My Cart
$0.00 / 0 items
 
Translate This Website
International Translation Network
 
International Access
Global Shipping Options Available
  Our Catalog   Computers & Technology   Databases & Big Data   Data Modeling & Design

Programming Pig: Dataflow Scripting with Hadoop


Super Savings Item! Save 39% on the Programming Pig: Dataflow Scripting with Hadoop by O Reilly Media at Translate This Website. Hurry! Limited time offer. Offer valid only while supplies last. For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Processing and analyzing datasets with the


Product Description

For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Processing and analyzing datasets with the Apache Pig scripting platform. With Pig, you can batch-process data without having to create a full-fledged application, making it easy to experiment with new datasets.

Updated with use cases and programming examples, this second edition is the ideal learning tool for new and experienced users alike. You’ll find comprehensive coverage on key features such as the Pig Latin scripting language and the Grunt shell. When you need to analyze terabytes of data, this book shows you how to do it efficiently with Pig.

  • Delve into Pig’s data model, including scalar and complex data types
  • Write Pig Latin scripts to sort, group, join, project, and filter your data
  • Use Grunt to work with the Hadoop Distributed File System (HDFS)
  • Build complex data processing pipelines with Pig’s macros and modularity features
  • Embed Pig Latin in Python for iterative processing and other advanced tasks
  • Use Pig with Apache Tez to build high-performance batch and interactive data processing applications
  • Create your own load and store functions to handle data formats and storage mechanisms

Additional Information

Manufacturer:O'Reilly Media
Brand:O Reilly Media
Publisher:O'Reilly Media
Studio:O'Reilly Media
EAN:9781491937099
Item Weight:0 pounds
Item Size:0.7 x 9.1 x 9.1 inches
Package Weight:1.37 pounds
Package Size:6.93 x 0.79 x 0.79 inches

Programming Pig: Dataflow Scripting with Hadoop by O Reilly Media

Buy Now:
Programming Pig: Dataflow Scripting with Hadoop

Brand: O Reilly Media
4.6 out of 5 stars with 104 reviews
Condition: New
Lead Time: 1 - 2 Business Days
Availability: In Stock
$39.99
$24.51
You Save: 39%


Quantity:  

 


 


Have questions about this item, or would like to inquire about a custom or bulk order?


If you have any questions about this product by O Reilly Media, contact us by completing and submitting the form below. If you are looking for a specif part number, please include it with your message.

First Name:
Last Last:
Email Address:
Your Message:

Related Best Sellers


Hands-On Data Science with Anaconda: Utilize the right mix of tools to create high-performance data science applications
By Packt Publishing - ebooks Account
ean: 9781788831192, isbn: 9781788831192,
Develop, deploy, and streamline your data science projects with the most popular end-to-end platform, AnacondaKey FeaturesUse Anaconda to find solutions for clustering, classification, and linear regressionAnalyze your data efficiently with the most ...

Think Like a Data Scientist: Tackle the data science process step-by-step
By Manning Publications
ean: 9781633430273, isbn: 9781633430273,
Summary Think Like a Data Scientist presents a step-by-step approach to data science, combining analytic, programming, and business perspectives into easy-to-digest techniques and thought processes for solving real world data-centric problems. Purcha...

Ruby Data Processing: Using Map, Reduce, and Select
By Apress
ean: 9781484234730, isbn: 1484234731,
Gain the basics of Ruby’s map, reduce, and select functions and discover how to use them to solve data-processing problems. This compact hands-on book explains how you can encode certain complex programs in 10 lines of Ruby code, an astonishingly s...

Python Data Science Essentials - Learn the fundamentals of Data Science with Python
By Packt Publishing - ebooks Account
mpn: black & white illustrations, ean: 9781785280429, isbn: 9781785280429,
Key FeaturesQuickly get familiar with data science using PythonSave time - and effort - with all the essential tools explainedCreate effective data science projects and avoid common pitfalls with the help of examples and hints dictated by experienceB...

Machine Vision Algorithms in Java: Techniques and Implementation
By Springer
mpn: 183 black & white illustrations, 98 blac, ean: 9781852332181, isbn: 1852332182,
This book presents key machine vision techniques and algorithms, along with the associated Java source code. Special features include a complete self-contained treatment of all topics and techniques essential to the understanding and implementation o...

UML and Data Modeling: A Reconciliation
By Technics Publications, LLC
ean: 9781935504191, isbn: 1935504193,
Here you will learn how to develop an attractive, easily readable, conceptual, business-oriented entity/relationship model, using a variation on the UML Class Model notation. This book has two audiences: Data modelers (both analysts and database de...

Joe Celko's SQL for Smarties: Advanced SQL Programming Second Edition (The Morgan Kaufmann Series in Data Management Systems)
By Brand: Morgan Kaufmann
ean: 9781558605763, isbn: 1558605762,
SQL for Smarties was hailed as the first book devoted explicitly to the advanced techniques you need to transform yourself into an expert SQL programmer. Now, in this fully updated second edition, SQL mastermind Joe Celko keeps you moving forward, us...

Mastering Geospatial Analysis with Python: Explore GIS processing and learn to work with GeoDjango, CARTOframes and MapboxGL-Jupyter
By Packt Publishing
ean: 9781788293334, isbn: 1788293339,
Explore GIS processing and learn to work with various tools and libraries in Python.Key FeaturesAnalyze and process geospatial data using Python libraries such as; Anaconda, GeoPandasLeverage new ArcGIS API to process geospatial data for the cloud.Ex...

Access Database Design & Programming (3rd Edition)
By Brand: O'Reilly Media
ean: 9780596002732, isbn: 9780596002732,
Access Database Design & Programming takes you behind the details of the Access interface, focusing on the general knowledge necessary for Access power users or developers to create effective database applications. When using software products with g...

Object-Orientation, Abstraction, and Data Structures Using Scala (Chapman & Hall/CRC Textbooks in Computing)
By Chapman and Hall/CRC
ean: 9781498732161, isbn: 149873216X,
Praise for the first edition: "The well-written, comprehensive book…[is] aiming to become a de facto reference for the language and its features and capabilities. The pace is appropriate for beginners; programming concepts are introduced progressi...



Privacy Policy / Terms of Service
© 2019 - translateth.is. All Rights Reserved.