04.07.2013 Views

Hadoop: The Definitive Guide - Cdn.oreilly.com

Hadoop: The Definitive Guide - Cdn.oreilly.com

Hadoop: The Definitive Guide - Cdn.oreilly.com

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Parallelism 409<br />

Parameter Substitution 410<br />

12. Hive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 413<br />

Installing Hive 414<br />

<strong>The</strong> Hive Shell 415<br />

An Example 416<br />

Running Hive 417<br />

Configuring Hive 417<br />

Hive Services 419<br />

<strong>The</strong> Metastore 421<br />

Comparison with Traditional Databases 423<br />

Schema on Read Versus Schema on Write 423<br />

Updates, Transactions, and Indexes 424<br />

HiveQL 425<br />

Data Types 426<br />

Operators and Functions 428<br />

Tables 429<br />

Managed Tables and External Tables 429<br />

Partitions and Buckets 431<br />

Storage Formats 435<br />

Importing Data 441<br />

Altering Tables 443<br />

Dropping Tables 443<br />

Querying Data 444<br />

Sorting and Aggregating 444<br />

MapReduce Scripts 445<br />

Joins 446<br />

Subqueries 449<br />

Views 450<br />

User-Defined Functions 451<br />

Writing a UDF 452<br />

Writing a UDAF 454<br />

13. HBase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 459<br />

HBasics 459<br />

Backdrop 460<br />

Concepts 460<br />

Whirlwind Tour of the Data Model 460<br />

Implementation 461<br />

Installation 464<br />

Test Drive 465<br />

Clients 467<br />

Table of Contents | xi

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!