From Fedora Project Wiki
Line 15: Line 15:


== Detailed Description ==
== Detailed Description ==
Apache HBase is used when you need random, realtime read/write access to your Big Data.  Apache HBase hosts very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is a distributed, versioned, non-relational database modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS.
The Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage. Apache Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.


== Benefit to Fedora ==
== Benefit to Fedora ==

Revision as of 19:40, 26 March 2014

Apache Hive

Summary

Apache Hive is a data warehouse built on top of Apache Hadoop.

Owner

Current status

  • Targeted release: Fedora 21
  • Last updated: 26 March 2014
  • Tracker bug: <will be assigned by the Wrangler>

Detailed Description

The Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage. Apache Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.

Benefit to Fedora

Apache HBase is the Big Data database used by many parts of the Hadoop Ecosystem. Including it in Fedora increases usefulness of the Apache Hadoop that is already in Fedora.

Scope

  • Proposal owners: The Hive package has been accepted into Fedora and provides all the functionality from the upstream release.
  • Other developers: N/A (not a System Wide Change)
  • Release engineering: N/A (not a System Wide Change)
  • Policies and guidelines: N/A (not a System Wide Change)

Upgrade/compatibility impact

N/A (not a System Wide Change)

How To Test

Basic shell exercises here

User Experience

Users should be able to write/run applications that use Apache Hive for their database.

Dependencies

Apache HBase

Contingency Plan

  • Contingency mechanism: N/A (not a System Wide Change)
  • Contingency deadline: N/A (not a System Wide Change)
  • Blocks release? N/A (not a System Wide Change)
  • Blocks product? N/A

Documentation

N/A (not a System Wide Change)

Release Notes

Fedora 21 includes Apache Hive, the Hadoop data warehouse.