Hive Installation
  Note: Install Cloudera
Repository
- Install the Cloudera yum, zypper/YaST or apt repository.
 - Install or upgrade CDH 5 and make sure it is functioning correctly.
 
Using Hive data in HBase is a common task. See Importing Data Into HBase.
For information about Hive on Spark, see Running Hive on Spark.
Use the following sections to install, update, and configure Hive.
- Installing Hive
 - Upgrading Hive
 - Configuring the Hive Metastore
 - Configuring HiveServer2
 - Starting the Metastore
 - File System Permissions
 - Starting, Stopping, and Using HiveServer2
 - Starting HiveServer1 and the Hive Console
 - Using Hive with HBase
 - Using the Hive Schema Tool
 - Installing the Hive JDBC Driver on Clients
 - Setting HADOOP_MAPRED_HOME
 - Configuring the Metastore to Use HDFS High Availability
 - Troubleshooting Hive
 - Viewing the Hive Documentation
 
Apache Hive is a powerful data warehousing application for Hadoop. It enables you to access your data using Hive QL, a language similar to SQL.
Install Hive on your client machine(s) from which you submit jobs; you do not need to install it on the nodes in your Hadoop cluster. As of CDH 5, Hive supports HCatalog which must be installed separately.
HiveServer2
HiveServer2 is an improved version of HiveServer that supports a Thrift API tailored for JDBC and ODBC clients, Kerberos authentication, and multi-client concurrency. The CLI for HiveServer2 is Beeline.
  Warning: Because of concurrency and
security issues, HiveServer1 and the Hive CLI are deprecated in CDH 5 and will be removed in a future release. Cloudera recommends you migrate to Beeline and HiveServer2 as soon as possible. The Hive CLI is not
needed if you are using Beeline with HiveServer2.| << Modifying Impala Startup Options | ©2016 Cloudera, Inc. All rights reserved | Installing Hive >> | 
| Terms and Conditions Privacy Policy |