Creating and Managing Databases in Hive
Learn how to create and manage databases in Hive to organize your tables effectively. This guide covers creating new databases, listing existing databases, and best practices for organizing your Hive data warehouse.
Creating Databases in Hive
Understanding Hive Databases
In Hive, a database acts as a container or namespace for storing tables. You can organize your tables into multiple databases to improve management and prevent naming conflicts. Hive provides a default database named default
.
Checking Existing Databases
Before creating a new database, it's a good idea to check what databases already exist:
HiveQL Command (Listing Databases)
hive> show databases;
This command will display a list of existing databases, including the default database.
Creating a New Database
To create a new database, use the following command. Replace demo
with your desired database name:
HiveQL Command (Creating a Database)
hive> CREATE DATABASE demo;
After executing this command, a new database named demo
will be created. Verify its creation by running show databases;
again.
Handling Duplicate Database Names
Database names must be unique. Attempting to create a database with an existing name will result in an error. To prevent this error, use the IF NOT EXISTS
clause:
HiveQL Command (Creating a Database if it Doesn't Exist)
hive> CREATE DATABASE IF NOT EXISTS demo;
This command will only create the database if it doesn't already exist.
Adding Properties to a Database
You can add properties to a database using key-value pairs:
HiveQL Command (Creating a Database with Properties)
hive> CREATE DATABASE demo
> WITH DBPROPERTIES ('creator' = 'Gaurav Chawla', 'date' = '2019-06-03');
Retrieving Database Information
To view a database's properties and other information:
HiveQL Command (Describing a Database)
hive> DESCRIBE DATABASE EXTENDED demo;