The Apache Hive ™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive.
In this tutorial, we will use the Ambari HDFS file view to store data files of truck drivers statistics. We will implement Hive queries to analyze, process and filter that data.