Post 32 | HDPCD | Defining a Bucketed Hive Table

Hello everyone to the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to create a Partitioned Hive Table. In this tutorial, we are going to see how to create a Bucketed Hive table. The process is depicted in the following infographics. As you can see from the above picture, … Continue reading Post 32 | HDPCD | Defining a Bucketed Hive Table

Post 31 | HDPCD | Defining a Partitioned Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to create the hive external table. In this tutorial, we are going to see how to create a partitioned Hive table. For doing this, we are going to follow the following process. As you can see … Continue reading Post 31 | HDPCD | Defining a Partitioned Hive table

Post 30 | HDPCD | Define a Hive External Table

Hello, everyone! Welcome to the third tutorial in the Data Analysis section of the HDPCD certification. In the last tutorial, we saw how to create the hive-managed or internal table. In this tutorial, we are going to create the hive external table. So, let us start with the process. The following infographics show the process … Continue reading Post 30 | HDPCD | Define a Hive External Table

Post 29 | HDPCD | Define a Hive-managed Table

Hello, everyone. Welcome to the second post in the Data Analysis section of the HDPCD certification series. In the last tutorial, we saw the three ways in which we run the hive commands. In this tutorial, we are going to create the hive-managed table i.e. hive internal table. For creating a hive-managed or internal table, … Continue reading Post 29 | HDPCD | Define a Hive-managed Table

Post 28 | HDPCD | Write and Execute a Hive Query

Hello everyone, Welcome to the first tutorial of the DATA ANALYSIS section of the HDPCD certification. This section is going to contain a total of 24 posts, after which we will be finally done with the HDPCD certification tutorials. In the last tutorial of the DATA TRANSFORMATION section, we saw the process of invoking a … Continue reading Post 28 | HDPCD | Write and Execute a Hive Query

Post 18 | HDPCD | Storing Pig Relation in Hive Table

Wassup everyone. Thanks for coming back once again. This section is coming to an end with less than 10 tutorials remaining. Once we are done with this section (Apache Pig), we will start with the next section which focuses on Apache Hive. In the last tutorial, we saw the process of storing the data stored … Continue reading Post 18 | HDPCD | Storing Pig Relation in Hive Table

Post 14 | HDPCD | Data Transformation to match Hive Schema using Apache Pig

The last tutorial talked about transforming data by reducing the number of columns from input to output records. This tutorial is kind of similar, in which, we are going to take the data transformation process one step further. This tutorial focuses on matching your input records with the Hive table schema. This includes splitting the … Continue reading Post 14 | HDPCD | Data Transformation to match Hive Schema using Apache Pig