Post 32 | HDPCD | Defining a Bucketed Hive Table

Hello everyone, and welcome to the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to create a partitioned Hive table. In this tutorial, we are going to see how to create a bucketed Hive table. The process is depicted in the following infographic. As you can see from the above picture, …

Post 31 | HDPCD | Defining a Partitioned Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to create a Hive external table. In this tutorial, we are going to see how to create a partitioned Hive table. To do this, we are going to follow the process below. As you can see …

Post 28 | HDPCD | Write and Execute a Hive Query

Hello everyone, welcome to the first tutorial of the DATA ANALYSIS section of the HDPCD certification. This section is going to contain a total of 24 posts, after which we will finally be done with the HDPCD certification tutorials. In the last tutorial of the DATA TRANSFORMATION section, we saw the process of invoking a …

Post 27 | HDPCD | Invoke a User Defined Function in Apache Pig

Hello everyone, thanks for coming back to the last tutorial in the DATA TRANSFORMATION category of the HDPCD certification. We are going to pick up where the last tutorial left off, in which we saw how to define an ALIAS for a function present in the JAR file. In this tutorial, we are going to see how to …

Post 26 | HDPCD | Define an ALIAS for a User Defined Function

Hi, everyone. Thank you for returning to this certification series. In the last tutorial, we saw the process of registering a JAR file in an Apache Pig session. This tutorial is an extension of the previous one; in it, we are going to see how to define an alias for the UDF present …

Post 25 | HDPCD | Register a Jar file of UDF in Apache Pig

Hello, everyone. Thanks for coming back to continue with this certification series. In the last tutorial, we saw how to run a Pig script with Tez as the execution mode. In this tutorial, we are going to see how to register a JAR file to use the User Defined Function written and packaged inside it. …

Post 24 | HDPCD | Run a Pig job using TEZ

Hey, everyone. Thank you for keeping me company on this beautiful journey of HDPCD certification. We are almost done with the Data Transformation section of the certification and are left with only the Data Analysis section using Apache Hive. The Data Analysis section, in my opinion, is easier than this section so you can say …