Post 8 | HDPCD | Configure Flume Memory Channel

In the last tutorial, we saw how to start the Flume agent. This tutorial extends the previous one, so please refer to it before getting started. The last tutorial enabled us to start the Flume agent, after which we can send the messages that we want over Flume …

Post 7 | HDPCD | Starting Flume Agent

Hello everyone, I hope you are finding the tutorials useful. In the previous post, we performed the Sqoop export operation. In this tutorial, we are going to start the Flume agent. Flume is one of the projects in the Apache ecosystem. Apache Flume is a reliable and distributed service for moving a large amount of log …

Post 5 | HDPCD | RDBMS to Hive Import

Hello everyone, in this tutorial, we are going to see the 3rd objective in the data ingestion category. The objective is listed on the Hortonworks website under data ingestion and looks like this. In the previous post, we imported data into HDFS; here, we are going to import the data directly into a Hive table. So, let us begin. …

Post 4 | HDPCD | Free-form Query Import

Hello everyone, in this tutorial, we are going to see the 2nd objective in the data ingestion category. The objective is listed on the Hortonworks website under data ingestion and looks like this. In the previous objective, we imported entire records from a MySQL table, whereas in this post, we are going to import data based on …