Hadoop Fundamentals I

BigData represents a new and fast growing era in data exploration and utilization. These course materials cover Hadoop fundamentals. Hadoop is an open-source implementation of frameworks for reliable, scalable, distributed computing and data storage which can be used for managing Big Data. Course modules cover Hadoop architecture, HDFS, MapReduce, Querying Data (Pig, Hive, JAQL) and Moving Data (Flume). Hands-on labs, using IBM BigInsights, are included in the course. BigInsights, which is based on Hadoop, is available as a virtual image. The course is approximately 8 hours of course content, including time for the labs.

Agenda:

09:00 – 09:15 Welcome

09:15 – 09:45 What is big data?

09:45 – 10:15 Introduction to Hadoop

10:15 – 10:30 BigInsights

10:30 – 10:45 Hadoop architecture

10:45 – 11:00 Break

11:00 – 11:30 Lab 1 – Core Hadoop

11:30 – 12:00 MapReduce

12:00 – 12:30 Lab 2

13:30 – 14:00 Pig, Hive, and Jaql

14:00 – 14:30 Lab 3 – Pig, Hive, and Jaql

14:30 – 15:00 Moving data with Flume

15:00 – 15:15 Break

15:15 – 15:30 Lab 4 – Flume

15:30 – 16:00 IBM BigInsights, Hadoop for the Enterprise

16:00 – 16:45 Lab 5 – Big Insights

16:45 – 17:00 Quiz, Summary and Wrap up

Gain hands-on experience with Hadoop technologies and it’s use with IBM Big Insights to address business needs of today.

1 person is attending this meetup

Open in Google Maps