Course Outline
-
Introduction to Cloud Computing and Big Data solutions
-
Apache Hadoop evolution: HDFS, MapReduce, YARN
-
Installation and configuration of Hadoop in Pseudo-distributed mode
-
Running MapReduce jobs on Hadoop cluster
-
Hadoop cluster planning, installation and configuration
-
Hadoop ecosystem: Pig, Hive, Sqoop, HBase
- Big Data future: Impala, Cassandra
Requirements
- basic Linux administration skills
- basic programming skills
Testimonials (5)
Trainer's preparation & organization, and quality of materials provided on github.
Mateusz Rek - MicroStrategy Poland Sp. z o.o.
Course - Impala for Business Intelligence
The VM I liked very much The Teacher was very knowledgeable regarding the topic as well as other topics, he was very nice and friendly I liked the facility in Dubai.
Safar Alqahtani - Elm Information Security
Course - Big Data Analytics in Health
I thought he did a great job of tailoring the experience to the audience. This class is mostly designed to cover data analysis with HIVE, but me and my co-worker are doing HIVE administration with no real data analytics responsibilities.
ian reif - Franchise Tax Board
Course - Data Analysis with Hive/HiveQL
I genuinely enjoyed the many hands-on sessions.
Jacek Pieczątka
Course - Administrator Training for Apache Hadoop
The fact that all the data and software was ready to use on an already prepared VM, provided by the trainer in external disks.