Course Outline
Introduction to Hortonworks Data Platform (HDP)
Overview of Big Data and Apache Hadoop
Installing and Configuring HDP
Setting up, Deploying, and Managing Hadoop Cluster
Understanding and ConfiguringYARN and MapReduce
Overview of Job Scheduling
Ensuring Data Integrity
Understanding Enterprise Data Movement
Using HDFS Commands & Services
Transferring Data Using Flume
Working with Hive
Scheduling Workflow Using Oozie
Exploring Hadoop 2.x
Understanding Hbase Architecture
Monitoring HDP2 Services Using Ambari
New Features in HDP
Troubleshooting
Summary and Next Steps
Requirements
- An understanding of Hadoop and big data
- An understanding of Spark
- Familiarity with the command line
- System administration experience
Audience
- Hadoop administrators
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from €6840 online delivery, based on a group of 2 delegates, €2160 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Testimonials (5)
A lot of practical examples, different ways to approach the same problem, and sometimes not so obvious tricks how to improve the current solution
Rafal - Nordea
Course - Apache Spark MLlib
The live examples
Ahmet Bolat - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
very interactive...
Richard Langford
Course - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
Course - A Practical Introduction to Stream Processing
Get to learn spark streaming , databricks and aws redshift