Got a question? Call 1800 853 276   |   
Cloud Computing and Virtualisation

Cloudera Administrator Training: CDP Private Cloud Base

  • Length 4 days
  • Price $4620 inc GST
Course overview
View dates &
book now
  • Register interest

Why study this course

Thia course provides students with a comprehensive understanding of all the steps necessary to operate and maintain on-premises clusters using Cloudera Manager.

From installation and configuration through load balancing and tuning, this Cloudera training course is the best preparation for the real-world challenges faced by administrators who run CDP Private Cloud Base.

Request Course Information

By submitting an enquiry, you agree to our privacy policy and receiving email and other forms of communication from us. You can opt-out at any time.


What you’ll learn

Through instructor-led discussion and interactive, hands-on exercises, you willlearn how to:

  • Install Cloudera Manager

  • Use Cloudera Manager to install a CDP Private Cloud Base cluster

  • Configure and monitor the cluster using Cloudera Manager

  • Understand, evaluate, and select the most appropriate data storage option

  • Optimize cluster performance

  • Perform routine cluster maintenance tasks

  • Detect, troubleshoot, and repair problems with the cluster


Cloudera at DDLS

Cloudera provides a scalable, flexible, integrated platform that makes it easy to manage rapidly increasing volumes and varieties of data in your enterprise. Cloudera products and solutions enable you to deploy and manage Apache Hadoop and related projects, manipulate and analyse your data, and keep that data secure and protected.


Stay ahead of the technology curve

Don’t let your tech outpace the skills of your people

Quality instructors and content

Expert instructors with real world experience and the latest vendor- approved in-depth course content.

Partner-Preferred Supplier

Chosen and awarded by the world’s leading vendors as preferred training partner.

Ahead of the technology curve

No matter your chosen technologies or platforms, we can help you stay one step ahead.

Who is the course for?

This course is best suited to:

  • systems administrators


Course subjects

Cloudera Data Platform

  • Industry Trends for Big Data

  • The Challenge to Become DataDriven

  • The Enterprise Data Cloud

  • CDP Overview

  • CDP Form Factors

CDP Private Cloud Base Installation

  • Installation Overview

  • Cloudera Manager Installation

  • CDP Runtime Overview

  • Cloudera Manager Introduction

Cluster Configuration

  • Overview

  • Configuration Settings

  • Modifying Service Configurations

  • Configuration Files

  • Managing Role Instances

  • Adding New Services

  • Adding and Removing Hosts

Data Storage

  • Overview

  • HDFS Topology and Roles

  • HDFS Performance and Fault Tolerance

  • HDFS and Hadoop Security Overview

  • Working with HDFS

  • HBase Overview

  • Kudu Overview

  • Cloud Storage Overview

Data Ingest

  • Data Ingest Overview

  • File Formats

  • Ingesting Data using File Transfer or REST Interfaces

  • Importing Data from Relational Databases with Apache Sqoop

  • Ingesting Data Using NiFi

  • Best Practices for Importing Data

Data Flow

  • Overview of Cloudera Flow Management and NiFi

  • NiFi Architecture

  • Cloudera Edge Flow Management and MiNiFi

  • Controller Services

  • Apache Kafka Overview

  • Apache Kafka Cluster Architecture

  • Apache Kafka Command Line Tools

Data Access and Discovery

  • Apache Hive

  • Apache Impala

  • Apache Impala Tuning

  • Search Overview

  • Hue Overview

  • Managing and Configuring Hue

  • Hue Authentication and Authorization

  • CDSW Overview

Data Compute

  • YARN Overview

  • Running Applications on YARN

  • Viewing YARN Applications

  • YARN Application Logs

  • MapReduce Applications

  • YARN Memory and CPU Settings

  • Tez Overview

  • Hive on Tez

  • ACID for Hive

  • Spark Overview

  • How Spark Applications Run on YARN

  • Monitoring Spark Applications

  • Phoenix Overview

Managing Resources

  • Configuring cgroups with CPU Scheduling

  • The Capacity Scheduler

  • Managing Queues

  • Impala Query Scheduling

Planning Your Cluster

  • General Planning Considerations

  • Choosing the Right Hardware

  • Network Considerations

  • CDP Private Cloud Considerations

  • Configuring Nodes

Advanced Cluster Configuration

  • Configuring Service Ports

  • Tuning HDFS and MapReduce

  • Managing Cluster Growth

  • Erasure Coding

  • Enabling HDFS High Availability

Cluster Maintenance

  • Checking HDFS Status

  • Copying Data Between Clusters

  • Rebalancing Data in HDFS

  • HDFS Directory Snapshots

  • Host Maintenance

  • Upgrading a Cluster

Cluster Monitoring

  • Cloudera Manager Monitoring Features

  • Health Tests

  • Events and Alerts

  • Charts and Reports

  • Monitoring Recommendations

Cluster Troubleshooting

  • Overview

  • Troubleshooting Tools

  • Misconfiguration Examples

Security

  • Data Governance with SDX

  • Hadoop Security Concepts

  • Hadoop Authentication Using Kerberos

  • Hadoop Authorization

  • Hadoop Encryption

  • Securing a Hadoop Cluster

  • Apache Ranger

  • Apache Atlas

  • Backup and Recovery

Private Cloud / Public Cloud

  • CDP Overview

  • Private Cloud Capabilities

  • Public Cloud Capabilities

  • What is Kubernetes?

  • WXM Overview

  • Auto-scaling


Prerequisites

Participants should have basic Linux experience. Prior knowledge of CDP, nor earlier platforms such as Cloudera’s CDH or Hortonworks HDP, is not required.


Terms & Conditions

The supply of this course by DDLS is governed by the booking terms and conditions. Please read the terms and conditions carefully before enrolling in this course, as enrolment in the course is conditional on acceptance of these terms and conditions.



Request Course Information

By submitting an enquiry, you agree to our privacy policy and receiving email and other forms of communication from us. You can opt-out at any time.