Certified Apache Hadoop Developer Training - 2 day course


Certified Apache Hadoop Developer

Apache Hadoop, the open source data management software that helps organizations analyse massive volumes of structured and unstructured data, is a very hot topic across the tech industry. Employed by such big named websites as eBay, Facebook, and Yahoo, Hadoop is being tagged by many as one of the most desired tech skills for 2013 and coming years along with Cloud Computing.

Why Learn Big Data?
90% of the data in the world today is less than 2 year old.
18 Moths is the estimated time for digital universe to double.
2.6 Quintillion bytes is produced every day.

  • Understand Big Data & Hadoop Ecosystem
  • Hadoop Distributed File System – HDFS
  • Use Map Reduce API and write common algorithms
  • Best practices for developing and debugging map reduce programs
  • Advanced Map Reduce Concepts & Algorithms
  • Hadoop Best Practices & Tip and Techniques
  • Managing and Monitoring Hadoop Cluster
  • Importing and exporting data using Sqoop
  • Leverage Hive & Pig for analysis
  • Running Hadoop on Cloud



This workshop will help you to understand Big Data Hadoop and its Ecosystem and will give hands on trainings via live Big Data project. UNICOM will conduct this 2-day lab-based training would be taken by highly acclaimed certified guide with limited batch size ensuring focus and quality.

Take Away from the Course:
•    There will be 11 practical exercises during the 2 days of Training.
•    Hand out of the presentation will be provided to the participants
•    Understanding of What and Why of Hadoop with its Eco-System Components.
•    Ability to write Map Reduce programs in a given scenario
•    Ability to correctly architect and implement the Best Practices in Hadoop Development
•    Ability to manage the different Hadoop Components when talking to each other.

Hardware and Softwares Required for this training:

Delegates attending this course are required to carry a laptop with below configuration. Usually we suggest a 64 bit machine and not a 32 bit machine. However you can still try with a 32 bit machine but we do not gaurantee if the programs will run without any error. 

Windows user
64 bit OS, Min 4 GB RAM
VMWare Player 5.0.0
Linux  VM– Ubuntu 12.04 LTS : {Unicom will be providing the VM(virtual machine)}
Eclipse 3.6+
Putty – For opening Telnet sessions to the Linux VM
WinSCP – For transferring files between Windows and Linux VM

Linux/Mac Users (preferably a 64 bit machine): 
Min 4 GB RAM 
Eclipse 3.6+
JDK 1.6 or higher installed on your machine
SSH installed

Systems with said configuration can be arranged through us with an additional cost.


• What is Big Data & Why Hadoop?
  • Big Data Characteristics, Challenges with traditional system
• Hadoop Overview & it’s Ecosystem
  • Anatomy of Hadoop Cluster, Installing and Configuring Hadoop
  • Hands-On Exercise
• HDFS – Hadoop Distributed File System
  • Name Nodes and Data Nodes
  • Hands-On Exercise
• Map Reduce Anatomy
  • How Map Reduce Works?
  • The Mapper & Reducer, Input Formats & Output Formats, Data Type & Customer Writable
• Developing Map Reduce Programs
  • Setting up Eclipse Development Environment, Creating Map Reduce Projects, Debugging and Unit Testing Map     Reduce Code, Testing with MRUnit
  • Hands-On Exercise
• Advanced Map Reduce Concepts
  • Combiner, Partitioner, Counter, Compression, Setup and teardown, Speculative Execution, Zero Reducer and Distributed Cache
  • Hands-On Exercise
• Advanced Map Reduce Algorithms
  • Sorting, Searching and Indexing, Multiple Inputs, Chaining multiple jobs
  • Joins, Handling Binary & Unstructured data
  • Hands-On Exercise
• Advanced Tips & Techniques
  • Determining optimal number of reducers, skipping bad records
  • Partitioning into multiple output files & Passing parameters to tasks
  • Optimizing Hadoop Cluster & Performance Tuning
• Monitoring & Management of Hadoop
  • Managing HDFS with Tools like fsck and dfsadmin
  • Using HDFS & Job Tracker Web UI
  • Routine Administration Procedures
  • Commissioning and decommissioning of nodes
  • Hands-On Exercise
• Using Hive & Pig
  • Hive Basics & Pig Basics
  • Hands-On Exercise
• Sqoop
  • Importing and exporting data from using RDBMS
  • Hands-On Exercise
• Apache Mahout basics
   • Introduction to Machine Learning
   • Introduction to Recommendations, Clustering, Classification
   • Running a Mahout recommender job on Hadoop
• Apache HBase
   • Introduction to HBase
   • Basic architecture, data model
   • Hands on
• Hadoop Best Practices and Use Cases

Instructor Bio

• MR.Venkata Billa is having with over 10years’ experience in IT training development of Java, Business Intelligence, Hadoop, BIGDATA & BIGDATA Analytics he is a Experienced professional and results-oriented instructional designer/technical trainer/curriculum developer/educator in both business and corporate environments on Hadoop, BIGDATA, Business Intelligence Tools.

Education Qualification:

B-TECH  in computer science & information technology.


Delivered Trainings for the Top IT Companies in India and around the globe. 


Sun Certified Java Programmer (SCJP1.4)

Cloudera Certifies Hadoop Developer (CCDH410)

Strong working knowledge and Training Experience of:

Hadoop Bigdata, HBASE, SPARK , SCALA,JAVA (J2SE, J2EE), SQL, and Cloud computing: SALESFORCE CRM.

New Advanced Hadoop technologies: NOSQL technologies: MONGODB, HBASE.

Business Intelligence tools: PENTAHO,COGNOS, TABLEAU, QLIKVIEW

MS Office XP: Word, Excel, and PowerPoint

Macromedia: Dreamweaver and Fireworks MX

TopStyle: HTML and CSS editor

WebEx: interactive online conferencing tool

Interwise: e-Learning collaborative communications tool

Blackboard: e-Education program.

In-House Training

Customised and tailored in-house training gives your business the competitive edge by focussing on self-improvement and investing in your people. In-house solutions allow you to leverage our expertise. We can design and deliver a professional training programme customised to your training needs, provide targeted objectives & learning outcomes, supply highly trained instructors, and provide stimulating content backed up by flexible and cost-effective options.

Call +91 9538878798( India), +44 20 8144 7792( UK) or email contact@unicomlearning.com

In-House Training is an increasingly popular option as it allows you:

Flexibility in Dates & Timing
You can choose dates and timing that will suit you

Cost Effectiveness
Save up to 20% over public training (or more if you have more than 10 people; For Certified ScrumMaster, group size should be more than 20)

Customised Training Content

We will work closely with you in customising course content to suit your team and organisation's objectives and learning outcomes

Course will be conducted in a safe environment and we're happy to sign confidentiality agreements if required

Let us come to you and save on substantial travel and accommodation costs whilst reducing your organisation's carbon footprint

Related Courses

Certification in Apache Mahout

Certified Apache Mahout Scalable, commercial-friendly machine learning for building intelligent applications This course will introduce you to the basic blocks of machine learning, and where Mahout fits in. We will majorly be looking at recommendation systems, what are their types, how to choose a similarity algorithm, and a typical design of a recommendation system. We will be exploring many ex...more

Certification in Big Data Analytics

The Big Data Analytics course has been designed by R experts – people who have used R to solve a variety of business problems in domains like retail, financial services, telecom and healthcare. The course is designed to provide knowledge on how to perform the essential big data analytics tasks using R. Main steps involved in execution of an analytics project are covered using a case study on...more

Certification in Cassandra DB - 2 Day Course

Certified Cassandra DB and NoSQL The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance.This course will introduced you to the NoSQL datastores and understand where Cassandra fits in. You will understand how Cassandra doesn\'t have a single point of failure, it\'s comparison with relational databases, how it\'s easy to ...more

Certified Big Data & Hadoop Master Class

Apache Hadoop Master Class By attending this 01 day classroom training you will gain a good understanding of the Hadoop technology stack including MapReduce, HDFS, Hive, Pig and HBase. This course also includes extensive guidance in applying the right economic, technological and business criteria to the evaluation of Big Data adoption in your organisation.This course is suitable for those who wis...more


Why Learn Big Data?
90% of the data in the world today is less than 2 year old.
18 Moths is the estimated time for digital universe to double.
2.6 Quintillion bytes is produced every day.

  • Big Data: The next big opportunity
  • Big Data Market to Grow to $16.9 Billion
  • Big Data – Big Careers
  • Hadoop wins over enterprise IT
  • 2012 – Year of Big Data
  • Big Data Job Opportunities
  • 20k Big Data Jobs India

What is Course Pre-Requisites?
The participants should have basic understanding or knowledge of java and linux. Prior knowledge of Hadoop is not required

I have a lot of data, but how do I know if it's "Big Data?"
Every company that has data likely has "Big Data," and it grows continuously. Big Data is any type of data, including structured and unstructured data such as log files, customer service information, retail data, text, database information and so on. All of this data can now be analyzed in aggregate, across types and formats, to help make more informed business decisions and drive new solutions.

How much Hands-on is involved?
Around 50% of the training time is dedicated to the Hands-On training. Altogether 11 exercises are lab based.

How can I make payment?
Payment can be made via Cheque / DD / Online Funds transfer / Cash Payment.

Cheque should be drawn in favour of "Unicom training and Seminars Pvt Ltd" payable at Bangalore

NEFT Payment:
Account Name: UNICOM Training & Seminars Pvt Ltd
Bank Name : State Bank of India
Bank Address: Ground Floor, K V Plaza, Green Glen Layout, Outer Ring Road, Bangalore.
A/c Number : 31729010535
IFSC : SBIN0012706
A/c Type: Current

What is Course timing?
0900 – 1700 each day

What is the course Fee?
2 Days Course - Rs 24,000 + 12.36% (Service Tax)

Whom do I contact for more details?
+91-9538878795 or contact@unicomlearning.com

Course dates
Date Location Duration
Price Register & Pay

Group Discount:
5% on Program fee for a group of    03-05
10% on Program Fee for a group of  06-10
15% on Program Fee for a group of  11-15

Confirm your CANCELLATION in writing up to 15 working days before the event and receive a refund less a 10% service charge. Regrettably, no refunds can be made for cancellations received less than 15 working days prior to the event.

However, SUBSTITUTIONS are welcome at any time and is done at no extra cost. The organisers reserve the right to amend the programme if necessary.

INDEMNITY: Should for any reason outside the control of UNICOM Training & Seminars (P) ltd (hereafter called UNICOM), the venue or the speakers change, or the event be cancelled due to industrial action, adverse weather conditions, or an act of terrorism, UNICOM will endeavour to reschedule, but the client hereby indemnifies and holds UNICOM harmless from and against any and all costs, damages and expenses, including attorneys fees, which are incurred by the client. The construction validity and performance of this Agreement shall be governed by all aspects by the laws of India to the exclusive jurisdiction of whose court the Parties hereby agree to submit."

Download Brochure




(C) 2010 UNICOM Training and Seminars Pvt. Ltd. All rights reserved.
M: +91 9538878795, +91 9538878799, +44 20 7193 7900. Email: contact@unicomlearning.com
  • *PRINCE2® is a Registered Trade Mark of the Office of Government Commerce in the United Kingdom and other countries
  • *The Swirl logoTM is a Trade Mark of the Office of Government Commerce
  • *"PMI", "PMBOK" and "PMP" are registered marks of the Project Management Institute, Inc.
  • *"ITIL" is registered trademark of the Office of Government Commerce (OGC), UK.
  • *"CSM" is registered trademark of Scrum Alliance
  • *"ERP" is registered trademark of GARP