20131004

Big Data

Big Data & Hadoop Developer Training












Dates: 18th, 19th & 20th Oct 2013 at Bangalore
3 Days Instructor Led Hands-On Training
Overview:
Apache Hadoop, the open source data management software that helps organizations analyze massive volumes of structured and unstructured data, is a very hot topic across the tech industry. Employed by such big named websites as eBay, Facebook and Yahoo, Hadoop is being tagged by many as one of the most desired tech skills for 2012 and coming years along with Cloud Computing.
What participants will learn?
Intended Audience:
The attendees will learn below topics through lectures and hands-on exercises
  – Understand Big Data & Hadoop Ecosystem
  – Hadoop Distributed File System – HDFS
  – Use Map Reduce API and write common algorithms
  – Best practices for developing and debugging map reduce programs
  – Advanced Map Reduce Concepts & Algorithms
  – Hadoop Best Practices & Tip and Techniques
  – Managing and Monitoring Hadoop Cluster
  – Importing and exporting data using Sqoop
  – Leverage Hive & Pig for analysis
  – Running Hadoop on Cloud
Architects and developers, who wish to write, build and maintain Apache Hadoop jobs.
Course Prerequisites:
The participants should have basic understanding or knowledge of java and linux.
Faculty Profile:
He has about 15+ years of industry experience working on enterprise java, SOA and Cloud computing platforms. He has worked with TCS, HP, Patni and worked on large scale projects for customers like Motorola, Home Depot, CKWB Bank, P&G in the roles of solution and technical architect. He provides consulting and training on Cloud Computing, Big data & Hadoop, Google App Engine, and Amazon Web Services.
Course Outline:
What is Big Data & Why Hadoop?
     • Big Data Characteristics, Challenges with traditional system
Hadoop Overview & it’s Ecosystem
     • Anatomy of Hadoop Cluster, Installing and Configuring Hadoop
     • Hands-On Exercise
HDFS – Hadoop Distributed File System
     • HDFS Architecture, Name Nodes, Data Nodes and Secondary Name Node
     • Hands-On Exercise
Map Reduce Anatomy
     • How Map Reduce Works?
     • The Mapper & Reducer, , Data Type, Input& Output Formats
Developing Map Reduce Programs
     • Setting up Eclipse Development Environment, Creating Map Reduce Projects, Debugging and Unit Testing
     • Developing a map reduce algorithm on real world scenario
     • Hands On Exercises
Advanced Map Reduce Concepts
     • Combiner, Partitioner, Counter, Compression, Setup and teardown, Speculative Execution, Zero Reducer and Distributed Cache
Advanced Map Reduce Algorithms
     • Sorting, Searching, Multiple Inputs, Chaining multiple jobs
     • Joins, Handling Binary & Unstructured data
Advanced Tips & Techniques
     • Determining optimal number of reducers, skipping bad records
     • Partitioning into multiple output files & Passing parameters to tasks
     • Hadoop Cluster sizing and capacity planning
Monitoring & Management of Hadoop
Managing HDFS with Tools like fsck and dfsadmin
     • Using HDFS & Job Tracker Web UI
     • Routine Administration Procedures
     • Hands On Exercises
Sqoop
     • Importing and Exporting data from using RDBMS
     • Hands On Exercises – Import and Export
Hive
     • Hive Basics, Internal & External Tables, Partitioning, Buckets
     • Writing queries – Joins, Union, Dynamic partitioning, Sampling
     • Hands On Exercise – Structured data analysis
Pig
     • Pig Basics, Loading data files
     • Writing queries – SPLIT, FILTER, JOIN, GROUP, SAMPLE, ILLUSTRATE etc.
     • Hands On Exercise – Semi-structured Data Analysis
Setting up a Hadoop Cluster ( 2 Nodes )
     • Demo by Instructor
Hadoop Best Practices

Time: 09:30am to 05:30pm 
Fee Details:
Rs. 17,800.00 + 12.36% Service Tax Per Participant
Subject to availability of seats. Terms & Conditions
Registration is first come first serve basis.

Payment Options:
Account NameKnowledgeWorks IT Consulting Pvt. Ltd.,
Bank Name: Kotak Mahindra Bank
Bank  Account Number: 4811265641
Account TypeCurrent Account (CA)
Beneficiary Bank Address:  Jayanagar Branch, Bangalore
RTGS / NEFT / IFSC CodeKKBK0000421
Venue:
KnowledgeWorks IT Consulting Pvt. Ltd.,
No: 65, Sri Vinayaka Towers, 3rd Floor,
8th B Main, 27th Cross, Jayanagar 4th Block
Bangalore – 560011
For any clarifications, Please contact:
Mr. Sudhindra D N (+91 9886221314)
T: +91 80 26630622, 22459941 
E: sudhi@knowledgeworks.co.in