Course outline
The course delivers the key concepts of Big Data . Participants will get familiar with the main technologies involved and with the architectures behind them. Among other, the course will go over the Hadoop Eco system, Spark and NoSQL databases. The course will also discuss the challenges faced by Big Data developers and what are the recommended tools to use in a given situation.
On completing this course delegates will be able to effectively design a process that handles a massive amounts of data using up to data Big Data technologies.
Upcoming meetings
There are no upcoming meetings for this course. Contact us to schedule this course, which will be customized specifically for your organization.
info@hackerupro.comModules
- Key concepts
- Use cases
- Major technologies involved
- Problems with Traditional Large-scale Systems
- The Hadoop Eco-System
- Distributed Processing on a Cluster
- Storage: HDFS Architecture
- Storage: Using HDFS
- Resource Management: YARN Architecture
- Sqoop, Flume and Kafka
- Introduction to Hive
- Introduction to Spark
- Other tools
- Basic concepts
- NoSQL families
- Major players in the market
Prerequisites
- 01 Basic knowledge of database concepts and development environments