Big Data Overview

arrow_back Data Analytics

This course provides an in-depth overview of the choices you have in processing Big Data. It provides an introduction to what Big Data is, the types of data you might have, approaches to working on and processing the data, and the capabilities, strengths, and weaknesses of those approaches.
After taking this course, you will have a clear understanding of what Big Data is and of the various types of data you may encounter. You will know different techniques and technologies for working with your data, where those technologies are a good fit, and where they are not. You will be well prepared for evaluating what approach is best suited for your needs.

This course provides an in-depth overview of the choices you have in processing Big Data. It provides an introduction to what Big Data is, the types of data you might have, approaches to working on and processing the data, and the capabilities, strengths, and weaknesses of those approaches.

After taking this course, you will have a clear understanding of what Big Data is and of the various types of data you may encounter. You will know different techniques and technologies for working with your data, where those technologies are a good fit, and where they are not. You will be well prepared for evaluating what approach is best suited for your needs.

Course Objectives:

  • Understand what Big Data is
  • Know the difference between “data-at-rest” and “data-in-motion”
  • Understand what map-reduce / Hadoop is, and what it can do
  • Be aware of query technologies for easily querying with Hadoop (e.g. Hive, Pig, and others)
  • Understand what NoSQL databases are and what they can do
  • Become familiar with the choices in the NoSQL landscape
  • Understand the strengths and weaknesses of different NoSQL technologies
  • Be well-informed on your choices in Big Data processing, and evaluate them for your needs

Modules

Module 1: Understanding Big Data

  • Big Data Characteristics
  • Relational Model Overview
  • Working with Big Data
  • Data Consistency and CAP

Module 2: NewSQL Databases

  • NewSQL Overview
  • Product Overviews
  • Summary

Module 3: NoSQL Overview

  • Differences from Relational Model
  • Types of NoSQL Stores
    • Document Data Model
    • Graph Data Model
    • Key/Value
    • Wide Columnar
    • Hadoop

Module 4: Hadoop and MapReduce

  • Overview
  • HDFS
  • YARN
  • MapReduce
  • Summary

Module 5: Other Processing Technologies

  • Apache Pig and Hive
  • Apache Impala
  • Apache Storm
  • Apache Spark

Module 6: MongoDB

  • Overview and Architecture
  • Summary of Strenghts/Weaknesses

Module 7: Cassandra Database

  • Overview and Architecture
  • Summary of Strenghts/Weaknesses

Module 8: Other Databases and Tools

Solicitar información






GRUPO TRENTIA como responsable del tratamiento tratará tus datos con la finalidad de dar respuesta a tu consulta o petición. Puedes acceder, rectificar y suprimir tus datos, así como ejercer otros derechos consultando la información adicional y detallada sobre protección de datos en nuestra Política de Privacidad.

Nos gustaría que nos prestaras tu consentimiento para: