comScoreTag

Certificate in Big Data Modelling and Analytics Workshop I

Certificate in Big Data Modelling and Analytics Workshop I
Organizer: PEAK
Jul 13, 2019 (Sat)
9:30am - 6:00pm
VTC Tower, 27 Wood Road, Wanchai
$ 5,300

Introduction

The course aims to provide students with knowledge in Big Data platforms, data modelling, virtualization and analytics. Participants are expected to get familiar with different big data platforms, programming methodologies and virtualization tools for analytics. The programme provides an insight on how business world is using Big Data to improve their business models.

Objectives

  1. Candidates can setup Big Hadoop Data Platform and start Big Data process into HBase and extract HBase data for other integration and business cases.
  2. Candidate can setup Hive for advanced level data process engine for other integrated module.
  3. Candidate can apply streaming engine to stream social media contents into Hive and moving data between Hadoop, HBase to Hive
  4. Candidate will learn how to create Java Program to access HBase Table directly including Table operation, data processing
  5. Candidate will learn how to create Java Program to access Hive JDBC for data processing
  6. Candidate will learn basic R programming and some basic modelling such as Decision Tree, Linear Regression, Non-linear Regression, NeuralNetwork with Demo Data from R and some stock market data from R virtualization tool.
  7. Candidate will learn how to use Qlikview for data presentation (Personal Edition)
  8. Create Dashboard from QlikView with basic analytic features and business cases
  9. Candidate will understand, what is Advance Data Analytics Tools (Raid Miner, Alteryx and business cases)
  10. Candidate will learn some basic troubleshooting skill during the exercise.

Course Contents

  1. Hadoop Big Data Platform
  • Hadoop Big Data Framework & Platform
  • Hbase Introduction, Setup and Practice
  • Hive Introduction, Setup and Practice
  • Flume Introduction, Setup and Practice
  1. R Programming
  • External Integration with R Studio
  • Data Modelling (Decision Tree, Linear Regression, Non-linear Regression, Neural Network)
  1. Data Analytics 
  • Introduce Data Virtualization Tool for Analytics (QlikView)
  • Introduce Data Virtualization Tool for Analytics (RaidMiner, Alteryx)
For more details and registration, please click here.