Course Outline
Introduction
- Apache Spark vs Hadoop MapReduce
Overview of Apache Spark Features and Architecture for Government
Choosing a Programming Language for Government
Setting up Apache Spark for Government
Creating a Sample Application for Government
Choosing the Data Set for Government
Running Data Analysis on the Data for Government
Processing of Structured Data with Spark SQL for Government
Processing Streaming Data with Spark Streaming for Government
Integrating Apache Spark with Third-Party Machine Learning Tools for Government
Using Apache Spark for Graph Processing for Government
Optimizing Apache Spark for Government
Troubleshooting for Government
Summary and Conclusion for Government
Requirements
- Proficiency in using the Linux command line for government tasks
- A foundational knowledge of data processing methodologies
- Programming expertise in Java, Scala, Python, or R
Target Audience
- Software developers for government applications
Testimonials (5)
The live examples
Ahmet Bolat - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
very interactive...
Richard Langford
Course - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
Course - A Practical Introduction to Stream Processing
Get to learn spark streaming , databricks and aws redshift
Lim Meng Tee - Jobstreet.com Shared Services Sdn. Bhd.
Course - Apache Spark in the Cloud
practice tasks