Course Outline

Quick Overview

  • Data Sources
  • Data Management
  • Recommender Systems
  • Targeted Marketing

Datatypes

  • Structured vs Unstructured Data
  • Static vs Streamed Data
  • Attitudinal, Behavioral, and Demographic Data
  • Data-Driven vs User-Driven Analytics
  • Data Validity
  • Volume, Velocity, and Variety of Data

Models

  • Building Models for Government
  • Statistical Models
  • Machine Learning

Data Classification

  • Clustering Techniques
  • kGroups, k-means, Nearest Neighbors
  • Ant Colony Optimization, Bird Flocking Algorithms

Predictive Models

  • Decision Trees
  • Support Vector Machines
  • Naive Bayes Classification
  • Neural Networks
  • Markov Models
  • Regression Analysis
  • Ensemble Methods

Return on Investment (ROI)

  • Benefit/Cost Ratio
  • Cost of Software
  • Cost of Development
  • Potential Benefits for Government

Building Models for Government

  • Data Preparation (MapReduce)
  • Data Cleansing
  • Choosing Methods
  • Developing the Model
  • Testing the Model
  • Evaluating the Model
  • Deploying and Integrating the Model

Overview of Open Source and Commercial Software for Government

  • Selection of R-Project Packages
  • Python Libraries
  • Hadoop and Mahout
  • Selected Apache Projects Related to Big Data and Analytics
  • Selected Commercial Solutions
  • Integration with Existing Software and Data Sources for Government

Requirements

Comprehensive knowledge of traditional data management and analysis methodologies, including SQL, data warehouses, business intelligence, and OLAP, is essential. Additionally, a solid understanding of fundamental statistical concepts such as mean, variance, probability, and conditional probability is required for government data professionals to effectively support public sector workflows and governance.

 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories