Course Outline

Introduction

Installing and Configuring Dataiku Data Science Studio (DSS) for Government

  • System requirements for Dataiku DSS in government environments
  • Setting up Apache Hadoop and Apache Spark integrations for government use cases
  • Configuring Dataiku DSS with web proxies to ensure secure access
  • Migrating from other platforms to Dataiku DSS, tailored for government agencies

Overview of Dataiku DSS Features and Architecture for Government

  • Core objects and graphs foundational to Dataiku DSS operations in the public sector
  • Understanding recipes in Dataiku DSS for government data processing
  • Types of datasets supported by Dataiku DSS, suitable for government applications

Creating a Dataiku DSS Project for Government

Defining Datasets to Connect to Data Resources in Dataiku DSS for Government

  • Working with DSS connectors and file formats to support government data sources
  • Comparing standard DSS formats with Hadoop-specific formats relevant to government operations
  • Uploading files for a Dataiku DSS project, ensuring compliance with government standards

Overview of the Server Filesystem in Dataiku DSS for Government

Creating and Using Managed Folders in Dataiku DSS for Government

  • Utilizing the Dataiku DSS recipe for merging folders to streamline government data management
  • Differentiating between local and non-local managed folders for enhanced security and efficiency in government projects

Constructing a Filesystem Dataset Using Managed Folder Contents for Government

  • Performing cleanups with a DSS code recipe to maintain data integrity for government use

Working with Metrics Datasets and Internal Stats Datasets in Dataiku DSS for Government

Implementing the DSS Download Recipe for HTTP Dataset for Government Applications

Relocating SQL Datasets and HDFS Datasets Using Dataiku DSS for Government

Ordering Datasets in Dataiku DSS for Government

  • Understanding writer ordering versus read-time ordering to optimize data processing for government projects

Exploring and Preparing Data Visuals for a Dataiku DSS Project for Government

Overview of Dataiku Schemas, Storage Types, and Meanings for Government

Performing Data Cleansing, Normalization, and Enrichment Scripts in Dataiku DSS for Government

Working with Dataiku DSS Charts Interface and Types of Visual Aggregations for Government

Utilizing the Interactive Statistics Feature of DSS for Government

  • Conducting univariate analysis versus bivariate analysis to support government data insights
  • Leveraging the Principal Component Analysis (PCA) DSS tool for advanced government analytics

Overview of Machine Learning with Dataiku DSS for Government

  • Differentiating between supervised ML and unsupervised ML in government applications
  • Referencing DSS ML algorithms and feature handling to support government data models
  • Exploring deep learning capabilities within Dataiku DSS for government use cases

Overview of the Flow Derived from DSS Datasets and Recipes for Government

Transforming Existing Datasets in DSS with Visual Recipes for Government

Utilizing DSS Recipes Based on User-Defined Code for Government Projects

Optimizing Code Exploration and Experimentation with DSS Code Notebooks for Government

Writing Advanced DSS Visualizations and Custom Frontend Features with Webapps for Government

Working with Dataiku DSS Code Reports Feature for Government

Sharing Data Project Elements and Familiarizing with the DSS Dashboard for Government

Designing and Packaging a Dataiku DSS Project as a Reusable Application for Government

Overview of Advanced Methods in Dataiku DSS for Government

  • Implementing optimized datasets partitioning using DSS to enhance government data management
  • Executing specific DSS processing parts through computations in Kubernetes containers to support scalable government operations

Overview of Collaboration and Version Control in Dataiku DSS for Government

Implementing Automation Scenarios, Metrics, and Checks for DSS Project Testing for Government

Deploying and Updating a Project with the DSS Automation Node and Bundles for Government

Working with Real-Time APIs in Dataiku DSS for Government

  • Utilizing additional APIs and REST APIs in DSS to support real-time government data needs

Analyzing and Forecasting Dataiku DSS Time Series for Government

Securing a Project in Dataiku DSS for Government

  • Managing project permissions and dashboard authorizations to ensure secure government data access
  • Implementing advanced security options to protect sensitive government information

Integrating Dataiku DSS with the Cloud for Government

Troubleshooting in Dataiku DSS for Government

Summary and Conclusion for Government Use of Dataiku DSS

Requirements

  • Proficiency with Python, SQL, and R programming languages for government applications
  • Fundamental understanding of data processing using Apache Hadoop and Spark in a public sector context
  • Knowledge of machine learning principles and data modeling techniques
  • Experience with statistical analysis and data science methodologies for government use
  • Skills in visualizing and effectively communicating data insights

Audience

  • Engineers
  • Data Scientists
  • Data Analysts
 21 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories