Course Outline
Introduction
Installing and Configuring Dataiku Data Science Studio (DSS) for Government
- System requirements for Dataiku DSS in government environments
- Setting up Apache Hadoop and Apache Spark integrations for government use cases
- Configuring Dataiku DSS with web proxies to ensure secure access
- Migrating from other platforms to Dataiku DSS, tailored for government agencies
Overview of Dataiku DSS Features and Architecture for Government
- Core objects and graphs foundational to Dataiku DSS operations in the public sector
- Understanding recipes in Dataiku DSS for government data processing
- Types of datasets supported by Dataiku DSS, suitable for government applications
Creating a Dataiku DSS Project for Government
Defining Datasets to Connect to Data Resources in Dataiku DSS for Government
- Working with DSS connectors and file formats to support government data sources
- Comparing standard DSS formats with Hadoop-specific formats relevant to government operations
- Uploading files for a Dataiku DSS project, ensuring compliance with government standards
Overview of the Server Filesystem in Dataiku DSS for Government
Creating and Using Managed Folders in Dataiku DSS for Government
- Utilizing the Dataiku DSS recipe for merging folders to streamline government data management
- Differentiating between local and non-local managed folders for enhanced security and efficiency in government projects
Constructing a Filesystem Dataset Using Managed Folder Contents for Government
- Performing cleanups with a DSS code recipe to maintain data integrity for government use
Working with Metrics Datasets and Internal Stats Datasets in Dataiku DSS for Government
Implementing the DSS Download Recipe for HTTP Dataset for Government Applications
Relocating SQL Datasets and HDFS Datasets Using Dataiku DSS for Government
Ordering Datasets in Dataiku DSS for Government
- Understanding writer ordering versus read-time ordering to optimize data processing for government projects
Exploring and Preparing Data Visuals for a Dataiku DSS Project for Government
Overview of Dataiku Schemas, Storage Types, and Meanings for Government
Performing Data Cleansing, Normalization, and Enrichment Scripts in Dataiku DSS for Government
Working with Dataiku DSS Charts Interface and Types of Visual Aggregations for Government
Utilizing the Interactive Statistics Feature of DSS for Government
- Conducting univariate analysis versus bivariate analysis to support government data insights
- Leveraging the Principal Component Analysis (PCA) DSS tool for advanced government analytics
Overview of Machine Learning with Dataiku DSS for Government
- Differentiating between supervised ML and unsupervised ML in government applications
- Referencing DSS ML algorithms and feature handling to support government data models
- Exploring deep learning capabilities within Dataiku DSS for government use cases
Overview of the Flow Derived from DSS Datasets and Recipes for Government
Transforming Existing Datasets in DSS with Visual Recipes for Government
Utilizing DSS Recipes Based on User-Defined Code for Government Projects
Optimizing Code Exploration and Experimentation with DSS Code Notebooks for Government
Writing Advanced DSS Visualizations and Custom Frontend Features with Webapps for Government
Working with Dataiku DSS Code Reports Feature for Government
Sharing Data Project Elements and Familiarizing with the DSS Dashboard for Government
Designing and Packaging a Dataiku DSS Project as a Reusable Application for Government
Overview of Advanced Methods in Dataiku DSS for Government
- Implementing optimized datasets partitioning using DSS to enhance government data management
- Executing specific DSS processing parts through computations in Kubernetes containers to support scalable government operations
Overview of Collaboration and Version Control in Dataiku DSS for Government
Implementing Automation Scenarios, Metrics, and Checks for DSS Project Testing for Government
Deploying and Updating a Project with the DSS Automation Node and Bundles for Government
Working with Real-Time APIs in Dataiku DSS for Government
- Utilizing additional APIs and REST APIs in DSS to support real-time government data needs
Analyzing and Forecasting Dataiku DSS Time Series for Government
Securing a Project in Dataiku DSS for Government
- Managing project permissions and dashboard authorizations to ensure secure government data access
- Implementing advanced security options to protect sensitive government information
Integrating Dataiku DSS with the Cloud for Government
Troubleshooting in Dataiku DSS for Government
Summary and Conclusion for Government Use of Dataiku DSS
Requirements
- Proficiency with Python, SQL, and R programming languages for government applications
- Fundamental understanding of data processing using Apache Hadoop and Spark in a public sector context
- Knowledge of machine learning principles and data modeling techniques
- Experience with statistical analysis and data science methodologies for government use
- Skills in visualizing and effectively communicating data insights
Audience
- Engineers
- Data Scientists
- Data Analysts