Course Outline

Greenplum Architecture for Government

  • Parallel processing and symmetric multi-processing capabilities
  • Segment roles and cluster configuration best practices
  • Scalability considerations and data movement strategies
  • Overview of the Greenplum Data Warehouse architecture for government

Greenplum Table Structures for Government

  • Differentiating between distributed and randomly assigned tables
  • Comparing heap and append-only table types
  • Row versus columnar storage formats for optimal performance
  • Partitioned and clustered tables for efficient data management

Data Distribution and Hashing for Government

  • Understanding hashing logic and distribution keys
  • Handling skew and its performance impact
  • Utilizing hash maps and row placement strategies

Indexes and Performance Optimization for Government

  • Clustered versus non-clustered indexes
  • Use cases for B-tree and bitmap indexes
  • Index scan operations and storage behavior

Physical Database Design for Government

  • Normalization techniques and logical model design
  • User access strategies and distribution analysis
  • Data demographics and indexing decisions

Denormalization Techniques for Government

  • Derived data, summary tables, and pre-joins
  • Columnar tables as a vertical partitioning method
  • Data marts and materialized views for enhanced query performance

Advanced SQL and Query Execution for Government

  • Join strategies and data redistribution techniques
  • OLAP (Online Analytical Processing) and window functions
  • Temporary tables, subqueries, and derived tables

EXPLAIN Plans and Query Tuning for Government

  • Reading and interpreting EXPLAIN output
  • Cost analysis and plan optimization methods
  • Join movement and segment-local operations

Greenplum Utilities and Best Practices for Government

  • ANALYZE and VACUUM commands for database maintenance
  • Data loading and movement using the Nexus tool
  • Security, permissions, and performance optimization tips for government

Summary and Next Steps for Government

Requirements

  • An understanding of relational databases and SQL for government applications.
  • Experience with data warehousing or analytical systems in a public sector context.
  • Familiarity with Linux command line operations to support governmental IT infrastructure.

Audience

  • Data architects and engineers working for government agencies.
  • Database administrators and technical leads responsible for public sector data management.
  • BI developers and analytics specialists utilizing Greenplum in government projects.
 21 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories