Course Outline

  • Section 1: Introduction to Big Data and NoSQL for Government
    • NoSQL overview for government applications
    • CAP theorem and its implications for government data systems
    • Scenarios where NoSQL is appropriate for government use cases
    • Columnar storage techniques for efficient data retrieval in government databases
    • The NoSQL ecosystem and its relevance to government IT infrastructure
  • Section 2: Cassandra Basics for Government
    • Design and architecture of Cassandra for government systems
    • Cassandra nodes, clusters, and datacenters in a government context
    • Keyspaces, tables, rows, and columns in government datasets
    • Partitioning, replication, and token management for government data integrity
    • Quorum and consistency levels to ensure reliable government data operations
    • Labs: Interacting with Cassandra using CQLSH for government applications
  • Section 3: Data Modeling – Part 1 for Government
    • Introduction to CQL for government data modeling
    • CQL datatypes suitable for government datasets
    • Creating keyspaces and tables for government use cases
    • Selecting appropriate columns and types for government data
    • Choosing primary keys to optimize government data queries
    • Data layout strategies for efficient row and column management in government systems
    • Time-to-live (TTL) settings for managing temporary government data
    • Querying with CQL for effective government data retrieval
    • CQL updates to maintain accurate government records
    • Collections (list, map, set) for organizing complex government data
    • Labs: Various data modeling exercises using CQL; experimenting with queries and supported data types in a government context
  • Section 4: Data Modeling – Part 2 for Government
    • Creating and using secondary indexes to enhance government data queries
    • Composite keys (partition keys and clustering keys) for efficient government data organization
    • Handling time series data in government applications
    • Best practices for managing time series data in government systems
    • Using counters to aggregate government data
    • Lightweight transactions (LWT) to ensure data consistency in government operations
    • Labs: Creating and using indexes; modeling time series data for government applications
  • Section 5: Data Modeling Labs – Group Design Session for Government
    • Presentation of multiple use cases from various government domains
    • Students work in groups to design and model data solutions for government scenarios
    • Discussion and analysis of different designs, evaluating decision-making processes
    • Labs: Implement one of the government scenarios using the designed models
  • Section 6: Cassandra Drivers for Government Applications
    • Introduction to the Java driver for government use
    • Performing CRUD (Create, Read, Update, Delete) operations using the Java client in government systems
    • Asynchronous queries for efficient data processing in government applications
    • Labs: Using the Java API for Cassandra in a government context
  • Section 7: Cassandra Internals for Government
    • Understanding the underlying design of Cassandra for government systems
    • SSTables, memtables, and commit logs in government data storage
    • The read path and write path in government data operations
    • Caching mechanisms to optimize government data performance
    • Vnodes for distributed government data management
  • Section 8: Administration of Cassandra for Government
    • Hardware selection criteria for government Cassandra deployments
    • Cassandra distributions suitable for government environments
    • Installing and configuring Cassandra for government use
    • Running benchmarks to evaluate cluster performance in government deployments
    • Tooling for monitoring performance and node activities in government systems
      • DataStax OpsCenter for monitoring and managing government Cassandra clusters
    • Diagnosing and resolving Cassandra performance issues in government applications
    • Investigating and recovering from a node crash in government systems
    • Understanding data repair, deletion, and replication for government data integrity
    • Other troubleshooting tools and tips for maintaining government Cassandra systems
    • Cassandra best practices (compaction, garbage collection) for efficient government data management
  • Section 9: Bonus Lab for Government (Time Permitting)
    • Implementing a music service like Pandora/Spotify using Cassandra

Requirements

  • Proficient in the Java programming language
  • Experienced in a Linux environment (navigating the command line, editing files with vi or nano)

Lab Environment

A functional Cassandra environment will be provided for students. Participants will require an SSH client and a web browser to access the cluster.

Zero Installation: There is no need to install Cassandra on students' machines, ensuring a streamlined experience for government training sessions.

Duration: 21 hours
