Course Outline

  • Section 1: Introduction to Big Data / NoSQL for Government
    • NoSQL overview
    • CAP theorem
    • When NoSQL is appropriate for government use cases
    • Columnar storage
    • NoSQL ecosystem for government applications
  • Section 2: Cassandra Basics
    • Design and architecture
    • Cassandra nodes, clusters, datacenters
    • Keyspaces, tables, rows, and columns
    • Partitioning, replication, tokens
    • Quorum and consistency levels
    • Labs: Interacting with Cassandra using cqlsh on government datasets
  • Section 3: Data Modeling – Part 1
    • Introduction to CQL
    • CQL data types
    • Creating keyspaces and tables for government datasets
    • Choosing columns and types
    • Choosing primary keys
    • Data layout for rows and columns
    • Time to live (TTL)
    • Querying with CQL
    • CQL updates
    • Collections (list / map / set)
    • Labs: Data modeling exercises using CQL; experimenting with queries and supported data types on government datasets
  • Section 4: Data Modeling – Part 2
    • Creating and using secondary indexes
    • Composite keys (partition keys and clustering keys)
    • Time series data
    • Best practices for time series data
    • Counters
    • Lightweight transactions (LWT)
    • Labs: Creating and using indexes; modeling time series data for government scenarios
  • Section 5: Data Modeling Labs: Group Design Session
    • Multiple use cases from various domains are presented, including government-specific examples
    • Students work in groups to come up with designs and models
    • Groups discuss the various designs and analyze the decisions behind them
    • Lab: Implement one of the scenarios
  • Section 6: Cassandra Drivers
    • Introduction to the Java driver
    • CRUD (Create / Read / Update / Delete) operations using the Java client
    • Asynchronous queries
    • Labs: Using the Java API for Cassandra in government projects
  • Section 7: Cassandra Internals
    • Understanding Cassandra's design under the hood
    • SSTables, memtables, commit log
    • Read path and write path
    • Caching
    • Vnodes
  • Section 8: Administration
    • Hardware selection
    • Cassandra distributions
    • Cassandra best practices (compaction, garbage collection)
    • Troubleshooting tools and tips
    • Lab: Students install Cassandra and run benchmarks
  • Section 9: Bonus Lab (Time Permitting)
    • Implement a music service like Pandora / Spotify on Cassandra

Requirements

  • Proficiency in the Java programming language.
  • Experience in a Linux environment, including navigating the command line and editing files with vi or nano.

Duration

  • 21 hours
