Course Outline

  • Section 1: Introduction to Big Data / NoSQL for Government
    • NoSQL overview
    • CAP theorem
    • When is NoSQL appropriate for government
    • Columnar storage
    • NoSQL ecosystem
  • Section 2: Cassandra Basics for Government
    • Design and architecture
    • Cassandra nodes, clusters, datacenters
    • Keyspaces, tables, rows, and columns
    • Partitioning, replication, tokens
    • Quorum and consistency levels
    • Labs: Interacting with Cassandra using cqlsh for government applications
  • Section 3: Data Modeling – Part 1 for Government
    • Introduction to CQL
    • CQL datatypes
    • Creating keyspaces and tables
    • Choosing columns and types
    • Choosing primary keys
    • Data layout for rows and columns
    • Time to live (TTL)
    • Querying with CQL
    • CQL updates
    • Collections (list, map, set)
    • Labs: Various data modeling exercises using CQL; experimenting with queries and supported data types for government use cases
  • Section 4: Data Modeling – Part 2 for Government
    • Creating and using secondary indexes
    • Composite keys (partition keys and clustering keys)
    • Time series data
    • Best practices for time series data in government applications
    • Counters
    • Lightweight transactions (LWT)
    • Labs: Creating and using indexes; modeling time series data for government projects
  • Section 5: Data Modeling Labs – Group Design Session for Government
    • Multiple use cases from various domains are presented
    • Students work in groups to develop designs and models for government applications
    • Discuss various designs, analyze decisions
    • Lab: Implement one of the scenarios for a government project
  • Section 6: Cassandra Drivers for Government
    • Introduction to the Java driver
    • CRUD (Create, Read, Update, Delete) operations using the Java client for government systems
    • Asynchronous queries
    • Labs: Using the Java API for Cassandra in government applications
  • Section 7: Cassandra Internals for Government
    • Understanding Cassandra design under the hood
    • SSTables, memtables, commit log
    • Read path and write path
    • Caching
    • Vnodes
  • Section 8: Administration for Government
    • Hardware selection
    • Cassandra distributions suitable for government use
    • Installing Cassandra in a government environment
    • Running benchmarks
    • Tooling for monitoring performance and node activities
      • DataStax OpsCenter
    • Diagnosing Cassandra performance issues
    • Investigating a node crash
    • Understanding data repair, deletion, and replication
    • Other troubleshooting tools and tips
    • Cassandra best practices (compaction, garbage collection)
  • Section 9: Bonus Lab (Time Permitting)
    • Implement a music service like Pandora / Spotify on Cassandra
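
As a small taste of the "Quorum and consistency levels" topic in Section 2, the arithmetic behind quorum can be sketched in a few lines of Java. This is an illustrative sketch only, not course material: the class name `QuorumSketch` and its method names are hypothetical, and it assumes a single datacenter with a fixed replication factor. It shows that a quorum is a majority of replicas, and that a read is guaranteed to reach at least one replica holding the latest acknowledged write when read replicas plus write replicas exceed the replication factor.

```java
public class QuorumSketch {

    // Quorum is a majority of replicas: floor(RF / 2) + 1.
    static int quorum(int replicationFactor) {
        return replicationFactor / 2 + 1;
    }

    // The read set and write set must share at least one replica
    // when readReplicas + writeReplicas > replicationFactor.
    static boolean overlaps(int readReplicas, int writeReplicas, int replicationFactor) {
        return readReplicas + writeReplicas > replicationFactor;
    }

    public static void main(String[] args) {
        int rf = 3; // assumed replication factor for the example
        int q = quorum(rf);
        System.out.println("RF=" + rf + " quorum=" + q);
        System.out.println("QUORUM read + QUORUM write overlap: " + overlaps(q, q, rf));
        System.out.println("ONE read + ONE write overlap: " + overlaps(1, 1, rf));
    }
}
```

With a replication factor of 3, a quorum is 2 replicas, so quorum reads combined with quorum writes overlap, while reading and writing at a single replica each do not.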

Requirements

  • Proficiency in the Java programming language
  • Experience working in a Linux environment (navigating the command line, editing files with vi or nano)

Lab Environment:

A fully functional Cassandra environment will be provided for students. Participants will need an SSH client and a web browser to access the cluster.

Zero Installation: Students do not need to install Cassandra on their own machines.

Duration: 21 hours
