Course Outline

  • Section 1: Introduction to Big Data / NoSQL for Government
    • NoSQL Overview
    • CAP Theorem
    • When is NoSQL Appropriate
    • Columnar Storage
    • NoSQL Ecosystem
  • Section 2: Cassandra Basics for Government
    • Design and Architecture
    • Cassandra Nodes, Clusters, Datacenters
    • Keyspaces, Tables, Rows, and Columns
    • Partitioning, Replication, Tokens
    • Quorum and Consistency Levels
    • Labs: Interacting with Cassandra Using CQLSH
  • Section 3: Data Modeling – Part 1 for Government
    • Introduction to CQL
    • CQL Datatypes
    • Creating Keyspaces & Tables
    • Choosing Columns and Types
    • Choosing Primary Keys
    • Data Layout for Rows and Columns
    • Time to Live (TTL)
    • Querying with CQL
    • CQL Updates
    • Collections (List / Map / Set)
    • Labs: Various Data Modeling Exercises Using CQL; Experimenting with Queries and Supported Data Types
  • Section 4: Data Modeling – Part 2 for Government
    • Creating and Using Secondary Indexes
    • Composite Keys (Partition Keys and Clustering Keys)
    • Time Series Data
    • Best Practices for Time Series Data
    • Counters
    • Lightweight Transactions (LWT)
    • Labs: Creating and Using Indexes; Modeling Time Series Data
  • Section 5: Cassandra Internals for Government
    • Understanding Cassandra Design Under the Hood
    • SSTables, Memtables, Commit Log
  • Section 6: Administration for Government
    • Hardware Selection
    • Cassandra Distributions
    • Cassandra Nodes Communication
    • Writing and Reading Data to/From the Storage Engine
    • Data Directories
    • Anti-Entropy Operations
    • Cassandra Compaction
    • Choosing and Implementing Compaction Strategies
    • Cassandra Best Practices (Compaction, Garbage Collection)
    • Creating a Test Cassandra Instance with Low Memory Footprint
    • Troubleshooting Tools and Tips
    • Lab: Students Install Cassandra, Run Benchmarks

Requirements

  • Proficient in a Linux environment (navigating the command line, editing files with vi or nano)
  • For on-site courses, participants should bring a laptop or desktop equipped with 8 GB of RAM
  • For remote courses, a functional Cassandra lab will be provided for government use, requiring only a web browser
 14 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories