Course Outline
Introduction
- Graph databases and libraries for government
Understanding Graph Data
- The graph as a data structure for government
- Using vertices (dots) and edges (lines) to model real-world scenarios in public sector operations
Using Graph Databases to Model, Persist and Process Graph Data
- Local graph algorithms/traversals for government
- neo4j, OrientDB, and Titan for government applications
Exercise: Modeling Graph Data with neo4j
- Whiteboard data modeling for government projects
Beyond Graph Databases: Graph Computing
- Understanding the property graph for government use cases
- Graph modeling different scenarios (software graph, discussion graph, concept graph) for government applications
Solving Real-World Problems with Traversals
- Algorithmic/directed walk over the graph for government operations
- Determining circular dependencies in public sector systems
Case Study: Ranking Discussion Contributors
- Ranking by number and depth of contributed discussions in government forums
- A note on sentiment and concept analysis for government communications
Graph Computing: Local, In-Memory Graph Toolkits
- Graph analysis and visualization tools for government
- JUNG, NetworkX, and iGraph for government applications
Exercise: Modeling Graph Data with NetworkX
- Using NetworkX to model a complex system in government operations
Graph Computing: Batch Processing Graph Frameworks
- Leveraging Hadoop for storage (HDFS) and processing (MapReduce) in government systems
- Overview of iterative algorithms for government use
- Hama, Giraph, and GraphLab for government applications
Graph Computing: Graph-Parallel Computation
- Unifying ETL, exploratory analysis, and iterative graph computation within a single system for government
- GraphX for government operations
Setup and Installation
- Hadoop and Spark for government systems
GraphX Operators
- Property, structural, join, neighborhood aggregation, caching, and uncaching operators for government use
Iterating with Pregel API
- Passing arguments for sending, receiving, and computing in government applications
Building a Graph
- Using vertices and edges in an RDD or on disk for government data
Designing Scalable Algorithms
- GraphX Optimization for government systems
Accessing Additional Algorithms
- PageRank, Connected Components, Triangle Counting for government operations
Exercise: Page Rank and Top Users
- Building and processing graph data using text files as input for government projects
Deploying to Production
Closing Remarks
Requirements
- An understanding of Java programming and frameworks for government applications.
- A general understanding of Python is beneficial but not required.
- Familiarity with database concepts is necessary.
Audience
- Developers working in the public sector
Testimonials (3)
She was very well verse with the material. Very nice, engaging. She always pauses to ask if there are questions or clarifications.
Jones Manlapaz - Nordstern Group
Course - Introduction to Semantic MediaWiki
Very nice training
Maira Frisch - Novartis Pharma AG
Course - SPARQL
He was interactive.