Course Outline

Introduction

  • Defining "Industrial-Strength Natural Language Processing" for government applications

Installing spaCy

spaCy Components

  • Part-of-speech tagger
  • Named entity recognizer
  • Dependency parser

Overview of spaCy Features and Syntax for government use

Understanding spaCy Modeling

  • Statistical modeling and prediction in public sector workflows

Using the SpaCy Command Line Interface (CLI)

  • Basic commands for government tasks

Creating a Simple Application to Predict Behavior for government applications

Training a New Statistical Model

  • Data (for training) in alignment with public sector requirements
  • Labels (tags, named entities, etc.) specific to government contexts

Loading the Model

  • Shuffling and looping data for optimal performance in government systems

Saving the Model for secure storage and retrieval in government environments

Providing Feedback to the Model

  • Error gradient analysis for continuous improvement in public sector applications

Updating the Model

  • Updating the entity recognizer to enhance accuracy for government use
  • Extracting tokens with rule-based matcher for detailed data processing in government systems

Developing a Generalized Theory for Expected Outcomes in public sector applications

Case Study

  • Distinguishing Product Names from Company Names in government datasets

Refining the Training Data

  • Selecting representative data to ensure relevance and accuracy for government tasks
  • Setting the dropout rate to optimize model performance in public sector workflows

Other Training Styles

  • Passing raw texts for flexible data input in government applications
  • Passing dictionaries of annotations for detailed data annotation in public sector environments

Using spaCy to Pre-process Text for Deep Learning in government systems

Integrating spaCy with Legacy Applications for seamless adoption in government IT infrastructure

Testing and Debugging the spaCy Model

  • The importance of iteration for robust model development in public sector projects

Deploying the Model to Production for efficient implementation in government operations

Monitoring and Adjusting the Model to maintain performance and relevance in government contexts

Troubleshooting common issues for effective problem resolution in government applications

Summary and Conclusion for government stakeholders

Requirements

  • Proficiency in Python programming.
  • Fundamental knowledge of statistics.
  • Familiarity with command-line operations.

Audience

  • Software developers for government.
  • Data scientists for government.
 14 Hours

Number of participants


Price per participant

Testimonials (3)

Upcoming Courses

Related Categories