Web Scraping with Python Training Course
Web scraping is a technique for extracting data from websites and saving it to local files or databases.
This instructor-led, live training (online or onsite) is aimed at developers who wish to use Python to automate the process of crawling multiple websites to extract data for processing and analysis.
By the end of this training, participants will be able to:
- Install and configure Python and all relevant packages.
- Retrieve and parse data stored across various websites.
- Understand how websites function and how their HTML is structured.
- Construct spiders to crawl the web at scale.
- Use Selenium to crawl AJAX-driven web pages.
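As a rough illustration of the "retrieve and parse" objective above, the following sketch extracts links from an HTML document. It is not taken from the course material: it uses only the Python standard library (`html.parser`, `urllib.parse`) rather than BeautifulSoup or Scrapy so that it runs without extra packages, and the `LinkExtractor` class, the `extract_links` helper, the sample HTML, and the `example.com` URLs are all hypothetical names chosen for the example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:  # skip anchors that have no href attribute
            self.links.append(urljoin(self.base_url, href))


# Hypothetical page content; a real scraper would download this first.
SAMPLE_HTML = """
<html><body>
  <h1>Reports</h1>
  <a href="/reports/2023.html">2023</a>
  <a href="https://example.org/about">About</a>
  <a name="top">No href here</a>
</body></html>
"""


def extract_links(html, base_url):
    """Return absolute URLs for all links found in the given HTML string."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


links = extract_links(SAMPLE_HTML, "https://example.com/index.html")
print(links)
```

Relative links are resolved against the page URL with `urljoin`, which is usually needed before a link can be requested or stored. A library such as BeautifulSoup would typically replace the hand-written parser class in practice.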
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- This course assumes prior knowledge of programming.
- To request customized training for this course, please contact us to arrange.
Course Outline
Introduction
Setting Up the Development Environment
Python Primer: Data Structures, Conditionals, File Handling, etc.
Python Packages for Web Scraping: Scrapy and BeautifulSoup
Understanding How a Website Works
Structure of HTML
Making a Web Request
Scraping an HTML Page
Working with XPath and CSS Selectors
Filtering Data Using Regular Expressions
Creating a Web Crawler
Crawling AJAX and JavaScript Pages with Selenium
Web Scraping Best Practices
Troubleshooting
Summary and Conclusion
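Two of the outline topics, creating a web crawler and filtering data with regular expressions, can be sketched together as follows. This is a minimal illustration under stated assumptions, not the course's own code: the `FAKE_SITE` dictionary stands in for real HTTP requests so the example runs offline, the `fetch` and `crawl` functions and both regular expressions are hypothetical, and extracting `href` values with a regex is a simplification that an HTML parser or Scrapy would normally replace.

```python
import re
from collections import deque
from urllib.parse import urljoin, urlparse

# Hypothetical in-memory "site" so the sketch runs without network access.
FAKE_SITE = {
    "https://example.com/": '<a href="/a.html">A</a> <a href="/b.html">B</a>',
    "https://example.com/a.html": '<a href="/b.html">B</a> Contact: info@example.com',
    "https://example.com/b.html": '<a href="/">Home</a> <a href="https://other.example/x">X</a>',
}

HREF_RE = re.compile(r'href="([^"]+)"')           # simplistic link extraction
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")  # simplistic email filter


def fetch(url):
    """Stand-in for an HTTP request; returns '' for unknown pages."""
    return FAKE_SITE.get(url, "")


def crawl(start_url, max_pages=10):
    """Breadth-first crawl restricted to the start URL's host."""
    host = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    emails = set()
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        visited.append(url)
        emails.update(EMAIL_RE.findall(html))
        for href in HREF_RE.findall(html):
            link = urljoin(url, href)
            # Stay on the same host and never queue a page twice.
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return visited, sorted(emails)


visited, emails = crawl("https://example.com/")
print(visited)
print(emails)
```

The `seen` set and the `max_pages` limit keep the crawl from looping or growing without bound, and the host check keeps it from wandering onto other sites. A production crawler would also respect `robots.txt` and rate limits.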
Requirements
- Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include additional introductory Python exercises.
Audience
- Developers
Runs with a minimum of 4+ people. For 1-to-1 or private group training, request a quote.
Testimonials (1)
Many different examples and topics have been covered, from basic investigation to login management and dynamic page management.
Daniele Tagliaferro - Creditsafe Italia Srl
Course - Web Scraping with Python
Related Courses
Advanced Python: Best Practices and Design Patterns
28 Hours
Agentic AI Engineering with Python: Build Autonomous Agents
21 Hours
Introduction to Data Science and AI using Python
35 Hours
Artificial Intelligence with Python (Intermediate Level)
35 Hours
Algorithmic Trading with Python and R
14 Hours
This instructor-led, live training (online or onsite) is aimed at business analysts who wish to automate trade processes using algorithmic trading, Python, and R.
By the end of this training, participants will be able to:
- Use algorithms to buy and sell securities rapidly at precise intervals.
- Minimize costs associated with trading through algorithmic methods.
- Automatically monitor stock prices and execute trades accordingly.
Applied AI from Scratch in Python
28 Hours
AWS Cloud9 and Python: A Practical Guide
14 Hours
Python and Blockchain
28 Hours
Building Chatbots in Python
21 Hours
GPU Programming with CUDA and Python
14 Hours
This instructor-led, live training in the US (online or onsite) is aimed at intermediate-level developers who wish to use CUDA to build Python applications that run in parallel on NVIDIA GPUs.
By the end of this training, participants will be able to:
- Use the Numba compiler to accelerate Python applications running on NVIDIA GPUs.
- Develop, compile, and deploy custom CUDA kernels.
- Manage GPU memory effectively.
- Convert a CPU-based application into a GPU-accelerated application.
Scaling Data Analysis with Python and Dask
14 Hours
Data Analysis with Python, Pandas and Numpy
14 Hours
This instructor-led, live training in the US (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code to ensure compliance with public sector governance standards.