This week's colloquium: "Web Scraping in Python" by Yohai Meiron from SciNet. The Compute Ontario Colloquia are weekly Zoom presentations on Advanced Research Computing, High Performance Computing, Research Data Management, and Research Software topics, delivered by staff from three Compute Ontario consortia (CAC, SciNet, SHARCNET) and guest speakers. The colloquia are one hour long and include time for questions. No enrollment or registration is required. Most presentations are recorded and uploaded to the hosting consortium's video channel.
Web scraping is a method for extracting data from websites. It involves programmatically downloading web pages and parsing their HTML to extract the necessary information. It can be used to harvest data for statistical analysis, training machine learning models, and creating alerts. In this talk, we'll discuss how to use basic programming skills in Python to scrape the web. We'll examine the technical and ethical aspects of the method, as well as practical applications.
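To give a rough idea of what "parsing HTML to extract information" can look like, here is a minimal sketch using only Python's standard library. It is not taken from the talk: the sample page, the sample robots.txt, the `demo-bot` user agent, and the helper names `LinkExtractor`, `extract_links`, and `allowed_by_robots` are all hypothetical and chosen for illustration. The HTML is embedded as a string so the example runs offline; in practice the page would first be downloaded, for instance with `urllib.request.urlopen`.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.robotparser import RobotFileParser


class LinkExtractor(HTMLParser):
    """Collect (absolute URL, link text) pairs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None  # URL of the <a> tag currently open, if any
        self._text = []    # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page's own URL.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html, base_url):
    """Return every link found in an HTML string."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def allowed_by_robots(robots_txt, user_agent, url):
    """Check a URL against the text of a site's robots.txt file."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules.can_fetch(user_agent, url)


# Hypothetical page and robots.txt, embedded so the sketch needs no network.
SAMPLE_HTML = """
<html><body>
  <h1>Upcoming talks</h1>
  <ul>
    <li><a href="/talks/scraping">Web Scraping in Python</a></li>
    <li><a href="https://example.org/talks/hpc">Intro to HPC</a></li>
  </ul>
</body></html>
"""

SAMPLE_ROBOTS = """
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    for url, text in extract_links(SAMPLE_HTML, "https://example.org/"):
        permitted = allowed_by_robots(SAMPLE_ROBOTS, "demo-bot", url)
        print(f"{text}: {url} (allowed by robots.txt: {permitted})")
```

Checking robots.txt before fetching is one simple courtesy a scraper can observe; third-party libraries offer more convenient HTML parsing, but the standard library is enough to show the basic idea.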