Managing large amounts of data can be challenging. Processing a large number of files incurs heavy I/O overhead. This course explores several options, such as Apptainer Overlay and SQLite, for packing a large number of files into a few files, thereby improving I/O performance. Python scripts are used throughout the course.
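
As a rough illustration of the SQLite approach, the sketch below packs every file under a directory into a single database file and reads one back. It is not taken from the course material: the table name `files`, its columns, and the helper names `pack_files` and `read_file` are hypothetical, chosen only for this example.

```python
import sqlite3
from pathlib import Path


def pack_files(src_dir, db_path):
    """Store every regular file under src_dir as a BLOB row in one SQLite file.

    Assumption: db_path lives outside src_dir, otherwise the database
    would be picked up by the directory walk.
    """
    src = Path(src_dir)
    conn = sqlite3.connect(db_path)
    try:
        with conn:  # commit all inserts as a single transaction
            conn.execute(
                "CREATE TABLE IF NOT EXISTS files "
                "(path TEXT PRIMARY KEY, data BLOB NOT NULL)"
            )
            rows = (
                (p.relative_to(src).as_posix(), p.read_bytes())
                for p in sorted(src.rglob("*"))
                if p.is_file()
            )
            conn.executemany("INSERT OR REPLACE INTO files VALUES (?, ?)", rows)
        count = conn.execute("SELECT COUNT(*) FROM files").fetchone()[0]
    finally:
        conn.close()
    return count


def read_file(db_path, rel_path):
    """Return the stored bytes for rel_path, or None if it is not in the database."""
    conn = sqlite3.connect(db_path)
    try:
        row = conn.execute(
            "SELECT data FROM files WHERE path = ?", (rel_path,)
        ).fetchone()
    finally:
        conn.close()
    return None if row is None else row[0]
```

With this layout, the file system sees one file regardless of how many small files were packed, and individual entries are retrieved by their relative path.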

Format: On-line (Zoom).

Date: Mon, 28 Nov 2022 - 1:00 pm
Data Science Credits: 3