Managing large amounts of data can be a challenging task. Processing large numbers of files incur heavy overhead of IO communications. This course explores several options such as using Apptainer Overlay and SQLite to pack and reduce a large number of files to few files, and hence, improving IO performance. Python scripts are used throughout the course.

Format: On-line (Zoom).

Enseignant: Ching-Hsing Yu
Date: : lun., 28 nov. 2022 - 1:00 pm
Nombre de crédits - science des données: 3