Javascript is currently not supported, or is disabled by this browser. Please enable Javascript for full functionality.

    May 21, 2024  
2022-2023 Graduate Catalog 
2022-2023 Graduate Catalog [ARCHIVED CATALOG]

Add to My Catalog (opens a new window)

ADTA 5240 - Harvesting, Storing and Retrieving Data

3 hours

Provides an introduction to collecting, storing, managing, retrieving and processing datasets. Techniques for large and small datasets are considered, as both are needed in data science applications. Traditional survey and experimental design principles for data collection as well as script-based programming techniques for large-scale data harvesting from third party sources are covered. Data wrangling methodologies are introduced for cleaning and merging datasets, storing data for later analysis and constructing derived datasets. Various storage and process architectures are introduced with a focus on how approaches depend on applications, data velocity and end users. Emphasizes applications and includes many hands-on projects.

Prerequisite(s): None.

Course specific fees (in addition to tuition and mandatory):
Academic (AF) per hour: $24.70

Add to My Catalog (opens a new window)