Lesson 1: Air Quality Data Sources

Engage

Learning Objectives

The Data Landscape

"Air quality monitoring has evolved from sparse regulatory networks to dense deployments of low-cost sensors and global satellite coverage. How do we integrate these different data sources while understanding their strengths and limitations?"

Types of Air Quality Monitoring

Type | Instruments | Strengths | Limitations
Regulatory (FRM/FEM) | BAM, TEOM, chemiluminescence | High accuracy, QA/QC, legal standing | Sparse network, expensive, time lag
Low-cost sensors | PMS, SPS30, electrochemical | Dense deployment, real-time, affordable | Variable accuracy, drift, interference
Satellite | MODIS, TROPOMI, GOES | Global coverage, spatial patterns | Column measurements, cloud interference
Mobile monitoring | Vehicle-mounted sensors | Spatial resolution, hot spots | Temporal coverage, expensive

EPA Data Systems

Air Quality System (AQS)

  • Historical data back to 1980
  • All criteria pollutants and HAPs
  • Hourly, daily, annual summaries
  • Site metadata and methods
  • Access: EPA AQS API or download
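As a rough illustration of the API route, the sketch below builds request URLs for daily PM2.5 data, one per calendar year. It assumes the AQS Data API's dailyData/byState service, parameter code 88101 for PM2.5, a two-digit state FIPS code, and that each request must stay within a single calendar year; check these against the current AQS API documentation before relying on them. The function name daily_pm25_urls and the email and key values are placeholders, and no network request is made here.

```python
from urllib.parse import urlencode

# Assumed base URL for the AQS Data API; confirm against the current documentation.
AQS_BASE = "https://aqs.epa.gov/data/api"


def daily_pm25_urls(email, key, state_fips, first_year, last_year, param="88101"):
    """Build one dailyData/byState request URL per calendar year.

    Hypothetical helper: requests are split by year on the assumption that
    the begin and end dates of a request must fall in the same year.
    """
    urls = []
    for year in range(first_year, last_year + 1):
        query = urlencode({
            "email": email,          # registered AQS API account email
            "key": key,              # API key issued for that account
            "param": param,          # 88101 assumed to be the PM2.5 parameter code
            "bdate": f"{year}0101",  # begin date, YYYYMMDD
            "edate": f"{year}1231",  # end date, YYYYMMDD
            "state": state_fips,     # two-digit state FIPS code
        })
        urls.append(f"{AQS_BASE}/dailyData/byState?{query}")
    return urls


# Placeholder credentials; "37" is the FIPS code for North Carolina.
urls = daily_pm25_urls("user@example.com", "test-key", "37", 2019, 2023)
```

Each URL can then be fetched with any HTTP client; the response format and rate limits are described in the AQS API documentation.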

AirNow

  • Real-time data (hourly updates)
  • AQI calculations and forecasts
  • Fire and smoke data integration
  • Public-facing with visualizations
  • Access: AirNow API, open data

Data Quality Metrics

Key Quality Indicators

  • Accuracy: Closeness to true value (bias). Assessed via colocation with reference instruments.
  • Precision: Reproducibility of measurements. Coefficient of variation (CV) or standard deviation.
  • Completeness: Percentage of valid data points. EPA requires ≥75% for regulatory purposes.
  • Detection limit: Minimum concentration reliably measured.
  • Drift: Change in response over time. Requires periodic calibration.
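The first three indicators reduce to short calculations. The following is a minimal sketch using only the Python standard library; the function names and the example readings are illustrative assumptions, not part of any EPA tool.

```python
from statistics import mean, stdev


def mean_bias(sensor, reference):
    """Accuracy: mean of (sensor - reference) over paired collocated readings."""
    return mean(s - r for s, r in zip(sensor, reference))


def coefficient_of_variation(values):
    """Precision: sample standard deviation as a percentage of the mean."""
    return 100.0 * stdev(values) / mean(values)


def completeness(values, expected_count):
    """Completeness: percentage of expected readings that are present (not None)."""
    valid = sum(1 for v in values if v is not None)
    return 100.0 * valid / expected_count


# Illustrative numbers only (units: ug/m3).
bias = mean_bias([12.0, 15.0, 9.0, 20.0], [10.0, 14.0, 8.0, 18.0])
cv = coefficient_of_variation([10.0, 10.5, 9.5])
pct = completeness([1.0, None, 2.0, 3.0], expected_count=4)
```

With these inputs the sensor reads 1.5 ug/m3 high on average, the replicate readings have a CV of 5%, and completeness is 75%, which sits exactly at the regulatory threshold mentioned above.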

For low-cost PM2.5 sensors, EPA's recommended performance targets include R² ≥ 0.70 and a slope of 1.0 ± 0.35 when compared against a collocated reference monitor.
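A collocation comparison against the R² and slope thresholds quoted above can be sketched as an ordinary least-squares fit of sensor readings (y) on reference readings (x). This is a simplified illustration: the function name and the data are assumptions, and a full evaluation would consider more than these two statistics.

```python
def evaluate_against_reference(sensor, reference, min_r2=0.7, slope_tol=0.35):
    """Fit sensor = intercept + slope * reference and check the two thresholds."""
    n = len(sensor)
    mean_x = sum(reference) / n
    mean_y = sum(sensor) / n
    sxx = sum((x - mean_x) ** 2 for x in reference)
    syy = sum((y - mean_y) ** 2 for y in sensor)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(reference, sensor))
    slope = sxy / sxx
    r2 = sxy ** 2 / (sxx * syy)  # squared Pearson correlation
    meets = r2 >= min_r2 and abs(slope - 1.0) <= slope_tol
    return slope, r2, meets


# Illustrative paired hourly means (ug/m3): the sensor reads about 20% high.
reference = [5.0, 10.0, 15.0, 20.0, 25.0]
sensor = [7.0, 13.0, 19.0, 25.0, 31.0]
slope, r2, meets = evaluate_against_reference(sensor, reference)
```

Here the slope is 1.2 and the fit is perfectly linear, so both thresholds are met even though the sensor overestimates; a high R² alone does not imply an unbiased sensor.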

Satellite-Derived Products

Common Products

Product | Source | Resolution | Application
AOD (Aerosol Optical Depth) | MODIS, VIIRS | 3-10 km | PM2.5 estimation
Tropospheric NO2 | TROPOMI | 3.5 x 7 km | Emission mapping
CO column | MOPITT, TROPOMI | 22 km | Fire, transport
O3 column | OMI, TROPOMI | 13 x 24 km | Stratospheric ozone

Key limitation: Satellites measure column-integrated values, not surface concentrations. Statistical models are needed to estimate ground-level pollution from these column measurements.

Activity: Data Exploration

  1. Access EPA AQS data for your state for the past 5 years (PM2.5 daily means)
  2. Download data from at least 3 monitoring sites in different settings (urban, suburban, rural)
  3. Calculate data completeness for each site each year
  4. Identify any sites with low-cost sensors (AirNow Fire and Smoke Map)
  5. Compare the number of regulatory monitors vs. PurpleAir sensors in your county
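For step 3, one possible approach is to count the distinct days with a valid value for each site and year and divide by the number of days in that year. The sketch below assumes the downloaded data has already been reduced to (site_id, date, pm25) tuples, with pm25 set to None for invalid days; that record layout and the function name are assumptions for illustration. It also assumes a daily sampling schedule. Many regulatory PM2.5 monitors sample only every third or sixth day, in which case the denominator should be the number of scheduled sampling days instead.

```python
import calendar
from collections import defaultdict
from datetime import date, timedelta


def completeness_by_site_year(records):
    """Return {(site_id, year): percent of days in the year with a valid value}.

    Site-years with no valid days at all do not appear in the result.
    """
    valid_days = defaultdict(set)
    for site_id, day, value in records:
        if value is not None:
            # A set avoids double-counting days reported more than once.
            valid_days[(site_id, day.year)].add(day)

    result = {}
    for (site_id, year), days in valid_days.items():
        days_in_year = 366 if calendar.isleap(year) else 365
        result[(site_id, year)] = 100.0 * len(days) / days_in_year
    return result


# Illustrative data: site "A" reports 183 valid days in 2020 (a leap year)
# plus one invalid day that should not be counted.
records = [("A", date(2020, 1, 1) + timedelta(days=i), 8.0) for i in range(183)]
records.append(("A", date(2020, 12, 31), None))
summary = completeness_by_site_year(records)
```

In this example site "A" comes out at 50% for 2020, well below the 75% completeness requirement.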

Questions to explore:

  • How representative are regulatory monitors of neighborhood-level exposure?
  • What populations might be underserved by current monitoring?
  • How could low-cost sensors fill monitoring gaps?

Key Takeaway

Air quality data comes from diverse sources with different strengths and limitations. Regulatory monitoring provides accurate, legally defensible data but with sparse coverage. Low-cost sensors enable dense networks but require careful calibration and quality control. Satellites offer global coverage but measure different quantities than ground monitors. Effective data science integrates these sources while understanding their uncertainties.
