Journal cover Journal topic
Atmospheric Chemistry and Physics An interactive open-access journal of the European Geosciences Union
Journal topic
Atmos. Chem. Phys., 19, 8363-8381, 2019
https://doi.org/10.5194/acp-19-8363-2019
Atmos. Chem. Phys., 19, 8363-8381, 2019
https://doi.org/10.5194/acp-19-8363-2019

Research article 02 Jul 2019

Research article | 02 Jul 2019

# Contributions to local- and regional-scale formaldehyde concentrations

Contributions to local- and regional-scale formaldehyde concentrations
Lucas A. J. Bastien1,2, Nancy J. Brown2, and Robert A. Harley1,2 Lucas A. J. Bastien et al.
• 1Department of Civil and Environmental Engineering, University of California at Berkeley, Davis Hall, Berkeley, CA 94720, USA
• 2Energy Analysis and Environmental Impacts Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
Abstract

Reducing ambient formaldehyde concentrations is a complex task because formaldehyde is both a primary and a secondary air pollutant, with significant anthropogenic and biogenic sources of volatile organic compound (VOC) precursor emissions. This work uses adjoint sensitivity analysis in a chemical transport model to identify emission sources and chemical reactions that influence formaldehyde mixing ratios in the San Francisco Bay Area and within three urbanized sub-areas. For each of these receptors, the use of the adjoint technique allows for efficient calculation of the sensitivity of formaldehyde to emissions of NOx, formaldehyde, and VOC precursors occurring at any location and time. Formaldehyde mixing ratios are found to be generally higher in summer than in winter. The opposite seasonal trend is observed for the sensitivities of these mixing ratios to formaldehyde emissions. In other words, even though formaldehyde is higher in summer, reducing formaldehyde emissions has a greater impact in winter. In winter, 85 %–90 % of the sensitivity to emissions is attributed to direct formaldehyde emissions. In summer, this contribution is smaller and more variable, ranging from 27 % to 75 % among the receptor areas investigated in this study. Higher relative contributions of secondary formation versus direct emissions are associated with receptors located farther away from heavily urbanized and emission-rich areas. In particular, the relative contribution of biogenic VOC emissions (15 %–43 % in summer) is largest for these receptors. Ethene and other alkenes are the most influential anthropogenic precursors to secondary formaldehyde. Isoprene is the most influential biogenic precursor. Sensitivities of formaldehyde to NOx emissions are generally negative but relatively small in magnitude compared to sensitivities to VOC emissions. The magnitude of anthropogenic emissions of organic compounds other than formaldehyde is found to correlate reasonably well with their influence on population-weighted formaldehyde mixing ratios at the air-basin scale. This correlation does not hold for ambient formaldehyde in smaller urbanized sub-areas. The magnitude of biogenic emissions does not correlate with their influence in either case.

1 Introduction

Formaldehyde is one of the major contributors to increased cancer risk associated with exposure to hazardous air pollutants in the United States (US Environmental Protection Agency; ; ). Based on observations, estimated that the cancer risk associated with exposure to ambient formaldehyde in major US cities was of the order of 50 per million people, with values reaching 150 per million people in the southeast part of the country. In contrast, estimated using a modeling approach that the upper bound for this cancer risk was in the range of 20–40 per million people. Average ambient formaldehyde mixing ratios in US urban areas typically range from 1 to 3 ppb , where 1 ppb = 1 nmol mol−1. Peak mixing ratios can be an order of magnitude larger . In contrast, background mixing ratios are estimated to be in the range of 0.1 to 0.2 ppb .

Formaldehyde is directly emitted by anthropogenic activities that include a number of industrial processes and combustion of fossil fuels in stationary applications and motor vehicles . Formaldehyde is also formed photochemically in the atmosphere through oxidation of other volatile organic compounds . Examples of chemical pathways that can lead to secondary formation of formaldehyde are as follows: addition of OH, ozone, or the nitrate radical to the C=C double bond of alkenes and photolysis of acetaldehyde. In the presence of sunlight, formaldehyde undergoes photolysis following one of two pathways :

$\begin{array}{}\text{(R1)}& \mathrm{HCHO}+h\mathit{\nu }\to {\mathrm{H}}^{\mathrm{•}}+{\mathrm{HCO}}^{\mathrm{•}}\stackrel{\mathrm{2}{\mathrm{O}}_{\mathrm{2}}}{\to }\mathrm{2}{\mathrm{HO}}_{\mathrm{2}}^{\mathrm{•}}+\mathrm{CO},\end{array}$

or

$\begin{array}{}\text{(R2)}& \mathrm{HCHO}+h\mathit{\nu }\to {\mathrm{H}}_{\mathrm{2}}+\mathrm{CO},\end{array}$

where h is Plank's constant and hν is the energy of a photon of frequency ν. In addition to photolysis, removal of formaldehyde occurs through abstraction of a hydrogen atom by OH radicals and to a lesser extent by free chlorine atoms and nitrate radicals. These reactions form formyl radicals and, respectively, water, hydrochloric acid, and nitric acid . For example, with OH,

$\begin{array}{}\text{(R3)}& \mathrm{HCHO}+{\mathrm{OH}}^{\mathrm{•}}\to {\mathrm{H}}_{\mathrm{2}}\mathrm{O}+{\mathrm{HCO}}^{\mathrm{•}}.\end{array}$

As indicated in the reactions presented above, formaldehyde is a source of radicals and therefore affects the formation and removal of other air pollutants such as ozone and secondary organic aerosols . While other carbonyl species undergo photolysis and are also sources of radicals, the photolysis rate coefficient and therefore the potential for radical production are much larger for formaldehyde .

Mitigating exposure to ambient formaldehyde requires a clear understanding of the contributions of primary versus secondary and anthropogenic versus biogenic sources of formaldehyde. Various approaches have been used to estimate the relative importance of direct emissions versus secondary formation of formaldehyde. One approach consists of studying the statistical relationships between observed mixing ratios or concentrations of formaldehyde and species that are predominantly of primary origin (e.g., CO) versus species that are predominantly of secondary origin (e.g., O3). Using this approach, the average contribution of primary emissions to total ambient formaldehyde in Houston, Texas, was estimated to be a third to a half ; similar findings are reported for Mexico City . discuss the limitations and sources of uncertainties inherent to these statistical methods. Using an emission inventory constrained by observations and product yields of formaldehyde from oxidation of some of its precursors, they attributed over 90 % of ambient formaldehyde over the Houston region to secondary formation. Regardless of the method being used, results derived for one location may not be directly applicable to another because several factors, including emissions and meteorology, can influence secondary formation of formaldehyde and of other species. For example, many petrochemical industries are located in Houston and its surroundings. Emissions from this area may be particularly different from those of other large cities.

The contribution of direct emissions versus secondary formation of aldehydes has also been investigated using modeling approaches. used a chemical transport model where species that form formaldehyde are tracked and allocated to the emitted precursor species from which they originated. These investigators estimated the relative contribution of secondary formation to total ambient formaldehyde in the eastern US to be 70 %–90 % in July versus 30 %–90 % in January, with significant spatial variability. They also found that alkenes were often the most significant precursors to formaldehyde, with isoprene contributing 15 %–60 % of secondary formation in certain areas in July. Uncertainties in emission inventories, particularly for alkenes, an incomplete understanding of atmospheric reactions that influence formaldehyde concentrations, and the condensation of chemical mechanisms in air quality models have been proposed as likely sources of uncertainties in model simulations . Additionally, formaldehyde concentrations exhibit sharp spatial gradients at sub-kilometer scales , further complicating modeling efforts, as horizontal grid sizes in model simulations typically range from 4 to 32 km in regional air quality models .

Sensitivity analysis is another modeling approach used to investigate the drivers of air pollution. The aim is to quantify the effects of model parameters such as emission sources on air quality . Most sensitivity analysis techniques can be classified as being either source-oriented or receptor-oriented techniques. Source-oriented techniques, such as the decoupled direct method , require one large-scale simulation for each parameter of interest but allow for quantifying sensitivities of multiple air quality metrics at little additional cost. In contrast, receptor-oriented techniques, such as the adjoint method , require one large-scale simulation for each air quality metric of interest but allow for the efficient calculation of sensitivities to large numbers of model parameters. While there have been numerous analyses of ozone and particulate matter, little attention has been devoted to analysis of formaldehyde mixing ratios. used the path-integral method with sensitivities calculated with the DDM and found that the relative participation of different anthropogenic emission sectors (e.g., point sources and light duty vehicles) in the overall reduction of formaldehyde depended on the order in which these emissions were removed. The path-integral method (Dunker2015) uses sensitivity coefficients to apportion changes in mixing ratios associated with specific emission changes. More recently, used the DDM to apportion US ambient formaldehyde to its precursor emissions. They estimated the relative contribution of secondary formation to be in the range of 60 %–90 %, with alkenes being the most influential precursors.

Our understanding of the relative contributions to formaldehyde exposure of primary versus secondary formation and of biogenic versus anthropogenic emissions is still incomplete. In particular, we have yet to fully characterize how this apportionment varies across sub-areas of an urbanized region. The present study uses a chemical transport model and its adjoint to identify the emission sources and chemical reactions that influence ambient formaldehyde overall for the San Francisco Bay Area and within three urbanized sub-areas. The relative influence of primary versus secondary formation, of anthropogenic versus biogenic emissions, and of different anthropogenic emission sectors is quantified for each urban sub-area and for the entire air basin. The sensitivities of formaldehyde to rate coefficients of over 200 chemical reactions are calculated and ranked. The study is conducted for two month-long simulation periods, in July and December, to study seasonal effects on the analysis. The use of the adjoint approach allows for spatial, temporal, and by-species resolution in the calculation of sensitivities at a reasonable computational cost, such that reaction rate coefficients and emissions of each species at each location and at each hour of the simulations can be treated as distinct model parameters. There are billions of such sensitivities, and the computational cost of this endeavor would be prohibitive if a source-oriented sensitivity analysis technique were used in place of the adjoint method. The approach employed here can be used, for example, to draw “maps of influence”; these identify upwind areas whose emissions have the greatest influence on model responses. To the best of our knowledge, this study is the first to use adjoint sensitivity analysis to identify the emissions and chemical reactions that have the most influence on regional- and local-scale ambient formaldehyde.

2 Methods

## 2.1 Chemical transport modeling, air quality metrics, and sensitivity analysis

Chemical transport models (CTMs) are used to synthesize understanding of atmospheric processes relevant to air pollution, test new scientific hypotheses, estimate pollutant levels where there are no available observations, and estimate past or future changes in air quality due, for example, to emission reductions or climate change. CTMs estimate mixing ratios of airborne pollutants of interest over a user-defined gridded domain and simulation period by numerically solving a system of coupled, and in general nonlinear, partial differential equations that express conservation of mass . Model input parameters include meteorology, emissions of pollutants of interest and their precursors, description of relevant atmospheric chemical reactions, mixing ratios at the beginning of the simulation period (initial conditions), and pollutant levels in air entering the modeling domain through lateral boundaries and from above (boundary conditions).

CTMs estimate space- and time-resolved fields of pollutant mixing ratios but typically do not provide information about which model parameters (e.g., which emission sources and which chemical reactions) contribute to specific air pollution problems. Mathematical and numerical tools that aim at elucidating these relationships have been implemented in CTMs. In particular, sensitivity analysis in air quality models quantifies changes in a specified air quality metric R resulting from variations in model parameters ${\mathit{\alpha }}_{\mathrm{1}},{\mathit{\alpha }}_{\mathrm{2}},\mathrm{\dots },{\mathit{\alpha }}_{N}$ through the calculation of first-order sensitivity coefficients of the form

$\begin{array}{}\text{(1)}& {S}_{{\mathit{\alpha }}_{i}}^{+}=\frac{\partial R}{\partial {\mathit{\alpha }}_{i}},\end{array}$

or, in semi-normalized form,

$\begin{array}{}\text{(2)}& {S}_{{\mathit{\alpha }}_{i}}^{×}=\frac{\partial R}{\partial {\mathit{\alpha }}_{i}}{\mathit{\alpha }}_{i}.\end{array}$

Air quality metrics of interest are calculated from model output mixing ratios. This study uses sensitivity analysis to identify emission sources and chemical reactions that influence metrics related to exposure to ambient formaldehyde. More precisely, the metrics of interest are defined as population-weighted formaldehyde mixing ratios of the form

$\begin{array}{}\text{(3)}& R=\underset{\mathit{\omega }}{\int }\underset{T}{\int }{P}_{\mathit{\omega }}\left({x}_{\mathrm{1}},{x}_{\mathrm{2}}\right)C\left({x}_{\mathrm{1}},{x}_{\mathrm{2}},t\right)\text{d}t\text{d}{x}_{\mathrm{1}}\text{d}{x}_{\mathrm{2}},\end{array}$

where ω is the receptor area of interest, T is the simulation period excluding spin-up, Pω is the population-weighting probability density function over receptor ω, C is the surface mixing ratio of formaldehyde, and (x1,x2) and t are the horizontal position and time coordinates, respectively. Population-weighting functions are based on 2010 US census data. Four receptor areas are investigated (Fig. 1). Three of these receptor areas (Pittsburg–Antioch, San Jose, and the East Bay) are urban areas that have been identified by the Bay Area Air Quality Management District as being heavily impacted by air pollution (Bay Area Air Quality Management District BAAQMD2014). The fourth receptor is defined as the entire San Francisco Bay Area air basin. Each model response R is a real-valued function that depends on the model parameters ${\mathit{\alpha }}_{\mathrm{1}},{\mathit{\alpha }}_{\mathrm{2}},\mathrm{\dots },{\mathit{\alpha }}_{N}$: $R=R\left(\mathit{\alpha }\right)=R\left({\mathit{\alpha }}_{\mathrm{1}},{\mathit{\alpha }}_{\mathrm{2}},\mathrm{\dots },{\mathit{\alpha }}_{N}\right)$. Model parameters are denoted collectively as a vector $\mathit{\alpha }=\left({\mathit{\alpha }}_{\mathrm{1}},{\mathit{\alpha }}_{\mathrm{2}},\mathrm{\dots },{\mathit{\alpha }}_{N}\right)$, where N is the total number of parameters.

Figure 1Modeling domain and receptor areas investigated in this work. The color scale shows population density as of the 2010 US Census. Gray lines indicate major roadways.

Sensitivity analysis is used to calculate and compare changes ΔR in model response resulting from different perturbations in model parameters at any location in the domain and any time in the simulation period. This study focuses on two categories of parameters: emissions and reaction rate coefficients. Changes in parameters are calculated with respect to nominal values, denoted ${\mathit{\alpha }}^{\mathrm{0}}=\left({\mathit{\alpha }}_{\mathrm{1}}^{\mathrm{0}},{\mathit{\alpha }}_{\mathrm{2}}^{\mathrm{0}},\mathrm{\dots },{\mathit{\alpha }}_{N}^{\mathrm{0}}\right)$. Nominal values are meant to describe actual conditions of the reference year. Assuming that R is differentiable at α0, the change ΔR in model response corresponding to a change in model parameters from nominal values α0 to arbitrary values α is given by

$\begin{array}{}\text{(4)}& \mathrm{\Delta }R& =R\left(\mathit{\alpha }\right)-R\left({\mathit{\alpha }}^{\mathrm{0}}\right)\text{(5)}& & =\sum _{i=\mathrm{1}}^{N}{\frac{\partial R}{\partial {\mathit{\alpha }}_{i}}}_{\left({\mathit{\alpha }}^{\mathrm{0}}\right)}\mathrm{\Delta }{\mathit{\alpha }}_{i}+{h}_{{\mathit{\alpha }}^{\mathrm{0}}}\left(\mathit{\alpha }\right)\end{array}$

where $\mathrm{\Delta }{\mathit{\alpha }}_{i}=\left({\mathit{\alpha }}_{i}-{\mathit{\alpha }}_{i}^{\mathrm{0}}\right)$ is the change in parameter αi and ${h}_{{\mathit{\alpha }}^{\mathrm{0}}}$ is a real-valued function (a priori a different function for each set of nominal values α0) with domain in N and that has the property

$\begin{array}{}\text{(6)}& \underset{\begin{array}{c}\mathrm{\Delta }{\mathit{\alpha }}_{i}\to \mathrm{0}\\ \mathrm{i}=\left\{\mathrm{1},\mathrm{2},\mathrm{\dots },\mathrm{N}\right\}\end{array}}{lim}{h}_{{\mathit{\alpha }}^{\mathrm{0}}}\left(\mathit{\alpha }\right)=\mathrm{0}.\end{array}$

Provided that R is sufficiently differentiable at α0, the value of ${h}_{{\mathit{\alpha }}^{\mathrm{0}}}\left(\mathit{\alpha }\right)$ can be calculated from the knowledge of higher-order derivatives of the form ${\partial }^{\mathit{\beta }}R/\left(\partial {\mathit{\alpha }}_{\mathrm{1}}^{{\mathit{\beta }}_{\mathrm{1}}}\partial {\mathit{\alpha }}_{\mathrm{2}}^{{\mathit{\beta }}_{\mathrm{2}}}\mathrm{\dots }\partial {\mathit{\alpha }}_{N}^{{\mathit{\beta }}_{N}}\right)$ evaluated at α0, where $\mathit{\beta }={\mathit{\beta }}_{\mathrm{1}}+{\mathit{\beta }}_{\mathrm{2}}+\mathrm{\dots }+{\mathit{\beta }}_{N}\mathit{⩾}\mathrm{2}$. This study focuses on first-order sensitivities and makes the assumption that model responses vary linearly with model parameters, in which case Eq. (5) simplifies to

$\begin{array}{}\text{(7)}& \mathrm{\Delta }R=\sum _{i=\mathrm{1}}^{N}{\frac{\partial R}{\partial {\mathit{\alpha }}_{i}}}_{\left({\mathit{\alpha }}^{\mathrm{0}}\right)}\mathrm{\Delta }{\mathit{\alpha }}_{i}=\sum _{i=\mathrm{1}}^{N}{S}_{{\mathit{\alpha }}_{i}}^{+}\mathrm{\Delta }{\mathit{\alpha }}_{i}\end{array}$

The value of ${h}_{{\mathit{\alpha }}^{\mathrm{0}}}\left(\mathit{\alpha }\right)$ measures the error made when using Eq. (7) in lieu of Eq. (5). As indicated by Eq. (6), the linear approximation holds exactly for infinitesimally small perturbations but not necessarily for larger perturbations. Previous modeling studies investigated the range of perturbations over which Eq. (7) holds reasonably well. used Green's function approach in a box model to calculate first- and second-order sensitivities of ozone to initial concentrations of precursors and to reaction rate coefficients. They found that using first-order sensitivities to estimate changes in modeled ozone resulting from perturbing model parameters yielded acceptable results for perturbations of at most 25 % of the baseline values. found good agreement (r2 0.99) between first-order sensitivities of ozone to domain-wide NO and VOC emissions calculated using the DDM on the one hand and a central finite-difference approximation with ±10 % perturbations on the other hand. Additionally, Hakami et al. found that modeled ozone exhibited more nonlinear behavior when (i) perturbing NOx emissions compared to perturbing VOC emissions; (ii) ozone concentrations were high; and (iii) ozone chemistry was in the transition regime between VOC- and NOx-controlled conditions. found that changes in modeled ozone resulting from 10 % perturbations in domain-wide NOx and VOC emissions were accurately estimated using a Taylor series truncated after the first-order term. Overall, these studies suggest that ozone responses vary nearly linearly with emissions of precursor species for perturbations up to about 10 % of the nominal value and up to 25 % in some cases. Since formaldehyde is a ubiquitous intermediate species in atmospheric oxidation cycles, we hypothesize that formaldehyde responses also vary approximately linearly with precursor emissions for such perturbations. Using the sensitivity results presented in this study for large (> 25 %) perturbations that are likely to result in nonlinear behavior should be avoided. In particular, nonlinear behavior is likely to be significant when first-order sensitivities are large and in regions where ozone chemistry is near the transition regime between VOC- and NOx-controlled conditions .

Both the first-order sensitivity ${S}_{{\mathit{\alpha }}_{i}}^{+}$ and its semi-normalized counterpart ${S}_{{\mathit{\alpha }}_{i}}^{×}$ are useful in quantifying the sensitivity of air quality metrics to model parameters, but these quantities differ in their interpretation. ${S}_{{\mathit{\alpha }}_{i}}^{+}$ measures the change ΔR in metric R resulting from increasing the value of parameter αi by an amount Δαi, whereas ${S}_{{\mathit{\alpha }}_{i}}^{×}$ measures the change ΔR in metric R resulting from scaling parameter αi using a multiplicative factor (e.g., ξ=0.1 for a 10 % increase in αi):

$\begin{array}{}\text{(8)}& & \mathrm{\Delta }R\approx {S}_{{\mathit{\alpha }}_{i}}^{+}\mathrm{\Delta }{\mathit{\alpha }}_{i},\text{(9)}& & \mathrm{\Delta }R\approx {S}_{{\mathit{\alpha }}_{i}}^{×}\mathit{\xi }.\end{array}$

Semi-normalized sensitivities are zero whenever the underlying parameter is itself zero. In contrast, this statement does not hold in the general case for ${S}_{{\mathit{\alpha }}_{i}}^{+}$. All model parameters of interest in this study vary in time and space, and the values of these parameters at each location and at each time step are regarded as distinct parameters. Sensitivities calculated here are therefore local quantities. For example, the semi-normalized sensitivity of a model response R to the formaldehyde emission rate EHCHO(x,t) at location x and time t is ${S}_{{E}_{\mathrm{HCHO}}\left(x,t\right)}^{×}$. Semi-normalized sensitivities may be grouped together. For example, the semi-normalized sensitivity to VOC emissions is the sum of the semi-normalized sensitivities to emissions of each individual species that is used to represent VOCs in the model.

## 2.2 Computational approach

The Community Multiscale Air Quality model (CMAQ; Byun and Schere2006) is used at 4 km horizontal resolution to simulate atmospheric formaldehyde mixing ratios in the San Francisco Bay Area during two month-long simulation periods: summer (July) and winter (December). The modeling domain consists of 38 × 52 grid cells in the horizontal directions and 35 vertical layers, extending to about 15 km above the ground level. Atmospheric photochemistry is simulated with the SAPRC99 chemical mechanism (Carter2000). SAPRC99 comprises 72 model species and 211 chemical reactions (listed in Carter2000, Appendix A). Some model species represent individual chemical species such as formaldehyde, ozone, ethene, and isoprene. Other model species lump multiple organic species together based on similarities in molecular structures and/or reactivity. Lumped model species include ALK1 through ALK5 (“alkanes and other non-aromatic compounds that react only with OH”; Carter2000), OLE1 and OLE2 (alkenes other than ethene), ARO1 and ARO2 (aromatics), and TERP (terpenes). The numbers in lumped model species names for a given molecular structure (e.g., OLE1 versus OLE2) separate species according to their reactivity with the OH radical, with a higher number meaning higher reactivity (Carter2000).

Hourly gridded emissions at 4 km horizontal resolution were provided by the Bay Area Air Quality Management District (BAAQMD2009) for the year 2005. The emissions data account for weekday–weekend and seasonal variations in anthropogenic and biogenic emissions. Meteorological conditions were modeled at 4 km resolution using the MM5 model (BAAQMD2009). A study by suggests that year 2000 meteorological conditions used here represent typical conditions for multi-week simulation periods. Initial and boundary conditions for 34 out of the 72 model species described in the SAPRC99 chemical mechanism were derived from the global-scale MOZART-4 modeling system . Boundary conditions for these species vary in the vertical direction and also vary hourly throughout the simulation. The remaining 38 SAPRC99 model species could not be mapped to any MOZART-4 species. Initial and boundary conditions for these species were set to zero. A model spin-up period of 2 d is used to reduce the influence of uncertainties in initial conditions on this analysis.

CMAQ is used to calculate model responses of interest for nominal values of the model parameters. The adjoint of CMAQ is used to calculate the local sensitivity coefficients ${S}_{{\mathit{\alpha }}_{i}}^{+}$ and ${S}_{{\mathit{\alpha }}_{i}}^{×}$. The use of adjoint methods to calculate first-order sensitivity coefficients has been described elsewhere . As previously explained, the adjoint sensitivity technique is particularly attractive when the number of model parameters of interest is large compared to the number of model responses of interest. Eight model responses are investigated here (2 seasons × 4 receptor areas), but there are billions of model parameters: emissions and reaction rate coefficients at each location and each hourly time step. The computer code used in this study is based on the work by with modifications described in . Additionally, the CB-IV chemical mechanism used in was replaced by the SAPRC99 chemical mechanism. The chemical solver and adjoint sensitivity routines for SAPRC99 were created with the Kinetic Pre-Processor version 2.2.3. The original adjoint code was further adapted so that it could be run using multiple processors in a parallel computing environment. This study expands on previous work of , where emission sources that influenced mixing ratios of two primary air pollutants (benzene and diesel black carbon) were mapped. Benzene and diesel black carbon were treated as non-reactive, whereas in the present study formaldehyde is reactive and is also formed photochemically from other VOC precursors.

3 Results and discussion

## 3.1 Model evaluation

The forward model is the component of the model that estimates ambient mixing ratios of model species for a given set of input parameters. In other words, the forward model relates an emission inventory and a set of meteorological conditions to the resulting air quality in the modeling domain, accounting for atmospheric chemistry and other relevant processes. In contrast, the adjoint of the forward model is the component of the model used to calculate first-order sensitivities, i.e., rates of change of air quality when input parameters such as emissions are perturbed. Once the forward model has been run for a given set of input parameters, the adjoint model can be run once for each air quality metric of interest without rerunning the forward model. The accuracy of these two components is evaluated separately.

The forward model is evaluated by comparing modeled mixing ratios of a number of species to ground-based observations from routine monitoring stations (Figs. S1–S16 and Table S1). Hourly measurements of O3 were available at 20 sites; hourly measurements of NO, NO2, and CO were available at 13 sites; and daily observations (24 h averages) of formaldehyde, acetaldehyde, and acetone were available at 3 sites. The model captures the magnitude and diurnal variations of summertime O3 mixing ratios at a majority of the sites. The corresponding mean bias ranges from −4.4 to 10.5 ppb and is within ±5 ppb at 12 out of the 20 locations where observations are available. The corresponding root-mean-square error (RMSE) ranges from 3.5 to 12.7 ppb. Comparing average daily profiles of O3 + NO2 mixing ratios yields mean bias in the range of −2.37.0 ppb and RMSE in the range 2.3–7.6 ppb. Model performance is weaker for summer-season NO and NO2, and NOx=NO+NO2 when compared to results for O3 and O3+NO2. NOx is sometimes dramatically overestimated. In contrast, CO is consistently underestimated by the model. This issue may be a consequence of the size of model grid cells: in the model, local emissions are instantaneously diluted within a 4 km × 4 km grid cell. This artificial dilution has more influence on pollutants that are directly emitted, such as CO and NO, than on secondary pollutants such as ozone.

Model performance is less satisfactory in winter for ozone, oxides of nitrogen, and carbon monoxide (Table S1), which suggests that wintertime atmospheric chemistry is not as well represented by the model. However, results presented in the following sections indicate that atmospheric chemistry has little influence on formaldehyde concentrations in winter. In this case, formaldehyde behaves mostly as a non-reactive tracer species. The performance of the model to predict tracer species concentrations in both summer and winter has previously been evaluated by . Here, the model evaluation includes comparisons of modeled versus observed daily averaged mixing ratios of formaldehyde, acetaldehyde, and acetone (Fig. S14–S16). Mean modeled mixing ratios of formaldehyde are within ±26 % of observed values at all sites except Arkansas Street in San Francisco in July (relative difference of 34 %). There is no consistent trend (overestimation versus underestimation) in these differences. In contrast, mean mixing ratios of acetaldehyde are consistently underestimated by the model but are within 34 % of observed values at all sites except Chapel Way in Fremont in July (relative difference of −55 %). The model consistently underestimates acetone mixing ratios by 56 % to 81 %. A possible explanation for the underestimations of acetone is that emissions may be significantly underestimated .

The adjoint of the forward model is evaluated by comparing the change ΔR in model response estimated using adjoint sensitivities versus values calculated using a brute-force approach, which involves running the forward model with perturbed parameters and comparing the outputs with those from the baseline simulation. Since the calculation of each brute-force estimate requires a separate model run, this evaluation is conducted for episodes of 1 d in July and December, in each case the first day of the corresponding month-long simulation period. The model responses used for evaluation are domain-wide population-weighted mixing ratios of formaldehyde and ozone. The ozone responses are defined similarly to the formaldehyde responses (i.e., Eq. 3). Additive 24 h perturbations to emissions of NO, formaldehyde, ethene, and isoprene at several locations within the modeling domain are used for the evaluation. The magnitude of the perturbations for a given species is set to be roughly 10 % of the maximum emission rate observed at any location of the modeling domain and throughout the simulation period. Adjoint and brute-force estimates of ΔR are in good agreement for perturbations in formaldehyde, ethene, and isoprene emissions (Fig. 2), with a corresponding coefficient of determination r2=0.95 or above. The agreement is less satisfactory for perturbations in NO emissions, and including the corresponding data points in the analysis lowers the coefficient of determination to the 0.86–0.93 range. Discrepancies between adjoint-based and brute-force estimates are generally larger when ΔR is large.

Figure 2Relative change in model response calculated from first-order adjoint sensitivities versus using a brute-force approach. The model responses used for the evaluation are domain-wide population-weighted mixing ratios of (a) formaldehyde and (b) ozone. Simulation periods used for this evaluation are episodes of 1 d in July (red filled symbols) and December (blue unfilled symbols). Each point corresponds to the effect of an additive 24 h perturbation in emissions applied to a single grid cell. Insets zoom in on a section of the corresponding plots.

The good agreement between brute-force and adjoint sensitivities of formaldehyde to emissions of precursors other than NO supports the hypothesis that formaldehyde responses vary nearly linearly with these emissions for perturbations on the order of 10 % of baseline values; the brute-force approach includes nonlinearities, whereas first-order adjoint sensitivities do not. This hypothesis is likely to hold for somewhat larger perturbations, since the perturbations used for creating Fig. 2 were in most cases larger than 10 % of baseline values. A likely explanation for the larger mismatch for the NO shown in Fig. 2 is that nonlinearities in the relationship between model responses and emissions are stronger for NOx than for emissions of other species.

## 3.2 Sensitivity apportionment

This section identifies species whose emissions most strongly influence formaldehyde model responses. All the sensitivity values presented and discussed in this section are “overall sensitivities”, obtained by summing individual semi-normalized sensitivities over the modeling domain and the month-long duration of the simulation. The next section discusses the influence of the location of sources relative to receptors on model responses. It is worth noting that summing non-normalized sensitivities over the modeling domain would measure the air quality effects of adding a fixed amount of emissions at all locations, including over the ocean, where there are few anthropogenic sources. The summing procedure is only applied to semi-normalized sensitivities, and the overall sensitivity of a model response to emissions of species i measures the effects on that response of uniformly scaling up the existing emissions of this species. Sensitivities to initial conditions of all model species were also calculated but were found to be negligible compared to emission sensitivities, indicating that the spin-up and simulation periods are long enough such that initial conditions have negligible influence on model responses of interest here.

Table 1Model responses in parts per billion. Model responses are population-weighted monthly average formaldehyde mixing ratios for the San Francisco Bay Area (air basin) and selected sub-areas.

The magnitude of population-weighted formaldehyde mixing ratios (i.e., the model responses; Table 1) is in the 1.3–1.5 ppb range for all receptor areas in July, decreasing by 13 % to 61 % when going from July to December. San Jose sees the smallest seasonal decrease, and Pittsburg–Antioch has the largest decrease. In contrast, mixing ratios of many formaldehyde precursors are higher in December than in July (Tables S2–S5).

Figure 3Semi-normalized sensitivity of the air-basin-wide population-weighted formaldehyde mixing ratio to emissions in (a) July and (b) December. An apportionment by precursor species of the sensitivity to overall VOC emissions is shown in the pie chart insets.

Figure 3a and b show the emission sensitivity of the air-basin-wide response for summer (July) and winter (December), respectively. Organic species other than methane are grouped together as VOCs, and likewise NO and NO2 are grouped together as NOx. Of the remaining species, those whose relative contribution is smaller than 5 % are lumped together as “others” in the gray portions of each subplot, being lumped separately for positive and negative sensitivities. The sensitivity to VOC emissions is further separated into the contributions of different source sectors: biogenic emissions, on-road vehicle emissions, and other anthropogenic emissions. Apportionment of the overall sensitivity to VOC emissions is shown in the pie chart insets. Figures similar to Fig. 3 but for the other receptor areas investigated here can be found in the Supplement (Figs. S17–S19). For a more detailed apportionment of the emission sensitivity of each model response, see Table S6.

In both summer and winter, the emission sensitivity of the air-basin-wide response is dominated by anthropogenic VOC emissions. The contribution from biogenic emissions is about a third of that due to anthropogenic emissions in summer; the biogenic contribution is negligible in winter. The air-basin-wide response (R) decreases by a factor of 1.4 between summer and winter, while its sensitivity to emissions ($\partial R/\partial E$) almost doubles. In other words, even though the population-weighted formaldehyde mixing ratio is larger in summer, scaling anthropogenic and/or biogenic emissions uniformly year-round would result in a larger mixing ratio change in winter. In practice, the effects of scaling emissions up or down should be limited to a range or perturbations where the relationship between emissions and formaldehyde mixing ratios is nearly linear. Although the boundaries of this linear range are not fully characterized, sensitivity coefficients reported here should not be applied to the case where all emissions are removed from the simulations (i.e., a 100 % perturbation). If primary emissions of formaldehyde and precursors are removed, formaldehyde mixing ratios should approach background values of a few tens of parts per billion in both summer and winter. Changes in model responses would in this case be larger in summer than in winter, which contradicts our findings. The use of first-order sensitivities to characterize the behavior of this nonlinear system is nevertheless applicable in identifying emissions that should be prioritized in control strategies that aim to reduce ambient formaldehyde. To summarize, controlling anthropogenic emissions is less effective, to a first order, at reducing mixing ratios when mixing ratios are higher (i.e., in summer). A similar seasonal trend in the emission sensitivity is observed for the other receptor areas except for Pittsburg–Antioch, where overall emission sensitivity varies little by season. The increase in emission sensitivity between summer and winter is driven by the 3-fold increase in the sensitivity to formaldehyde emissions. In contrast, sensitivity to emissions of most other species is smaller in winter than in summer. In fact, almost 90 % of the sensitivity in December is attributed to formaldehyde emissions, indicating that secondary formation of formaldehyde is negligible in winter. Secondary formation is very significant in summer, when the influence of emissions of organic precursors is comparable to the influence of direct formaldehyde emissions. The influence of organic precursors is dominated by alkenes, of which isoprene is the largest contributor. The individual contributions of non-alkene species are all smaller than 5 %. OLE2 is the only organic precursor other than formaldehyde that consistently yields higher sensitivities in winter than in summer, meaning that controlling these reactive alkenes has a greater impact on formaldehyde during winter.

Figure 4Apportionment by model species (pie charts) of the VOC emissions semi-normalized sensitivity of the formaldehyde response in the San Francisco Bay Area air basin as a whole and within three of its urbanized sub-areas in (a) July and (b) December.

The relative contribution of direct formaldehyde emissions to the VOC emission sensitivity in summer varies significantly across the receptor areas investigated here (Fig. 4): 46 % for the air basin as a whole, 60 % for the East Bay, 46 % for San Jose, and 24 % for Pittsburg–Antioch, located further inland and downwind. Results presented below in Sect. 3.5 indicate that the zones of emissions influence are larger and extend farther upwind from the receptor area for precursors compared to formaldehyde itself. As a consequence, the relative importance of precursor emissions versus directly emitted formaldehyde increases for receptor areas located further downwind of emission-rich areas. This trend is attributed to the increased opportunity for secondary formation associated with the longer transport time required for distant emissions to reach receptors. For example, in July, the Pittsburg–Antioch receptor is relatively far ( 50 km) downwind of many high-emission areas, including San Francisco and the East Bay. The influence of direct formaldehyde emissions on air quality at this receptor is small compared to emissions of other species. In contrast, the East Bay is  10 km downwind of the San Francisco Peninsula, and the relative influence of direct formaldehyde emissions is considerably higher for this receptor. The San Jose and air-basin-wide responses represent intermediate cases. This finding applies to both anthropogenic and biogenic emissions. In particular, the relative contribution of biogenic emissions to overall VOC emission sensitivity is largest for Pittsburg–Antioch (39 %) and smallest for the East Bay (12 %). On-road vehicle emissions of VOCs account for roughly 40 % of the overall sensitivity to anthropogenic VOC emissions for all model responses investigated here (Table S6). The sensitivity of formaldehyde responses to NOx emissions is negative in both seasons for all receptor areas, with a magnitude that ranges from 10 % to 28 % of the corresponding overall sensitivity to emissions. The negative sign of this sensitivity means that decreasing NOx emissions would increase formaldehyde concentrations and vice versa. Causes of this negative sign are discussed in Sect. 3.4.

## 3.3 Drivers of seasonal changes in sensitivity

Seasonal changes in the emission sensitivities discussed above cannot be explained by changes in emissions alone: emissions of formaldehyde and its precursors vary little between summer and winter in this study. A notable exception is biogenic emissions of isoprene and terpenes, which are about 8 and 2 times higher in July than in December, respectively. Terpenes contribute little to the overall sensitivity to emissions: less than 6 % in both seasons for all model responses. Isoprene is responsible for 11 % to 37 % of the overall emission sensitivity for the July responses but has little influence in December.

Table 2Monthly average production and loss rates of formaldehyde by different mechanisms. Values are summed over the modeling domain.

The following two key mechanisms are possible contributors to the seasonal change of sensitivity to formaldehyde emissions. First, the mixing height in the modeling domain is generally lower in winter than in summer. Formaldehyde emissions are therefore mixed within a smaller vertical column in winter, resulting in higher near-surface mixing ratios. This phenomenon, in turn, makes surface-level model responses more sensitive to formaldehyde emissions. A previous study by suggests that the mixing height has a great influence on the emission sensitivity of surface mixing ratios for primary pollutants; it is likely that it would have a comparable influence on the sensitivity of formaldehyde mixing ratios to formaldehyde emissions. The second mechanism is seasonal variations of formaldehyde lifetime in the atmosphere. Table 2 shows month-long averages of rates of formaldehyde removal, shown separately for the July and December simulations and separately for the following removal processes: photolysis, chemical reactions other than photolysis, and dry deposition. Removal by wet deposition is neglected, as precipitation events are rare in the simulations. Photochemical removal decreases by 1 to 2 orders of magnitude between July and December, and removal by dry deposition decreases by a factor of 2.5 between these two periods. Dry deposition fluxes are estimated as the product of surface-layer concentrations and deposition velocities. While deposition fluxes depend on atmospheric concentrations of relevant pollutants, there are also important seasonal effects on deposition velocities. Here, the decrease in dry deposition between seasons results from the combined effects of reduced formaldehyde concentrations (Table 1) and, in some areas, reduced vertical transport and mixing within the surface layer in winter compared to summer (Fig. S20).

Reduced losses by dry deposition and by photochemical reactions both contribute to a longer atmospheric lifetime of formaldehyde in winter. Table 2 shows total production and loss of formaldehyde in the modeling domain. The difference between these values is approximately equal to net transport out of the modeling domain. In winter, a larger fraction of the formaldehyde produced is transported out of the modeling domain rather than being lost due to chemical reactions or surface deposition. This trend is consistent with formaldehyde having a longer atmospheric lifetime in winter compared to summer. This extended lifetime may in turn increase the sensitivity of model responses to directly emitted formaldehyde. The influence on sensitivities of photochemical removal of formaldehyde is investigated by running model simulations in which formaldehyde is treated as a tracer species: chemistry is turned off and there is neither secondary formation nor photochemical removal of formaldehyde. In these simulations, removal of formaldehyde is due only to dry deposition and transport out of the modeling domain. The atmospheric lifetime of formaldehyde therefore increases. When the residence time of a particular molecule increases, on average these molecules stay in the modeling domain for longer periods of time and their concentrations increase. In other words, sensitivities of model responses to formaldehyde emissions are expected to be larger when chemistry is turned off. The goals of these additional simulations are (i) to characterize the influence of photochemical removal on sensitivities of model responses to formaldehyde emissions and (ii) to investigate whether the large ( 200 %) increase in these sensitivities from July to December can be explained by the reduction of photochemical removal between these two seasons. With chemistry turned off in the model, formaldehyde emission sensitivities increased by 9 % to 28 % in July but by only a few percent in December (Table 3). Although the relative increase in sensitivity is larger in July for all receptor areas, the actual magnitude of this change is similar in July and December for two of these receptors (air basin and San Jose) because sensitivities to formaldehyde emissions are larger in December. Overall, seasonal variations in photochemical removal rates of formaldehyde do not explain the large ( 200 %) increase in formaldehyde emission sensitivity of model responses from July to December. These results suggest that reduced transport and mixing are the main drivers of the enhanced influence on air quality of directly emitted formaldehyde in December compared to July.

Table 3Increase (ppb) in the formaldehyde emissions semi-normalized sensitivity of model responses resulting from turning off chemistry in the model. The numbers in parentheses indicate the magnitude of the increase relative to the original value.

## 3.4 Sensitivities to reaction rate coefficients

Comparing sensitivities to reaction rate coefficients gives insights into which species and chemical reactions contribute the most to photochemical production and removal of formaldehyde. Rate sensitivities for each model response and for each of the 211 chemical reactions represented in the SAPRC99 chemical mechanism were calculated and ranked by magnitude. Table 4 shows the 16 reactions with the largest sensitivities for the air-basin-wide response, shown separately for July and December. A negative rate sensitivity indicates that increasing the reaction rate coefficient would decrease the formaldehyde response and vice versa. It should be noted that ordering reactions by rate sensitivity versus ordering them by reaction rate yields different rankings. Rate sensitivities account for both the rate and the formaldehyde yield of the reaction. In other words, they account for how often the reaction occurs and how many molecules of formaldehyde are eventually formed or destroyed every time the reaction occurs. The rate coefficient sensitivities presented in Table 4 do not indicate, however, which reactions produce the most formaldehyde. Rather, they indicate how much formaldehyde concentrations would be affected if rate coefficients were to change. Only semi-normalized sensitivities are considered here. Non-normalized sensitivities cannot be ranked in the same manner because they have different dimensions depending on whether the reaction is uni-, bi-, or termolecular.

Table 4Reaction rates (averaged over the modeling domain and simulation period) and semi-normalized sensitivities ${S}_{k}^{×}$ (summed over the modeling domain and simulation period) of the air-basin-wide response to reaction rate coefficients for the 16 reactions for which ${S}_{k}^{×}$ has the largest magnitude. Reactions are ranked separately for each season.

* O2 may be omitted as a reactant. PAN is peroxyacetyl nitrate.

We first discuss the summertime results. In this season, attack by the OH radical (Reaction R3) is the highest-ranking reaction among the five photochemical pathways for formaldehyde removal represented in the model. While the average formaldehyde photolysis rate is only 45 % larger for the channel that yields stable products (Reaction R2) than for the channel that yields radicals (Reaction R1), the rate sensitivity corresponding to Reaction R2 is about twice as large in magnitude as for Reaction R1. The reason for this difference is that the HO2 radicals generated as products of Reaction R1 can themselves participate in secondary formation of formaldehyde, thus partially offsetting the initial loss of formaldehyde. The net effect of the reaction is nevertheless a loss of formaldehyde, as indicated by the negative sign of the corresponding sensitivity. The OH radical can remove formaldehyde from the atmosphere, but it can also oxidize other organic species to eventually form more formaldehyde. The latter effect prevails over the former, as indicated by the positive rate sensitivity of the OH-forming reaction involving H2O and O(1D).

The highest-ranking reaction that yields a positive sensitivity is the reaction of methane with the OH radical. This reaction leads to the formation of methyl peroxy radical, which can itself lead to formaldehyde. Other high-ranking pathways to secondary formaldehyde are attack of methanol by the OH radical and the photolysis of methacrolein, glyoxal, and methyl vinyl ketone. In Sect. 3.2, alkenes – more specifically isoprene, ethene, OLE1, and OLE2 – were identified as the main precursors to secondary formaldehyde in July. Yet none of the reactions that directly involve any of these species appear in Table 4 for this season (the highest-ranking one among these reactions – the reaction of ethene with OH – ranks 18th). These two seemingly contradictory observations suggest that these alkenes are reactive enough that the factor that limits formation of formaldehyde is not the rate at which the reactions occur (low sensitivity to reaction rates) but rather the availability of the alkenes as reactants (high sensitivity to emissions). For species such as glyoxal and methacrolein, emissions are small but secondary formation is large. These species do contribute significantly to secondary formation of formaldehyde, yet the corresponding emission sensitivities are small because these species are formed chemically within rather than being directly emitted into the atmosphere; they can also be transported from upwind regions.

As discussed previously, secondary formation of formaldehyde is far less significant in December than in July. This trend is also reflected in the high-ranking rate sensitivities, most of which are smaller in magnitude in December.

## 3.5 Spatial distributions of sensitivities

The apportionment analysis presented in Sect. 3.2 identifies the chemical species whose emissions have a large influence on formaldehyde mixing ratios within the study domain. To mitigate air pollution problems effectively, the influence of emissions as a function of location and time must be understood. The influence of a source depends not only on its strength but also on parameters such as location relative to receptor, meteorological conditions, and the chemical state of the atmosphere. This section investigates how the spatial distribution of emission sources affects their influence on air pollution within different receptor areas. We also consider whether the magnitude of an emission source correlates with the influence of that source.

Figure 5Semi-normalized sensitivity of model responses to individual emission sources versus the magnitude of these sources. Values are shown separately for the July (red circles) and December (blue triangles) simulation periods along with the corresponding linear regression affine lines (solid red line for July and dashed blue line for December). The corresponding coefficients of determination, r2, are indicated on the top of each subplot. In this figure, an “individual emission source” corresponds to emissions of a model species in a single model grid cell. Values are averaged over the simulation period.

Figure 5 shows the relationship between the magnitude of emissions of selected species and their influence on the air-basin-wide (left column) and East Bay (right column) responses, shown separately for July and December. Figure 5 also shows, for each model response, each model species, and each simulation period, the corresponding linear regression line and coefficient of determination, r2. The formaldehyde emission sensitivity of the air-basin-wide model response correlates only moderately well with the magnitude of these emissions, both in July (r2=0.46) and December (r2=0.59). The largest formaldehyde emission source in the modeling domain is located near Fairfield. As this source is comparatively far from and often downwind of the most populated areas of the air basin, it contributes little to the air-basin-wide response and thus drives down the corresponding correlation between emissions and sensitivity. Removing this point from the analysis yields better correlations (r2=0.70 and 0.76 in July and December, respectively). There is no correlation between the magnitude of formaldehyde emission sources and the corresponding sensitivities for any of the more local model responses investigated in this study: Pittsburg–Antioch, San Jose, and the East Bay. For example, r2=0.18 and 0.12 in July and December, respectively, for the East Bay (Fig. 5b). As discussed further below, the location of emission sources relative to the receptor becomes crucial in determining their effects on air quality within localized receptor areas.

The sensitivity of model responses to anthropogenic emissions of organic precursors correlates more strongly with the magnitude of these emissions than was the case for formaldehyde emissions. This finding holds for the air-basin-wide response and for all the local responses under investigation, with stronger correlations for the air-basin-wide response than for the local responses. Examples are shown in Fig. 5c (air-basin-wide response) and Fig. 5d (East Bay response) for ethene. The strength of the correlations between the NOx emission sensitivity of formaldehyde model responses and the magnitude of these emissions (Fig. 5e and f) is similar to that observed for anthropogenic emissions of organic precursors. The correlation between sensitivity to biogenic emissions and the magnitude of these emissions is weak in most cases, as illustrated by Fig. 5g and h for isoprene. There are large biogenic emission source regions which do not contribute significantly to local and regional formaldehyde exposure, while other smaller sources, located upwind of densely populated areas, have a much greater influence.

The spatial distributions of anthropogenic emissions are similar for most model species, and the strength of the correlations discussed here gives an indication of how strongly the influence of emission sources depends on their location relative to the receptor area. A weak correlation suggests a high dependence on location. Results discussed above suggest that the location of emission sources has a greater influence for directly emitted formaldehyde than for anthropogenic precursors to secondary formation. This conclusion is further supported by the spatial distributions of sensitivity to emissions of formaldehyde and other precursors, as discussed below. Non-normalized sensitivity to emissions at location x measures the change in model response resulting from adding new emissions at that location, where the amount of pollutant thus added is independent of the magnitude of existing emissions. In other words, it measures how much a certain location can affect a specified model response. A spatially invariant non-normalized sensitivity means that the influence that emissions have on air quality does not depend on their location, only on their magnitude. If non-normalized sensitivity is not uniform, then the magnitude of its gradient indicates how much the location of emissions matters in determining their impact on air quality.

Figure 6Non-normalized and semi-normalized sensitivity of the San Jose formaldehyde response in July to emissions of formaldehyde and ethene. At each location, the sensitivity is summed over the duration of the simulation period. The San Jose receptor area is outlined in green.

Figure 6a and b show the non-normalized sensitivity for July of the San Jose response to formaldehyde and ethene emissions, respectively. The sensitivity to formaldehyde emissions exhibits strong spatial gradients, indicating that the influence of direct formaldehyde emissions strongly depends on location. In contrast, the sensitivity to ethene emissions exhibits weaker spatial gradients over a larger region, especially along the prevailing direction of surface winds (roughly northwest to southeast in the case of San Jose). The influence of ethene emissions is not as dependent on the location of the emissions compared to formaldehyde. Similar features are observed in the spatial distributions of non-normalized sensitivities for the other summer-season responses.

The contribution of an existing emission source to a model response is measured by the corresponding semi-normalized sensitivity, which is calculated as the product of the potential influence of the location of this source and its magnitude. Mathematically,

$\begin{array}{}\text{(10)}& {S}_{E}^{×}={S}_{E}^{+}×E,\end{array}$

where the three terms are, respectively, semi-normalized sensitivity to emissions, non-normalized sensitivity to emissions, and magnitude of emissions. Figure 6c and d show the semi-normalized sensitivity for July of the San Jose response to formaldehyde and ethene emissions, respectively. The influence of formaldehyde emissions is dominated by local sources, while the influence of ethene emissions is more affected by sources located further upwind of the receptor area.

Figure 7Semi-normalized sensitivity S× of the formaldehyde model response at different receptors to emissions of various species or groups of species. In each grid cell, sensitivity values are summed over the duration of the simulation period. The boundary of the receptor area under consideration in each case is indicated by a green or pink line.

Similar observations can be made for the other local responses investigated in this study. For example, Fig. 7a and b show the spatial distributions of semi-normalized sensitivity of the Pittsburg–Antioch response in July to emissions of formaldehyde and VOC precursors, respectively. Precursor emissions originating from San Francisco have a strong influence on formaldehyde in Pittsburg–Antioch even though these emissions are located more than 50 km away from the receptor. Precursor emissions originating from the East Bay and from within the receptor itself, as well as from regions along this path, also have a significant influence on this response. In contrast, formaldehyde emissions originating from San Francisco and the East Bay do not contribute as much as local formaldehyde emissions to the sensitivity of the Pittsburg–Antioch response. The same trend is observed in December (Fig. 7e and f)

Compared to summer, the relative contribution of sources located within or near the receptor area is larger in winter for both formaldehyde and ethene emissions. This same seasonal pattern was also noted for benzene and diesel black carbon and was attributed to differences in meteorology between the two seasons. In July, near-surface winds are predominantly from the northwest or west. In December, near-surface winds are weaker and do not have a clear prevailing direction. The more quiescent meteorological conditions associated with the winter season increase the relative importance of nearby sources. Relevant upwind areas of influence extend west of the receptor areas in July and to a lesser extent east of the receptor areas in December. The relative contribution of local versus upwind formaldehyde emissions increases significantly for the Pittsburg–Antioch receptor between July and December (44 % to 77 %), decreases for San Jose (73 % to 62 %), and is 74 % in both seasons for the East Bay. As an additional example, Fig. 7c and g show the formaldehyde emissions that influence formaldehyde in the East Bay in July and December, respectively. Emissions originating from San Francisco have more influence on formaldehyde in the East Bay in July compared to December, and East Bay emissions become more influential in December. Figure 7d shows the semi-normalized sensitivity of the air-basin-wide July response to NOx emissions, which is negative in and around the most densely populated areas but is positive for highway emissions in the north and for ship emissions over the Pacific Ocean. These positive NOx sensitivities are, however, small in magnitude (see also Fig. 5e).

4 Conclusions

The adjoint of a chemical transport model is used to identify emission sources and chemical reactions that have the most influence on formaldehyde, separately for summer- and winter-season conditions. Air quality metrics investigated in this work are population-weighted mixing ratios in the San Francisco Bay Area as a whole and within three of its urbanized sub-areas. The results of the sensitivity analysis are used to quantify the relative importance of primary versus secondary formation, anthropogenic versus biogenic sources, and of different emission sectors on formaldehyde pollution. Secondary formation has little influence on formaldehyde in winter: 85 %–90 % of the sensitivity of formaldehyde to emissions is attributed to directly emitted formaldehyde, and 62 %–77 % of this contribution is attributed to local sources. In summer, the relative contribution of direct formaldehyde emissions is more variable and ranges between 27 % and 75 %. The remainder of the formaldehyde sensitivity is attributed to emissions of precursors, mainly isoprene, ethene, and other alkenes. Areas of influence extend farther upwind for precursors than for direct emissions of formaldehyde. The relative influence of secondary formation versus direct emissions on formaldehyde is largest for receptors located farther away from heavily urbanized and emission-rich areas. The relative importance of biogenic emissions is also largest for these receptors. This pattern is attributed to the increased opportunity for secondary formation associated with the longer transport time required for polluted air masses to reach distant receptors. Increasing NOx emissions decreases formaldehyde pollution at receptors (and, conversely, decreasing NOx emissions increases formaldehyde pollution), but the magnitude of this effect is relatively small. To summarize, winter-season formaldehyde is greatly influenced by direct local emissions of formaldehyde. In contrast, summer-season formaldehyde is influenced by both direct local emissions of formaldehyde and regional emissions of other organic precursors. The magnitude of anthropogenic emissions of species other than formaldehyde correlates reasonably well with their influence on regional formaldehyde exposure. They do not, however, correlate as much with their influence on pollution within specific urban sub-areas.

This study uses first-order sensitivity coefficients to characterize how formaldehyde at different receptors responds to changes in emissions and reaction rate coefficients. The use of first-order sensitivity coefficients to estimate changes in responses of nonlinear systems (Eqs. 8 and 9) while ignoring higher-order terms of the Taylor series (Eq. 5) is subject to restrictions. The changes in parameters must be relatively small so that the system response remains approximately linear. The first-order sensitivities presented here should therefore not be used to estimate the effects on ambient formaldehyde of very large emission changes such as 100 % reductions. Perturbing NO emissions was found to yield particularly nonlinear behavior. Additionally, first-order sensitivities should be recalculated if emissions or other model parameters were to change significantly from the baseline values used here. This study nevertheless provides a basis for identifying the emissions and chemical reactions that have the most influence on ambient formaldehyde.

Code and data availability
Code and data availability.

The model source code is included in the Supplement. Data sets used in this study will be provided upon request addressed by email to the corresponding author.

Supplement
Supplement.

Author contributions
Author contributions.

LAJB conducted the modeling work, data analysis, and paper writing under the supervision and with the advising of RAH and NJB.

Competing interests
Competing interests.

The authors declare that they have no conflict of interest.

Acknowledgements
Acknowledgements.

We thank Philip Martien, Yiqin Jia, and Cuong Tran from BAAQMD for their support and assistance. We also thank the two referees for their insightful comments.

Financial support
Financial support.

Financial support for this work was provided, in part, by the Bay Area Air Quality Management District (BAAQMD; grant no. 2013.082); the Carl W. Johnson Foundation; and the Director, Office of Science, Office of Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division of the US Department of Energy under contract no. DE-AC02-05CH11231.

Review statement
Review statement.

This paper was edited by Rolf Sander and reviewed by David Parrish and one anonymous referee.

References

BAAQMD (Bay Area Air Quality Management District): Toxics Modeling to Support the Community Air Risk Evaluation (CARE) Program, Research and Modeling Section Publication No. 200906-002-TX, San Francisco, CA, June 2009. a, b

BAAQMD (Bay Area Air Quality Management District): Improving Air Quality & Health in Bay Area Communities, Community Air Risk Evaluation Program Retrospective & Path Forward (2004–2013), San Francisco, CA, April 2014. a

Ban-Weiss, G. A., McLaughlin, J. P., Harley, R. A., Kean, A. J., Grosjean, E., and Grosjean, D.: Carbonyl and Nitrogen Dioxide Emissions From Gasoline-and Diesel-Powered Motor Vehicles, Environ. Sci. Technol., 42, 3944–3950, https://doi.org/10.1021/es8002487, 2008. a

Bastien, L. A. J., McDonald, B. C., Brown, N. J., and Harley, R. A.: High-Resolution Mapping of Sources Contributing to Urban Air Pollution Using Adjoint Sensitivity Analysis: Benzene and Diesel Black Carbon, Environ. Sci. Technol., 49, 7276–7284, https://doi.org/10.1021/acs.est.5b00686, 2015. a, b, c, d, e, f, g, h

Byun, D. and Schere, K. L.: Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System, Appl. Mech. Rev., 59, 51–77, https://doi.org/10.1115/1.2128636, 2006. a

Carter, W. P. L.: Documentation of the SAPRC-99 Chemical Mechanism for VOC reactivity assessment, vol. 1, Documentation Text, Final Report to California Air Resources Board. Contract 92-329 and Contract 95-308, Report no. 00-AP-RT17-001-FR, 2000. a, b, c, d

Cohan, D. S. and Napelenok, S. L.: Air Quality Response Modeling for Decision Support, Atmosphere, 2, 407–425, https://doi.org/10.3390/atmos2030407, 2011. a

Cohan, D. S., Hakami, A., Hu, Y., and Russell, A. G.: Nonlinear Response of Ozone to Emissions: Source Apportionment and Sensitivity Analysis, Environ. Sci. Technol., 39, 6739–6748, https://doi.org/10.1021/es048664m, 2005. a, b

Daescu, D. N., Sandu, A., and Carmichael, G. R.: Direct and adjoint sensitivity analysis of chemical kinetic systems with KPP: II–numerical validation and applications, Atmos. Environ., 37, 5097–5114, https://doi.org/10.1016/j.atmosenv.2003.08.020, 2003. a

Dasgupta, P. K., Li, J., Zhang, G., Luke, W. T., McClenny, W. A., Stutz, J., and Fried, A.: Summertime Ambient Formaldehyde in Five U.S. Metropolitan Areas: Nashville, Atlanta, Houston, Philadelphia, and Tampa, Environ. Sci. Technol., 39, 4767–4783, https://doi.org/10.1021/es048327d, 2005. a

Dunker, A. M.: Efficient calculation of sensitivity coefficients for complex atmospheric models, Atmos. Environ., 15, 1155–1161, https://doi.org/10.1016/0004-6981(81)90305-X, 1981. a

Dunker, A. M.: The decoupled direct method for calculating sensitivity coefficients in chemical kinetics, J. Chem. Phys., 81, 2385–2393, https://doi.org/10.1063/1.447938, 1984. a

Dunker, A. M.: Path-integral method for the source apportionment of photochemical pollutants, Geosci. Model Dev., 8, 1763–1773, https://doi.org/10.5194/gmd-8-1763-2015, 2015. a

Dunker, A. M., Yarwood, G., Ortmann, J. P., and Wilson, G. M.: The Decoupled Direct Method for Sensitivity Analysis in a Three-Dimensional Air Quality Model – Implementation, Accuracy, and Efficiency, Environ. Sci. Technol., 36, 2965–2976, https://doi.org/10.1021/es0112691, 2002. a

Dunker, A. M., Koo, B., and Yarwood, G.: Source Apportionment of the Anthropogenic Increment of Ozone, Formaldehyde, and Nitrogen Dioxide by the Path-Integral Method in a 3D Model, Environ. Sci. Technol., 49, 6751–6759, https://doi.org/10.1021/acs.est.5b00467, 2015. a

Emmons, L. K., Walters, S., Hess, P. G., Lamarque, J.-F., Pfister, G. G., Fillmore, D., Granier, C., Guenther, A., Kinnison, D., Laepple, T., Orlando, J., Tie, X., Tyndall, G., Wiedinmyer, C., Baughcum, S. L., and Kloster, S.: Description and evaluation of the Model for Ozone and Related chemical Tracers, version 4 (MOZART-4), Geosci. Model Dev., 3, 43–67, https://doi.org/10.5194/gmd-3-43-2010, 2010. a

Finlayson-Pitts, B. J. and Pitts Jr., J. N.: Chemistry of the Upper and Lower Atmosphere: Theory, Experiments, and Applications, Academic Press, SanDiego, CA, 2000. a, b, c

Fountoukis, C., Koraj, D., Denier van der Gon, H. A. C., Charalampidis, P. E., Pilinis, C., and Pandis, S. N.: Impact of grid resolution on the predicted fine PM by a regional 3-D chemical transport model, Atmos. Environ., 68, 24–32, https://doi.org/10.1016/j.atmosenv.2012.11.008, 2013. a

Friedfeld, S., Fraser, M., Ensor, K., Tribble, S., Rehle, D., Leleux, D., and Tittel, F.: Statistical analysis of primary and secondary atmospheric formaldehyde, Atmos. Environ., 36, 4767–4775, https://doi.org/10.1016/S1352-2310(02)00558-7, 2002. a

Garcia, A. R., Volkamer, R., Molina, L. T., Molina, M. J., Samuelson, J., Mellqvist, J., Galle, B., Herndon, S. C., and Kolb, C. E.: Separation of emitted and photochemical formaldehyde in Mexico City using a statistical analysis and a new pair of gas-phase tracers, Atmos. Chem. Phys., 6, 4545–4557, https://doi.org/10.5194/acp-6-4545-2006, 2006. a

Grosjean, D., Grosjean, E., and Gertler, A. W.: On-Road Emissions of Carbonyls from Light-Duty and Heavy-Duty Vehicles, Environ. Sci. Technol., 35, 45–53, https://doi.org/10.1021/es001326a, 2001. a

Hakami, A., Odman, M. T., and Russell, A. G.: High-Order, Direct Sensitivity Analysis of Multidimensional Air Quality Models, Environ. Sci. Technol.,37, 2442–2452, https://doi.org/10.1021/es020677h, 2003. a, b, c

Hakami, A., Odman, M. T., and Russell, A. G.: Nonlinearity in atmospheric response: A direct sensitivity analysis approach, J. Geophys. Res.-Atmos., 109, D15303, https://doi.org/10.1029/2003JD004502, 2004. a, b

Hakami, A., Henze, D. K., Seinfeld, J. H., Singh, K., Sandu, A., Kim, S., Byun, D., and Li, Q.: The Adjoint of CMAQ, Environ. Sci. Technol., 41, 7807–7817, https://doi.org/10.1021/es070944p, 2007. a, b, c, d

Henze, D. K., Hakami, A., and Seinfeld, J. H.: Development of the adjoint of GEOS-Chem, Atmos. Chem. Phys., 7, 2413-2433, https://doi.org/10.5194/acp-7-2413-2007, 2007. a, b

Jin, L., Tonse, S., Cohan, D. S., Mao, X., Harley, R. A., and Brown, N. J.: Sensitivity Analysis of Ozone Formation and Transport for a Central California Air Pollution Episode, Environ. Sci. Technol., 42, 3683–3689, https://doi.org/10.1021/es072069d, 2008. a

Kheirbek, I., Johnson, S., Ross, Z., Pezeshki, G., Ito, K., Eisl, H., and Matte, T.: Spatial variability in levels of benzene, formaldehyde, and total benzene, toluene, ethylbenzene and xylenes in New York City: a land-use regression study, Environ. Health, 11, 51, https://doi.org/10.1186/1476-069X-11-51, 2012. a

Koo, B., Dunker, A. M., and Yarwood, G.: Implementing the Decoupled Direct Method for Sensitivity Analysis in a Particulate Matter Air Quality Model, Environ. Sci. Technol., 41, 2847–2854, https://doi.org/10.1021/es0619962, 2007. a

Luecken, D. J., Hutzell, W. T., Strum, M. L., and Pouliot, G. A.: Regional sources of atmospheric formaldehyde and acetaldehyde, and implications for atmospheric modeling, Atmos. Environ., 47, 477–490, https://doi.org/10.1016/j.atmosenv.2011.10.005, 2012. a, b, c

Luecken, D. J., Napelenok, S. L., Strum, M., Scheffe, R., and Phillips, S.: Sensitivity of Ambient Atmospheric Formaldehyde and Ozone to Precursor Species and Source Types Across the United States, Environ. Sci. Technol.,52, 4668–4675, https://doi.org/10.1021/acs.est.7b05509, 2018. a, b

Martien, P. T., Harley, R. A., and Cacuci, D. G.: Adjoint Sensitivity Analysis for a Three-Dimensional Photochemical Model: Implementation and Method Comparison, Environ. Sci. Technol., 40, 2663–2670, https://doi.org/10.1021/es0510257, 2006. a, b

McCarthy, M. C., Hafner, H. R., and Montzka, S. A.: Background Concentrations of 18 Air Toxics for North America, JAPCA J. Air Waste Ma., 56, 3–11,https://doi.org/10.1080/10473289.2006.10464436, 2006. a

McDonald, B. C., de Gouw, J. A., Gilman, J. B., Jathar, S. H., Akherati, A., Cappa, C. D., Jimenez, J. L., Lee-Taylor, J., Hayes, P. L., McKeen, S. A., Cui, Y. Y., Kim, S.-W., A., H. R., Frost, G. J., Roberts, J. M., Ryerson, T. B., and Trainer, M.: Volatile chemical products emerging as largest petrochemical source of urban organic emissions, Science, 359, 760–764, https://doi.org/10.1126/science.aaq0524, 2018. a

Napelenok, S. L., Cohan, D. S., Hu, Y., and Russell, A. G.: Decoupled direct 3D sensitivity analysis for particulate matter (DDM-3D/PM), Atmos. Environ., 40, 6112–6121, https://doi.org/10.1016/j.atmosenv.2006.05.039, 2006. a

Parrish, D. D., Ryerson, T. B., Mellqvist, J., Johansson, J., Fried, A., Richter, D., Walega, J. G., Washenfelder, R. A., de Gouw, J. A., Peischl, J., Aikin, K. C., McKeen, S. A., Frost, G. J., Fehsenfeld, F. C., and Herndon, S. C.: Primary and secondary sources of formaldehyde in urban atmospheres: Houston Texas region, Atmos. Chem. Phys., 12, 3273–3288, https://doi.org/10.5194/acp-12-3273-2012, 2012. a

Rappenglück, B., Dasgupta, P. K., Leuchner, M., Li, Q., and Luke, W.: Formaldehyde and its relation to CO, PAN, and SO2 in the Houston-Galveston airshed, Atmos. Chem. Phys., 10, 2413–2424, https://doi.org/10.5194/acp-10-2413-2010, 2010. a

Sandu, A. and Sander, R.: Technical note: Simulating chemical systems in Fortran90 and Matlab with the Kinetic PreProcessor KPP-2.1, Atmos. Chem. Phys., 6, 187–195, https://doi.org/10.5194/acp-6-187-2006, 2006. a

Sandu, A., Daescu, D. N., and Carmichael, G. R.: Direct and adjoint sensitivity analysis of chemical kinetic systems with KPP: Part I–theory and software tools, Atmos. Environ., 37, 5083–5096, https://doi.org/10.1016/j.atmosenv.2003.08.019, 2003. a

Sandu, A., Daescu, D. N., Carmichael, G. R., and Chai, T.: Adjoint sensitivity analysis of regional air quality models, J. Comput. Phys., 204, 222–252, https://doi.org/10.1016/j.jcp.2004.10.011, 2005. a, b

Seinfeld, J. H. and Pandis, S. N.: Atmospheric Chemistry and Physics: From Air Pollution to Climate Change, 3rd edn., John Wiley & Sons, Inc., Hoboken, New Jersey, 2016. a, b, c, d, e, f

Strum, M. and Scheffe, R.: National review of ambient air toxics observations, JAPCA J. Air Waste Ma., 66, 120–133, https://doi.org/10.1080/10962247.2015.1076538, 2016. a, b, c

USEPA (U.S. Environmental Protection Agency): Our Nation's Air, Status and Trends through 2010, Report number EPA-454/R-12-001, U.S. Environmental Protection Agency, Office of Air Quality Planning and Standards, Research Triangle Park, North Carolina, 2012. a

Vuilleumier, L., Harley, R. A., and Brown, N. J.: First- and Second-Order Sensitivity Analysis of a Photochemically Reactive System (a Green's Function Approach), Environ. Sci. Technol., 31, 1206–1217, https://doi.org/10.1021/es960727g, 1997. a, b

Xiaoyan, W., Huixiang, W., and Shaoli, W.: Ambient formaldehyde and its contributing factor to ozone and OH radical in a rural area, Atmos. Environ., 44, 2074–2078, https://doi.org/10.1016/j.atmosenv.2010.03.023, 2010.  a

Yang, Y.-J., Wilkinson, J. G., and Russell, A. G.: Fast, Direct Sensitivity Analysis of Multidimensional Photochemical Models, Environ. Sci. Technol., 31, 2859–2868, https://doi.org/10.1021/es970117w, 1997. a

Zheng, J., Zhang, R., Garzón, J. P., Huertas, M. E., Levy, M., Ma, Y., Torres-Jardón, R., Ruiz-Suárez, L. G., Russell, L., Takahama, S., Tan, H., Li, G., and Molina, L. T.: Measurements of formaldehyde at the U.S.–Mexico border during the Cal-Mex 2010 air quality study, Atmos. Environ., 70, 513–520, https://doi.org/10.1016/j.atmosenv.2012.09.041, 2013. a

Zhu, L., Jacob, D. J., Keutsch, F. N., Mickley, L. J., Scheffe, R., Strum, M., González Abad, G., Chance, K., Yang, K., Rappenglück, B., Millet, D. B., Baasandorj, M., Jaeglé, L., and Shah, V.: Formaldehyde (HCHO) As a Hazardous Air Pollutant: Mapping Surface Air Concentrations from Satellite and Inferring Cancer Risks in the United States, Environ. Sci. Technol., 51, 5650–5657, https://doi.org/10.1021/acs.est.7b01356, 2017. a