Evaluation of near-surface atmospheric composition reanalysis data in the metropolis of São Paulo, Brazil
Marina S. Paiva A , Marco A. Franco B and Luciana V. Rizzo
A
B
Abstract
Air quality monitoring stations are unequally distributed worldwide despite the relevance of the air pollution impacts. After validation, atmospheric composition reanalysis can fill information gaps in locations where air quality observations are absent. Reanalysis datasets are based on global emission inventories, often overlooking regional characteristics. This underscores the need for regional and local evaluation studies, which remain scarce in South America. This study presents the first evaluation of atmospheric composition reanalysis products in the Metropolitan Area of São Paulo (MASP), Brazil. Two reanalysis products were evaluated: MERRA-2 (Modern-Era Retrospective Analysis for Research and Applications) and CAMS (Copernicus Atmospheric Monitoring Service). Four air pollutants were assessed, considering monthly concentration time series between 2015 and 2019: ozone (O3), nitrogen dioxide (NO2), inhalable (PM10) and fine particulate matter (PM2.5). Near-surface reanalysis was compared with air quality monitoring stations in the MASP. CAMS correctly reproduced the seasonal and interannual variability of concentrations, with significant Pearson correlation coefficients in the range of 0.75–0.89. However, CAMS overestimated O3, PM2.5 and PM10 by 136, 50 and 16% respectively. By contrast, MERRA-2 failed to reproduce the main features of air pollutant seasonal variability in the MASP, especially for PM2.5 and PM10. Based on these findings, we conclude that CAMS adequately represents near-surface air quality conditions in the MASP, although bias corrections are required. This means that the CAMS reanalysis data may be used to obtain information about air quality conditions in cities where local monitoring is absent, at least in Brazilian cities near the MASP. Further studies are necessary to investigate the adequacy of CAMS in other Brazilian regions.
Keywords: air pollution, air quality, Brazil, CAMS, data assimilation, global weather models, MERRA-2, ozone, particulate matter, São Paulo.
1.Introduction
Air pollution is one of the main environmental problems in urban areas, with negative effects on human health (Cohen et al. 2017; Molina 2021). Air pollution monitoring is crucial to quantify impacts and to formulate mitigation strategies. However, the distribution of air quality monitoring stations is unequal worldwide. For example, Europe has ~12 monitoring stations per 10,000 km2, whereas in South America, this number is as small as 0.3 (Carvalho 2016). There are also inequalities within countries. In Brazil, most monitoring stations are located in the south-east, whereas air quality data is almost absent in the northern Brazilian regions (Vormittag et al. 2021). Data scarcity hinders accurately diagnosing air pollution conditions, and its associations with environmental and socioeconomic variables and health effects.
Given that in situ air quality data coverage is insufficient in some regions, alternative data sources need to be pursued. One option is to use satellite-based remote sensing, in which air pollutant concentrations are inferred from orbital measurements of spectral radiance (e.g. Goldberg et al. 2021; Ranjan et al. 2021). However, there are limitations in retrieving air quality conditions from satellite products. Low temporal resolution, cloud contamination and measurement biases associated with surface albedo and meteorological conditions are some of the challenges reported in previous studies (Ranjan et al. 2021). Moreover, satellite products typically provide data on the amount of air pollutants in the whole atmospheric column, which may not represent the surface air quality conditions, especially in areas influenced by the long-range transport of pollutants above the boundary layer (Damascena et al. 2021).
Atmospheric composition reanalysis can be an alternative to complement in situ air quality observations. Reanalysis products combine short-range weather forecasts and observations through data assimilation methods (Lahoz and Schneider 2014). Satellite remote sensing data are assimilated in the reanalysis process, providing globally complete and physically consistent estimates of weather variables and air pollutant concentrations at different atmospheric levels, including the near-surface level. Atmospheric composition reanalysis products are provided by operational centres of weather and climate forecast like the US NCEP (United States National Centres for Environmental Prediction) and ECMWF (European Centre for Medium-Range Weather Forecasts), using coupled GCM-CTM (general circulation model–chemical transport model) systems. The aforementioned operational centres deliver the atmospheric composition reanalysis MERRA-2 (Modern-Era Retrospective Analysis for Research and Applications; Gelaro et al. 2017) and CAMS (Copernicus Atmospheric Monitoring Service; Inness et al. 2019) respectively.
Before using reanalysis datasets to represent regional and local air quality conditions, it is crucial to assess their accuracy against independent in situ observations. Global evaluations of atmospheric composition reanalysis have shown that their accuracy varies with season and latitude zone (Buchard et al. 2017; Wagner et al. 2021; Jin et al. 2022; Lacima et al. 2022). Reanalysis estimates of atmospheric composition rely on anthropogenic and biogenic global emission inventories, which typically do not account for regional features at the country level, especially in the Southern Hemisphere (Castesana et al. 2022; Paton-Walsh et al. 2022). Additional sources of uncertainties include possible regional biases in meteorology and data availability for assimilation. Underlying aerosol models and optical assumptions may also lead to biases because the assimilation of aerosol data in reanalysis products is based solely on aerosol optical depth (AOD) observations, from which the mixing ratios of all particulate components are deconvoluted (Gelaro et al. 2017; Inness et al. 2019). The quite coarse spatial resolution of atmospheric composition reanalysis datasets (50–100 km) poses additional challenges to colocation with ground-based observations in air quality monitoring stations. As such, regional and local evaluation studies are necessary to characterise biases of reanalysis datasets against ground-based data.
Previous regional evaluation studies have focused on particulate matter (PM) reanalysis data in North America, Asia and Europe (e.g. Buchard et al. 2016; Ali et al. 2022; Lacima et al. 2022). In particular, Lacima et al. (2022) extended the pollutant analysis and evaluated the long-term performance of the CAMS and MERRA-2 global reanalyses in capturing surface air pollution over Europe from 2003 to 2020, focusing on O3, NO₂, CO, SO2, and inhalable (PM10) and fine particulate matter (PM2.5). Their findings highlight significant biases in both datasets, with CAMS generally outperforming MERRA-2, yet still requiring bias correction methods for improved applicability in environmental studies. To date, assessments of reanalysis products in South American countries, although existing, are few, and typically restricted to weather variables (Dufek et al. 2008; Albuquerque de Almeida et al. 2018), AOD (Júnior et al. 2025) and air pollutants (Casallas et al. 2022, 2024; Li et al. 2024), and these are mainly on the north-western part of the South American continent. Therefore, validation studies for air pollutant reanalysis data are still very scarce for this continent.
The current study aims to assess the accuracy of two atmospheric composition reanalysis datasets, MERRA-2 and CAMS, against ground-based air quality observations in the Brazilian megacity of São Paulo. This megacity was chosen as a starting point to validate reanalysis datasets in Brazil because of its extensive air quality monitoring network and its relevant atmospheric pollution issues (de Fatima Andrade et al. 2017). The vehicle fleet in Brazilian cities typically runs on biofuel blends, with singular impacts on atmospheric chemistry and the production of secondary pollutants (Brito et al. 2015, 2018). Distinct emission patterns and chemical signatures may affect the performance of reanalysis products in Brazilian cities. The accuracy of MERRA-2 and CAMS reanalysis datasets were accessed for ozone (O3), nitrogen dioxide (NO2), inhalable particulate matter (PM10), and fine particulate matter (PM2.5), delivering specific calibration factors. We also assessed the performance of the reanalysis products in reproducing seasonal cycles and monthly variability, and hypothesised the reasons behind the observed biases.
2.Methods
2.1. Study area
The Metropolitan Area of São Paulo (MASP), the largest megacity in Latin America, covers ~8100 km2, enclosing 39 municipalities, and hosts a population of 21.9 million inhabitants (Instituto Brasileiro de Geografia e Estatística 2023). MASP is located in the state of Sao Paulo, 60 km from the coast. With a demographic density of ~2710 inhabitants km–2, MASP exemplifies the complexities faced by urban regions experiencing rapid expansion, which strains natural resources and causes deterioration of living conditions. Vehicles are the predominant source of air pollutants in the MASP, powered by a mixture of fossil and biofuels (de Fatima Andrade et al. 2017). According to the Sao Paulo state environmental agency, light and heavy-duty vehicles are responsible for 60% of NOx (nitrogen oxide radicals), 71% of hydrocarbons, 40% of PM10 and 37% of PM2.5 emissions in the MASP (Companhia Ambiental do Estado de São Paulo 2024). The widespread use of biofuels results in particular atmospheric chemistry conditions in the MASP, favouring the formation of secondary pollutants like ozone and secondary organic aerosols (Salvo and Geiger 2014; Vara-Vela et al. 2016). Secondary aerosols comprise 25% of PM10 and 51% of PM2.5 in the MASP (Companhia Ambiental do Estado de São Paulo 2024). Other relevant emission sources include industrial (40% of NOx and 10% of PM10), soil dust (25% of PM10) and biomass burning (7% of PM2.5) according to Companhia Ambiental do Estado de São Paulo (2024). The climate in the region is subtropical, marked by a wet season from December to February and a dry season from June to August. Air quality is monitored in the MASP by the Sao Paulo State Environmental Agency, Companhia Ambiental do Estado de São Paulo (CETESB).
2.2. Ground-based data
Eight ground-based stations from the CETESB network were selected to represent the average air quality monitoring conditions in the MASP (Table 1). The criteria for choosing the stations included data coverage during the study period (2015–19) and the spatial representativeness of the stations (Companhia Ambiental do Estado de São Paulo 2016), which refers to the region for which a point measurement is representative, showing a consistent temporal evolution. Priority was given to stations with urban (>4 km), neighbourhood (0.5–4 km) and medium (101–500 m) spatial representativeness, avoiding stations strongly affected by local emissions. Additionally, the selected stations were strategically distributed across the metropolitan region to ensure the representation of diverse areas. Daily and monthly concentration averages were calculated considering data from eight monitoring stations, representing the average conditions in the MASP. Supplementary Fig. S1 shows the variability of concentrations within the eight stations. The period from 2015 to 2019 was selected based on the following criteria: (i) it is relatively recent, being representative of current atmospheric conditions and atmospheric pollutant emission patterns; (ii) it excludes the years affected by the COVID-19 pandemic, which exhibited atypical emission patterns in certain months.
Stations | O3 | NO2 | PM2.5 | PM10 | |
---|---|---|---|---|---|
Ibirapuera | Urban | Urban | Urban | – | |
Pq Dom Pedro II | Neighbourhood | Neighbourhood | – | Neighbourhood | |
São Caetano | Medium | Medium | Medium | Medium | |
Itaquera | Urban | – | – | – | |
IPEN | Urban | Urban | Urban | – | |
Nossa Senhora do Ó | Neighbourhood | – | – | Medium | |
Interlagos | Urban | Urban | – | Neighbourhood | |
Pico do Jaraguá | Urban | Urban | Urban | – |
2.3. CAMS reanalysis data
The CAMS global reanalysis (EAC4) provides reanalysis datasets for atmospheric composition, including aerosols and reactive trace gases. These datasets are produced by the ECMWF. Combining numerical weather prediction, aerosol chemistry models and satellite-based observations, CAMS uses the four-dimensional variational (4D-VAR) data assimilation technique within the CY42R1 version of the ECMWF Integrated Forecasting System (IFS) to deliver complete and physically consistent information on aerosol and trace gas concentrations (Inness et al. 2019). Atmospheric data are available across 60 hybrid sigma and pressure levels, extending up to 0.1 hPa, interpolated to 25 pressure levels, 10 potential temperature levels and 1 potential vorticity level. This study used the CAMS reanalysis data on O3, NO2, PM10 and PM2.5 concentrations near the surface (2 m), with a spatial resolution of 0.75 × 0.75° and a temporal resolution of 3 h (Table 2) from 2015 to 2019. The data were aggregated as daily and monthly averages of air pollutant concentrations. Six grid points intersecting the metropolitan area were selected, labelled C1–6 in Fig. 1: C1 (−23.25, −47.25°); C2 (−23.25, −46.50°); C3 (−23.25, −45.75°); C4 (−24.00, −45.75°); C5 (−24.00, −46.50°); C6 (−24.00, −45.75°).
Location of the study area in Brazil showing the São Paulo state as an insert (left). MASP is depicted on the right, showing the municipal divisions and the coastline. Selected air quality monitoring stations are marked as red stars, CAMS grid centroids in purple, and MERRA-2 grid centroids in yellow. All CAMS grid points are over land areas, except for points C6 and M6, which are over the Atlantic Ocean. Point C5 is in the Baixada Santista area. The points that showed the best comparison against in situ data are circled in red.

CAMS uses an updated version of the MACCity inventory for anthropogenic emissions of chemical species (Granier et al. 2011) and the Global Fire Assimilation System (GFAS 1.4) for wildfire and biomass burning emissions. The MACCity inventory covers the period 1960–2010, and it has been updated for subsequent years based on the Intergovernmental Panel on Climate Change Representative Concentration Pathways (IPCC-RCP 8.5) scenario, which corresponds to the ‘business as usual’ scenario. In CAMS, the MACCity inventory provides anthropogenic emissions for the following species: CO, NO, SO2, black carbon, organic carbon and anthropogenic secondary aerosols (Inness et al. 2019). Although the total anthropogenic emissions may be similar in local and global emission inventories, the literature points out large discrepancies in the distribution of pollutants among different sectors like transport, industry and agriculture emissions (Castesana et al. 2022; Paton-Walsh et al. 2022). Dust and sea salt aerosol emissions are simulated online, relying on meteorological conditions (European Centre for Medium-Range Weather Forecasts 2024). The chemical mechanism in the IFS-COMPO includes 123 tracers for reactive gases and 16 tracers for various aerosol species and size ranges, 157 gas-phase reactions, 3 heterogeneous reactions and 2 aqueous phase reactions, including the production of secondary organic and inorganic aerosols. The following processes are simulated, affecting the mass mixing ratio (kg kg−1) of each tracer: transport (advection and turbulent diffusion), emission, dry and wet deposition, chemical and microphysical processes (European Centre for Medium-Range Weather Forecasts 2024).
The IFS aerosol module (AER) includes the following prognostic species: sea salt (SS), dust, black carbon (BC), organic matter (OM), sulfate (SU), nitrate (NI), ammonium (AM) and secondary organic aerosols (SOAs). Sea salt, dust and nitrate are represented in up to three size bins, from 0.03 to 20 µm. Black carbon and OM are divided into hydrophobic and hydrophilic components. SOA is divided into biogenic (having isoprene and terpenes as gaseous precursors) and anthropogenic (having aromatics as gaseous precursors). The concentrations of PM2.5 and PM10 are assessed using the following equations (European Centre for Medium-Range Weather Forecasts 2024):
where ρ represents air density (kg m−3); DUSTk, SSk and NIk refer to soil dust, sea sal, and nitrate components in different size fractions. All aerosol prognostic species are represented as dry mass mixing ratios (kg kg−1). From the different aerosol components, the model calculates the AOD, using prescribed optical properties for each aerosol species and size range, and integrates them in the atmospheric column, assuming that aerosols are externally mixed.
To obtain improved estimates of trace gases and aerosol gridded concentrations, CAMS uses a data assimilation system, minimising a cost function that measures the differences between the model fields and the corresponding observations, resulting in the best possible estimate by adjusting the model initial conditions. CAMS assimilates various atmospheric composition satellite observations (refer to European Centre for Medium-Range Weather Forecasts 2024 for the complete list) for the variables CO, O3, NO2, SO2 and AOD. The data assimilation for species like O3 is rather simple because the observations can be directly compared with a variable included in the model. However, in the case of aerosols, AOD observations do not provide sufficient information to constrain each of the individual aerosol species and bins, so the total aerosol mass mixing ratio, defined as the sum of all aerosol species, is used as the control variable in the assimilation process (European Centre for Medium-Range Weather Forecasts 2024).
2.4. MERRA-2 reanalysis data
The Modern-Era Retrospective Analysis for Research and Applications, version 2, MERRA-2, is NASA’s latest atmospheric reanalysis of the modern satellite era, produced by the Global Modeling and Assimilation Office (GMAO) (Gelaro et al. 2017). MERRA-2 includes aerosol data assimilation, creating a multidecadal reanalysis where both aerosol and meteorological observations are integrated within a global data assimilation system. MERRA-2 utilises the Goddard Earth Observing System (GEOS 5.12.4) atmospheric data assimilation system, which comprises the GEOS atmospheric model and the Gridpoint Statistical Interpolation (GSI) analysis scheme. The model features a finite-volume dynamical core with cubed-sphere horizontal discretisation at ~0.5 × 0.625° resolution (Table 2) and 72 hybrid-eta levels up to 0.01 hPa. The analysis is performed on a latitude–longitude grid with the same spatial resolution using a three-dimensional variational assimilation algorithm (3DVAR) and a 6-h update cycle (Gelaro et al. 2017). Temporal resolution ranges from 1 to 3 h, depending on the product. In this study, MERRA-2 data were aggregated into daily and monthly averages of air pollutant concentrations. Six grid points intersecting the metropolitan area were selected, identified M1 through M6 in Fig. 1: M1 (−23.000, −46.875°); M2 (−23.000, −46.250°); M3 (−23.500, −46.875°); M4 (−23.500, −46.250°); M5 (−24.000, −46.875°); M6 (−24.000, −46.250°).
For anthropogenic emissions of PM, MERRA-2 uses the AeroCom Phase II HCA0 v1, including the sectors: residential, biofuel, industry, power, land and inland waterway transport. The AeroCom Phase II dataset varies annually in the period 1979–2006. Unlike CAMS, MERRA-2 has not updated emissions, repeating the 2006 emissions for the subsequent years. Biomass burning emissions are obtained from the Quick-Fire Emissions Dataset (QFED v2.4-r6), based on daily satellite retrievals. Dust and SS aerosols have wind speed-dependent emissions (Randles et al. 2017). The aerosol module, Goddard Chemistry Aerosol Radiation and Transport (GOCART), considers sources, sinks and chemistry of the following aerosol species: dust, sulfate, SS, BC and organic carbon (OC). Loss processes for aerosols include dry and wet deposition, transport and convective scavenging. The carbonaceous aerosols are divided into hydrophobic and hydrophilic components. All aerosol species are treated in bulk, except SS and dust, which contain five non-interacting size bins each (Randles et al. 2017). Compared with CAMS, the MERRA-2 aerosol chemistry module is less sophisticated because it does not include nitrate aerosols, gas phase photochemistry and anthropogenic SOAs.
Surface O3 concentrations (M2I3NVCHM product) and aerosol components (M2T1NXAER product) from the MERRA-2 reanalysis were utilised in this study. NO2 concentrations were not available in the MERRA-2 product. The concentrations of PM2.5 and PM10 were calculated as follows (Buchard et al. 2017):
where ρ represents air density (kg m−3); DUST refers to the soil dust component in different size fractions; SS represents the sea salt component in different size fractions; BC and OC stand for black carbon and organic carbon as the hydrophilic and hydrophobic fractions; SO4 is the sulfate ion, assumed to be neutralised by ammonium (NH4+). All aerosol prognostic species are represented as dry mass mixing ratios (kg kg−1).
The data assimilation of atmospheric composition in MERRA-2 follows a similar approach to CAMS. MERRA-2 assimilates satellite data for variables like AOD and O3, adjusting the model’s initial conditions to minimise the difference between observation and model fields. For the complete set of datasets assimilated in MERRA-2, refer to Gelaro et al. (2017). The three-dimensional profiles of aerosol mass mixing ratios simulated by the model are converted into AOD using prescribed aerosol optical properties and assuming external mixing. Just like CAMS, AOD is used as the control variable in the assimilation process. After the AOD data assimilation, apportionment of aerosol mass among the aerosol species is determined by a convolution of aerosol parameterisations, meteorology and assumed aerosol emissions (Randles et al. 2017).
2.5. Evaluation methods for the reanalysis products
Correlation analyses were performed between the MASP in situ data and the concentrations at each reanalysis grid point for the four air pollutants under analysis to select the most representative CAMS and MERRA-2 grid points. Metrics to evaluate the performance of reanalysis data in representing in situ observations included the Pearson correlation coefficient (r), relative mean bias (RMB) and mean absolute error (MAE). The Pearson coefficient is defined as
where n is the number of observations, xi and yi respectively represent the reference (ground-based observations) and reanalysis data, and mean values are represented by bar symbol (x̅ and y̅). The coefficient r indicates the strength and direction of the linear relationship between the two variables. A value r = 1 signifies a perfect positive correlation, r = −1 indicates a perfect negative correlation and r = 0 suggests no linear correlation between the variables.
Least squares linear regression was applied to the scatter plots of reanalysis (y) v. observations (x) to retrieve linear correction curves for the reanalysis products:
where a represents the slope and b is the intercept. A 95% level of significance was adopted for this study (P < 0.05). The P-value associated with the Pearson coefficient was used to evaluate whether there was a significant linear correlation between reanalysis and observations (null hypothesis: there is no linear relationship between the variables). Also, the P-values associated with the linear regression coefficients were used to evaluate whether the coefficients differed significantly from zero (null hypothesis: the coefficient equals zero).
MAE is defined as the mean absolute difference between the reanalysis data (yi) and the reference ground-based observations (xi):
MAE provides the average error without considering the direction of the errors (whether they are positive or negative), making it a useful metric for understanding the overall prediction accuracy. The lower the MAE, the more accurate the model.
The RMB was used to obtain the percentage difference between reanalysis and reference observations:
A positive RMB indicates that the reanalysis estimates predictions are, on average, overestimated compared with the reference ground-based observations.
Taylor diagrams were used to support the choice of the best CAMS and MERRA-2 grid points to represent air quality conditions in the MASP, allowing for simultaneous comparison of different metrics (Taylor 2001). In this work, the Taylor diagrams have two dimensions: the Pearson correlation coefficient (r) and the ratio (σr) between the standard deviation of reanalysis (σy) and ground-based (σx) observations:
3.Results
The results are organised as follows: in Section 3.1, we present statistical metrics for the comparison between reanalysis and observation datasets based on different reanalysis grid points. This is a necessary step to justify the co-location criteria, i.e. the choice of the reanalysis grid points that were most representative of the air quality conditions in the MASP. In Section 3.2, we present the overall performance of the reanalysis products at a monthly temporal scale. Finally, in Sections 3.3 and 3.4, we present and interpret concentration time series and mean annual cycles.
3.1. Comparison of different reanalysis grid points against in situ air quality time series
As depicted in Fig. 1, a few grid points represent the whole MASP in the reanalysis products. Nearest neighbour and interpolation are the most common methods for collocation of in situ and model gridded atmospheric data (Albuquerque de Almeida et al. 2018; Ali et al. 2022). A different collocation approach was taken by comparing in situ observations against data from six reanalysis grid points intersecting the metropolitan area. The average air quality conditions in the MASP were represented by the mean of observations in eight monitoring stations (Section 2.2). Fig. 2 shows Taylor diagrams for each air pollutant to compare statistical metrics for the different reanalysis grid points against in situ observational data. The objective is to determine which reanalysis grid points better reproduce observations in the MASP.
Taylor diagrams comparing the performance of reanalysis products for different grid points against in situ observational data. Pearson correlation coefficients (r), standard deviation ratio (σr) and relative mean bias (RMB) are depicted for the air pollutants O3 (a), NO2 (b), PM2.5 (c) and PM10 (d). Circles represent the MERRA-2 reanalysis grid points M1–6. Squares represent the CAMS reanalysis grid points C1–6. The star symbol references the statistical metrics that would result in a perfect comparison against in situ data, with r = 1.0 and σr = 1.0.

The Pearson correlation coefficients for the CAMS grid points were consistently higher than MERRA-2 for all air pollutants and grid points. O3, NO2 and PM2.5 concentrations estimated by CAMS showed the highest correlation coefficients (r > 0.6) with the reference observations, especially at points C2 and C4 (Fig. 2a–c). CAMS data dispersion was typically greater than the reference observations (σ > 1.0). Data from the CAMS C4 grid point showed a data dispersion closer to the reference measurements when compared with the C2 grid point. The CAMS C4 grid point was selected for the analysis below. Concerning the MERRA-2 dataset, grid point M3 showed the best combination of relatively high correlation coefficients and standard deviation ratio close to 1.0 for O3 (Fig. 2a). The PM MERRA-2 dataset was uncorrelated with the ground-based observations, with r < 0.2 in all of the grid points (Fig. 2c, d). The reasons behind the poor representation of PM in MERRA-2 is discussed in Section 3.3. It is noteworthy that the selected grid points (C4 and M3) are typically downwind of the MASP considering the prevailing eastern winds in the region (Silva et al. 2017).
3.2. Overall performance of the reanalysis products at the monthly temporal scale
After selecting grid points C4 and M3 as the most suitable to represent the reference observations, statistical metrics were calculated to characterise the overall performance of the reanalysis products for each air pollutant at the monthly temporal scale. CAMS reanalysis data showed significant correlations with the reference observations, with r values in the range 0.74–0.89 (Table 3). MERRA-2 data showed a significant correlation with observations only for O3.
O3 | NO2 | PM2.5 | PM10 | ||||||
---|---|---|---|---|---|---|---|---|---|
CAMS | MERRA-2 | CAMS | MERRA-2 | CAMS | MERRA-2 | CAMS | MERRA-2 | ||
RMB (%) | 136 | 171 | −45 | – | 50 | −20 | 16 | −11 | |
MAE (μg m−3) | 52.2 | 65.8 | 14.0 | – | 7.5 | 3.7 | 4.8 | 7.4 | |
r | 0.76* | 0.43* | 0.74* | – | 0.85* | 0.21 | 0.89* | −0.28 | |
Slope | 1.23 | 0.62 | 0.58 | – | 1.35 | – | 1.15 | – | |
Intercept (μg m−3) | 43.3 | 80.3 | −0.92 | – | 2.25 | – | 0.29 | – |
Asterisks indicate significant correlation coefficients (P < 0.05).
O3 concentrations were overestimated by CAMS and MERRA-2 reanalysis, with RMB values above 100% and MAE values in the range 52–66 µg m−3 (Table 3). Despite the overestimation, O3 data from CAMS was linearly correlated with observations, reaching an r of 0.76 (Fig. 3a). By contrast, O3 data from MERRA-2 showed inconsistent behaviour along the annual cycle, showing a weaker linear correlation with reference observations, especially in the first semester, which corresponds to the austral summer and autumn periods (Fig. 3b). CAMS overestimation of O3 concentrations is consistent with the underestimation of NO2 concentrations (Table 3), given that O3 photochemical production is limited by hydrocarbons in the MASP (Alvim et al. 2017). Under this production regime, a decrease in NO2 concentrations typically results in increased O3 concentrations. NO2 concentrations were not available in the MERRA-2 dataset, hindering the interpretation of the O3 product.
Comparison between reanalysis and observational data for O3 (a, b), and NO2 (c) concentrations, considering the CAMS and MERRA-2 datasets. The red dashed line represents the 1:1 ratio.

PM2.5 and PM10 concentrations estimated by CAMS were strongly correlated with observations, with r values above 0.85 (Fig. 4 and Table 3). The relative mean bias was 50% for PM2.5 and 16% for PM10, indicating that the CAMS product is suitable to represent PM concentrations in the MASP at the monthly scale, after the correction of biases. However, MERRA-2 estimates for PM were not correlated with the reference observations, failing to reproduce the main features of PM seasonal variability in the MASP.
Comparison between reanalysis and observational data for PM2.5 (a, b), and PM10 (c, d) concentrations, considering the CAMS and MERRA-2 datasets. The red dashed line represents the 1:1 ratio.

On a daily scale, there is a greater dispersion in the comparison between reanalysis and observations, possibly associated with flaws in the models’ representation of atmospheric mesoscale and air pollution removal processes. Section S3 in the Supplementary material depicts an example of a daily concentration time series with contrasting conditions of atmospheric dispersion, showing that CAMS was able to reproduce the main features of PM concentration variability whereas MERRA-2 showed weak variability, disconnected from the ground-based observations.
3.3. Performance of the reanalysis products in reproducing air quality seasonal cycles
To assess the performance of the reanalysis products concerning air quality seasonal variations, mean annual cycles were calculated for the study period (2015–19), depicted in Fig. 5. The annual cycles of NO2 and PM in the MASP are mostly driven by climate conditions, because air pollutant emissions, dominated by vehicular emissions, are rather constant throughout the year (Companhia Ambiental do Estado de São Paulo 2024). The winter (June–August) in this subtropical metropolis is mild, cold and dry, hindering atmospheric dispersion and favouring the accumulation of primary air pollutants (Carvalho et al. 2015; Oliveira et al. 2022). Frequent rain showers in the summer and unfavourable atmospheric dispersion conditions in the winter result in higher concentrations of NO2, PM2.5 and PM10 during the winter in the MASP, as depicted in Fig. 5b–d. Secondary air pollutants like O3 follow a different pattern, with higher concentrations in the austral spring and summer (Fig. 5a), when the increase in solar radiation input and reduced cloud cover favour the photochemical production of secondary pollutants in the MASP (Schuch et al. 2019).
Mean annual cycles of air pollutant concentrations from in situ observations, MERRA-2 and CAMS reanalysis products, considering the period 2015–19: O3 (a), NO2 (b), PM2.5 (c), and PM10 (d). Shadows represent standard deviations.

Concerning O3, both reanalysis products, CAMS and MERRA-2, approximately reproduced the observed seasonality (Fig. 5a). Both reanalysis products estimated an O3 concentration decrease from the austral spring to summer, which was not observed in the ground-based measurements. Also, both reanalysis products overestimated the in situ O3 concentrations in all seasons by as much as 50 µg m−3, with RMB > 100%.
The CAMS reanalysis was also able to reproduce the seasonal variability of NO2, which, like other primary pollutants, shows higher concentrations during the MASP austral winter (Carvalho et al. 2015) (Fig. 5b). CAMS consistently underestimated NO2 concentrations along the year, with RMB = −45%. According to the product’s documentation, CAMS NO2 concentrations are significantly affected by the prescribed anthropogenic emissions. It is possible that NO2 emissions in the MASP are underestimated in CAMS’ emission inventory. Moreover, the overestimation of O3 concentrations by CAMS may partially explain the underestimation of NO2 concentrations under an O3 photochemical production regime limited by hydrocarbons (Alvim et al. 2017).
Concerning PM concentrations, CAMS reanalysis correctly reproduced the increased concentrations in the austral winter (Fig. 5c, d), which was also reflected by r values above 0.85 (Table 3). Atmospheric dispersion is hindered during the mild cold and dry winter period in the MASP, favouring the accumulation of PM (Oliveira et al. 2022). CAMS overestimated PM concentrations, especially PM2.5. However, MERRA-2 did not reproduce the austral winter increase in PM concentrations. Rather, MERRA-2 PM showed an increase in September, when the MASP is typically influenced by the long-range and regional transport of biomass burning aerosols above the planetary boundary layer (Yamasoe et al. 2017). Occasionally, the biomass-burning smoke trapped in the residual layer can affect PM2.5 near-surface concentrations in the MASP (de Arruda Moreira et al. 2021; Souto-Oliveira et al. 2023; Vieira et al. 2023), which could partially explain the secondary peak observed in September for PM concentrations (Fig. 5c, d). Supplementary Fig. S2 shows that exceptionally high PM concentrations were observed at ground level in 2017, which may have contributed to this secondary peak in the mean annual cycle. Despite the contribution of the transport of biomass burning emissions, the main peak in the ground-based PM concentrations occurs in July, demonstrating that the unfavourable meteorological conditions in the winter are the main driver behind the PM seasonal variability in the MASP, which was not captured by the MERRA-2 product.
The seasonal pattern for each PM component was assessed to further investigate the reasons behind the poor performance of MERRA-2 for PM. OC and SS components dominated the MERRA-2 PM2.5 concentrations (Fig. 6b), with concentration peaks in September and November respectively (Fig. 6a). SS aerosols dominated the MERRA-2 PM10 concentrations, representing 58% of the mass (Fig. 6d), and also showing a peak in November (Fig. 6c). Only the MERRA-2 BC showed the expected seasonality for PM in the MASP, with higher concentrations in the austral winter. CAMS reanalysis, however, estimated a dominance of OM, accounting for more than 80% of PM2.5 and PM10 concentrations (Fig. 7). Inorganics were the second most important PM component according to CAMS (~8%), followed by SS and BC. CAMS’ OM, BC and sulfate components peaked in July, consistent with the expected seasonality of PM concentrations in the MASP.
Mean annual cycle of MERRA-2 near-surface PM2.5 (a), and PM10 (c) components as mixing ratios (kg kg−1). The relative contribution of each aerosol component is also shown for PM2.5 (b), and PM10 (d). MERRA-2 accounted for the following aerosol components: dust and sea salt (SS) in different size ranges, organic (OC) and black carbon (BC) in different size ranges and sulfate (SO4).

Mean annual cycle of CAMS near-surface PM2.5 (a), and PM10 (c) components as mixing ratios (kg kg−1). The relative contribution of each aerosol component is also shown for PM2.5 (b), and PM10 (d). CAMS accounted for the following aerosol components: dust and sea salt (SS) in different size ranges, organic matter (OM), black carbon (BC), sulfate (SU), ammonium (AM) and nitrate (NI).

3.4. Temporal variability of concentrations at the monthly scale
Fig. 8 shows the monthly time series of air pollutant concentrations estimated by the CAMS and MERRA-2 reanalysis and the reference observations in the MASP. There were no significant concentration trends during the study period (2015–19), although some interannual variability is present. For example, the relatively low O3 concentrations observed in the austral spring of 2018 (September–November) were correctly reproduced by the CAMS reanalysis product (Fig. 8a and Supplementary Fig S3). The MERRA-2 data showed a rather constant O3 annual cycle. In some years, the MERRA-2 O3 concentration peak was not in phase with the reference observations, such as in the austral spring of 2018.
Monthly time series of concentrations from the reanalysis CAMS and MERRA-2 and from ground-based observations for the following air pollutants: O3 (a), NO2 (b), PM2.5 (c), PM10 (d).

CAMS was able to reproduce the relatively high PM2.5 concentrations in the austral winters of 2017 and 2018 but overestimated PM2.5 in the winter of 2019 (Fig. 8c). Oscillations in the PM2.5 and PM10 concentrations observed in the austral summers of 2016 and 2017 were remarkably well reproduced by the CAMS reanalysis (Fig. 8c, d), consistent with the high correlation coefficients presented in Section 3.2. The CAMS dataset properly represented PM monthly concentration anomalies (refer to Section S2 in the Supplementary material). The correct reproduction of the PM variability indicates that the CAMS reanalysis can represent the variability of aerosol emissions and atmospheric removal processes in the MASP, at least at the monthly temporal scale. As discussed in Section 3.3, the MERRA-2 product failed to reproduce the basic features of PM seasonal variability. Likewise, the MERRA-2 PM time series showed a monthly variability clearly disconnected from the reference observations (Fig. 8c, d, and Supplementary Fig. S3).
4.Discussion and conclusion
Overall, the CAMS reanalysis data are the best option to represent the near-surface concentration of air pollutants in the MASP. At the monthly scale, linear correlations between CAMS and reference ground-based observations were statistically significant for O3, NO2, PM10 and PM2.5. CAMS reanalysis was able to reproduce seasonal cycles and monthly variability of pollutant concentrations (Fig. 5). However, CAMS overestimated O3 concentrations by 136% in the MASP (Table 3), which is unusual compared with other cities worldwide. In mid-latitude regions of the Northern Hemisphere, the RMB values reported in the literature range between −40 and 20% for CAMS O3 concentrations (Wagner et al. 2021; Lacima et al. 2022). CAMS validation studies for O3 in the tropical lower troposphere reported overestimations of up to 30% (Wang et al. 2020) and underestimations of 50% in Colombian cities (Casallas et al. 2024). Part of the overestimation of CAMS’ O3 concentrations in the MASP is likely due to the underestimation of NO2, considering an O3 production regime limited by hydrocarbons (Alvim et al. 2017). Another hypothesis is a possible inaccurate representation of the MASP’s predominant volatile organic compounds (VOCs) in the CAMS emission inventories and photochemical model. MASP has a unique vehicular emission profile associated with the extensive use of ethanol, resulting in distinct VOC amounts, speciation and O3 yields (Brito et al. 2015).
CAMS biases were generally smaller for PM, with 50 and 16% overestimations for PM2.5 and PM10 respectively (Table 3). CAMS reproduced features of the monthly variability of PM concentrations in the MASP remarkably well (Fig. 8), resulting in Pearson correlation coefficients above 0.85. Although the CAMS data have shown significant biases in the MASP, they can be easily adjusted using the linear regression equations obtained in this work from the parameters in Table 3. Other statistical downscaling techniques, like quantile mapping, could also be applied to adjust the CAMS biases in the MASP (e.g. Casanueva et al. 2020).
The concentrations estimated by the MERRA-2 reanalysis were not correlated with the reference ground-based observations in the MASP (Table 3). The typical decrease in O3 concentrations along the first semester (from the austral summer to autumn) was not correctly reproduced by the MERRA-2 reanalysis (Fig. 5). The MERRA-2 dataset showed weaker monthly variability for O3 concentrations compared with the reference observations. In some years, MERRA’s O3 concentration peak was not in phase with the reference observations, such as in the austral spring of 2018 (Fig. 8). Wargan et al. (2017) validated MERRA-2 against ozone sonde data, showing satisfactory performance in capturing O3 spatial and temporal variability in the stratosphere and upper troposphere. However, a bias was identified, with MERRA-2 consistently underestimating O3 concentrations in the upper troposphere compared with ozone sondes, showing standard deviations of up to 24.5% in this region, despite high correlations above 0.80. Lacima et al. (2022) validated MERRA-2 against in situ data over Europe and showed a systematic overestimation in O3, of ~ 34%. These findings highlight the need for further investigation into the reasons behind the incorrect representation of near-surface O3 concentrations by MERRA-2 in the MASP.
PM estimates from MERRA-2 did not reproduce the main features of their seasonal variability in the MASP, which typically shows higher PM concentrations in the austral winter. Instead, MERRA-2 PM showed an increase in September, when the MASP is typically influenced by the long-range and regional transport of biomass burning aerosols from the Amazon Basin and central regions of South America toward the south of the continent (Yamasoe et al. 2017; Souto-Oliveira et al. 2023). This transport typically occurs above the boundary layer, with occasional impacts on near-surface air quality conditions in the MASP (de Arruda Moreira et al. 2021; Souto-Oliveira et al. 2023; Vieira et al. 2023). The transport of biomass-burning smoke strongly affects aerosol measurements in the atmospheric column, such as AOD remote sensing observations, which constrains PM reanalysis retrievals (Randles et al. 2017; Garrigues et al. 2022). In the reanalysis products, apportionment of the different aerosol components is determined by the convolution between the aerosol model estimates and assumptions, and the AOD data assimilated from satellite observations. Both reanalysis products, CAMS and MERRA-2, reproduced the seasonal pattern of AOD based on AERONET observations in the MASP, with higher values in September (Supplementary Fig. S7). This is an independent validation because AERONET data are not assimilated in the reanalysis products during the study period. According to the MERRA-2 documentation, it is possible to have a good agreement in AOD while having poor agreement in the aerosol components (Randles et al. 2017). If the aerosol model overestimates the contribution of a particular aerosol component, a positive AOD increment during the data assimilation process will magnify this bias. As such, it is possible that AOD observations excessively influence the PM MERRA-2 data, leading to a strong positive bias in PM estimated concentrations in September (Fig. 5c, d).
To further investigate the reasons behind the poor performance of MERRA-2 for PM, the seasonal variability of PM constituents retrieved from the reanalysis products was assessed (Fig. 6 and 7). MERRA-2 PM2.5 was dominated by SS (35%), OC (25%) and sulfate (23%), with concentration peaks in September and November. MERRA-2 PM10 was dominated by SS aerosols (58%), also showing a peak in November. Previous observational studies in the MASP showed that organics represent 55% of the PM2.5 mass, followed by sulfate (15%) and equivalent BC (14%) (Santos et al. 2021). Concerning PM10, previous observations in the MASP have shown a dominance of soil dust particles (20–50%), whereas SS particles represented less than 5% of the PM10 mass (Castanho and Artaxo 2001; de Almeida Albuquerque et al. 2012). The comparison with previous observations in the MASP suggests that SS concentrations are overestimated in the MERRA-2 reanalysis, contributing to a distortion in the PM2.5 and PM10 seasonality. At the same time, OC concentrations may be underestimated in MERRA-2. The predicted OC peak in September, instead of the expected peak in July, suggests that MERRA-2 is giving too much importance to the transport of biomass-burning smoke, whereas the local OC emission sources could be underestimated. Buchard et al. (2017) reported similar biases for MERRA-2 PM in the north-west and north-east USA, with an overestimation of dust and SS and an underestimation of OC emissions sources in suburban areas during the winter.
Although SS dominated PM10 composition in MERRA-2 (58%), in CAMS, it represented less than 5%, consistent with ground-based observations (Castanho and Artaxo 2001; de Almeida Albuquerque et al. 2012). CAMS estimates showed similar apportionment of aerosol components in PM2.5 and PM10, with an overwhelming dominance of organic matter (>80%), peaking in the austral winter (Fig. 7). Even though the PM seasonality is correctly represented in CAMS, comparing the relative importance of aerosol components in CAMS with ground-based observations, the contribution of dust in PM10 and the contribution of BC in PM2.5 may be underestimated, whereas OM could be overestimated. Nevertheless, the resulting PM seasonal variability was correctly reproduced by CAMS, indicating that the model was able to address the relative contributions of local emissions and regional transport to the MASP area.
Concerning the implications of the findings, we conclude that the CAMS reanalysis was adequate to represent near-surface air quality conditions in the vicinity of the MASP and at the monthly scale, although bias corrections are needed. This validation analysis should be extended to other South American cities to characterise the strengths and limitations of air composition reanalysis products in different regions. Air quality reanalysis data can be especially useful in regions where ground-based monitoring networks are limited or absent, such as in South America.
Data availability
The data that support this study are available at the Laboratory of Atmospheric Physics FTP server at http://ftp.lfa.if.usp.br/ftp/public/LFA_Processed_Data/articles_database/.
Declaration of funding
This work was supported by the Sao Paulo State Research Foundation (Fundação de Amparo à Pesquisa do Estado de São Paulo, FAPESP) (grant numbers 2022/16238-5 to Marina S. Paiva, and 2016/18438-0 to Luciana V. Rizzo and Marco A. Franco) and by the Brazilian National Council for Scientific and Technological Development (Conselho Nacional de Desenvolvimento Científico e Tecnológico, CNPq) (grant numbers 405179/2021-9 and 304819/2022-0 to Luciana V. Rizzo).
Acknowledgements
The authors thank the Sao Paulo State Environmental Agency, Companhia Ambiental do Estado de São Paulo (CETESB) for providing the ground-based air quality data.
References
Albuquerque de Almeida V, Marton E, Nunes AMB (2018) Assessing the ability of three global reanalysis products to reproduce South American monsoon precipitation. Atmósfera 31(1), 1-10.
| Crossref | Google Scholar |
Ali MA, Bilal M, Wang Y, Nichol JE, Mhawish A, Qiu Z, de Leeuw G, Zhang Y, Zhan Y, Liao K, Almazroui M, Dambul R, Shahid S, Islam MN (2022) Accuracy assessment of CAMS and MERRA-2 reanalysis PM2.5 and PM10 concentrations over China. Atmospheric Environment 288, 119297.
| Crossref | Google Scholar |
Alvim DS, Gatti LV, Corrêa SM, Chiquetto JB, de Souza Rossatti C, Pretto A, Santos MH, Yamazaki A, Orlando JP, Santos GM (2017) Main ozone-forming VOCs in the city of Sao Paulo: observations, modelling and impacts. Air Quality, Atmosphere and Health 10(4), 421-435.
| Crossref | Google Scholar |
Brito J, Wurm F, Yáñez-Serrano AM, de Assunção JV, Godoy JM, Artaxo P (2015) Vehicular emission ratios of VOCs in a megacity impacted by extensive ethanol use: results of ambient measurements in São Paulo, Brazil. Environmental Science & Technology 49, 11381-11387.
| Crossref | Google Scholar | PubMed |
Brito J, Carbone S, A. Monteiro dos Santos D, Dominutti P, de Oliveira Alves N, V. Rizzo L, Artaxo P (2018) Disentangling vehicular emission impact on urban air pollution using ethanol as a tracer. Scientific Reports 8(1), 10679.
| Crossref | Google Scholar | PubMed |
Buchard V, da Silva AM, Randles CA, Colarco P, Ferrare R, Hair J, Hostetler C, Tackett J, Winker D (2016) Evaluation of the surface PM2.5 in Version 1 of the NASA MERRA Aerosol Reanalysis over the United States. Atmospheric Environment 125, 100-111.
| Crossref | Google Scholar |
Buchard V, Randles CA, da Silva AM, Darmenov A, Colarco PR, Govindaraju R, Ferrare R, Hair J, Beyersdorf AJ, Ziemba LD, Yu H (2017) The MERRA-2 aerosol reanalysis, 1980 onward. Part II: evaluation and case studies. Journal of Climate 30(17), 6851-6872.
| Crossref | Google Scholar | PubMed |
Carvalho H (2016) The air we breathe: differentials in global air quality monitoring. The Lancet Respiratory Medicine 4(8), 603-605.
| Crossref | Google Scholar | PubMed |
Carvalho VSB, Freitas ED, Martins LD, Martins JA, Mazzoli CR, de Fátima Andrade M (2015) Air quality status and trends over the Metropolitan Area of São Paulo, Brazil as a result of emission control policies. Environmental Science & Policy 47, 68-79.
| Crossref | Google Scholar |
Casallas A, Castillo-Camacho MP, Guevara-Luna MA, González Y, Sanchez E, Belalcazar LC (2022) Spatio-temporal analysis of PM2.5 and policies in northwestern South America. Science of The Total Environment 852, 158504.
| Crossref | Google Scholar | PubMed |
Casallas A, Cabrera A, Guevara-Luna MA, Tompkins A, González Y, Aranda J, et al. (2024) Air pollution analysis in northwestern South America: a new Lagrangian framework. Science of The Total Environment 906, 167350.
| Crossref | Google Scholar | PubMed |
Casanueva A, Herrera S, Iturbide M, Lange S, Jury M, Dosio A, Maraun D, Gutiérrez JM (2020) Testing bias adjustment methods for regional climate change applications under observational uncertainty and resolution mismatch. Atmospheric Science Letters 21(7), e978.
| Crossref | Google Scholar |
Castanho ADA, Artaxo P (2001) Wintertime and summertime Sao Paulo aerosol source apportionment study. Atmospheric Environment 35, 4889-4902.
| Crossref | Google Scholar |
Castesana P, Diaz Resquin M, Huneeus N, Puliafito E, Darras S, Gómez D, Granier C, Osses Alvarado M, Rojas N, Dawidowski L (2022) PAPILA dataset: a regional emission inventory of reactive gases for South America based on the combination of local and global information. Earth System Science Data 14(1), 271-293.
| Crossref | Google Scholar |
Cohen AJ, Brauer M, Burnett R, Anderson HR, Frostad J, Estep K, Balakrishnan K, Brunekreef B, Dandona L, Dandona R, Feigin V, Freedman G, Hubbell B, Jobling A, Kan H, Knibbs L, Liu Y, Martin R, Morawska L, Pope CA, Shin H, Straif K, Shaddick G, Thomas M, van Dingenen R, van Donkelaar A, Vos T, Murray CJL, Forouzanfar MH (2017) Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the Global Burden of Diseases Study 2015. The Lancet 389(10082), 1907-1918.
| Crossref | Google Scholar | PubMed |
Companhia Ambiental do Estado de São Paulo (2016) Classificação expedita da representatividade espacial das estações de monitoramento da qualidade do ar da CETESB no Estado de São Paulo. (CETESB) Available at https://cetesb.sp.gov.br/ar/wp-content/uploads/sites/28/2013/12/Relat%C3%B3rio-Classifica%C3%A7%C3%A3o_Terceira-Etapa.pdf [In Portuguese]
Companhia Ambiental do Estado de São Paulo (2024) Qualidade do ar no estado de São Paulo 2023. (CETESB) Available at https://cetesb.sp.gov.br/ar/wp-content/uploads/sites/28/2024/08/Relatorio-de-Qualidade-do-Ar-no-Estado-de-Sao-Paulo-2023.pdf [In Portuguese]
Damascena AS, Yamasoe MA, Martins VS, Rosas J, Benavente NR, Sánchez MP, Tanaka NI, Saldiva PHN (2021) Exploring the relationship between high-resolution aerosol optical depth values and ground-level particulate matter concentrations in the Metropolitan Area of São Paulo. Atmospheric Environment 244(May 2020), 117949.
| Crossref | Google Scholar |
de Almeida Albuquerque TT, de Fátima Andrade M, Ynoue RY (2012) Characterization of atmospheric aerosols in the city of São Paulo, Brazil: comparisons between polluted and unpolluted periods. Environmental Monitoring and Assessment 184(2), 969-84.
| Crossref | Google Scholar | PubMed |
de Arruda Moreira G, da Silva Andrade I, Cacheffo A, da Silva Lopes FJ, Calzavara Yoshida A, Gomes AA, et al. (2021) Influence of a biomass-burning event in PM2.5 concentration and air quality: a case study in the metropolitan area of São Paulo. Sensors 21(2), 425.
| Crossref | Google Scholar | PubMed |
de Fatima Andrade M, Kumar P, de Freitas ED, Ynoue RY, Martins J, Martins LD, Nogueira T, Perez-Martinez P, de Miranda RM, Albuquerque T, Gonçalves FLT, Oyama B, Zhang Y (2017) Air quality in the megacity of São Paulo: evolution over the last 30 years and future perspectives. Atmospheric Environment 159, 66-82.
| Crossref | Google Scholar |
Dufek AS, Ambrizzi T, Da Rocha RP (2008) Are reanalysis data useful for calculating climate indices over South America? Annals of the New York Academy of Sciences 1146(1), 87-104.
| Crossref | Google Scholar | PubMed |
European Centre for Medium-Range Weather Forecasts (2024) ‘IFS Documentation CY49R1 – Part VIII: Atmospheric Composition.’ (ECMWF: Reading, UK) 10.21957/d13af18259
Garrigues S, Remy S, Chimot J, Ades M, Inness A, Flemming J, Kipling Z, Laszlo I, Benedetti A, Ribas R, Jafariserajehlou S, Fougnie B, Kondragunta S, Engelen R, Peuch V-H, Parrington M, Bousserez N, Vazquez Navarro M, Agusti-Panareda A (2022) Monitoring multiple satellite aerosol optical depth (AOD) products within the Copernicus Atmosphere Monitoring Service (CAMS) data assimilation system. Atmospheric Chemistry and Physics 22(22), 14657-14692.
| Crossref | Google Scholar |
Gelaro R, McCarty W, Suárez MJ, Todling R, Molod A, Takacs L, Randles CA, Darmenov A, Bosilovich MG, Reichle R, Wargan K, Coy L, Cullather R, Draper C, Akella S, Buchard V, Conaty A, da Silva AM, Gu W, Kim GK, Koster R, Lucchesi R, Merkova D, Nielsen JE, Partyka G, Pawson S, Putman W, Rienecker M, Schubert SD, Sienkiewicz M, Zhao B (2017) The Modern-Era Retrospective Analysis for research and applications, version 2 (MERRA-2). Journal of Climate 30(14), 5419-5454.
| Crossref | Google Scholar | PubMed |
Global Modeling and Assimilation Office (2015a) MERRA-2 inst3_3d_chm_Nv: 3d, 3-hourly, instantaneous, model-level, assimilation, carbon monoxide and ozone mixing ratio V5.12.4. (Goddard Earth Sciences Data and Information Services Center: Greenbelt, MD, USA) 10.5067/HO9OVZWF3KW2
Global Modeling and Assimilation Office (2015b) MERRA-2 tavg1_2d_aer_Nx: 2d,1-hourly, time-averaged, single-level, assimilation, aerosol diagnostics V5.12.4. (Goddard Earth Sciences Data and Information Services Center: Greenbelt, MD, USA) 10.5067/KLICLTZ8EM9D
Goldberg DL, Anenberg SC, Kerr GH, Mohegh A, Lu Z, Streets DG (2021) TROPOMI NO2 in the United States: a detailed look at the annual averages, weekly cycles, effects of temperature, and correlation with surface NO2 concentrations. Earth’s Future 9(4), e2020EF001665.
| Crossref | Google Scholar | PubMed |
Granier C, Bessagnet B, Bond T, D’Angiola A, Denier van der Gon H, Frost GJ, Heil A, Kaiser JW, Kinne S, Klimont Z, Kloster S, Lamarque J-F, Liousse C, Masui T, Meleux F, Mieville A, Ohara T, Raut J-C, Riahi K, Schultz MG, Smith SJ, Thompson A, van Aardenne J, van der Werf GR, van Vuuren DP (2011) Evolution of anthropogenic and biomass burning emissions of air pollutants at global and regional scales during the 1980–2010 period. Climatic Change 109(1–2), 163-190.
| Crossref | Google Scholar |
Inness A, Ades M, Agustí-Panareda A, Barré J, Benedictow A, Blechschmidt A-M, Dominguez JJ, Engelen R, Eskes H, Flemming J, Huijnen V, Jones L, Kipling Z, Massart S, Parrington M, Peuch V-H, Razinger M, Remy S, Schulz M, Suttie M (2019) The CAMS reanalysis of atmospheric composition. Atmospheric Chemistry and Physics 19(6), 3515-3556.
| Crossref | Google Scholar |
Instituto Brasileiro de Geografia e Estatística (2023) Cidades e Estados do Brasil: Brasil / São Paulo / São Paulo. (IBGE) Available at https://cidades.ibge.gov.br/brasil/sp/sao-paulo/panorama [In Portuguese, verified 13 May 2025]
Jin C, Wang Y, Li T, Yuan Q (2022) Global validation and hybrid calibration of CAMS and MERRA-2 PM2.5 reanalysis products based on OpenAQ platform. Atmospheric Environment 274, 118972.
| Crossref | Google Scholar |
Júnior ALP, Curado LFA, Palácios RdS, dos Santos LOF, Querino CAS, Querino JKAdS, et al. (2025) Evaluation of Aerosol Optical Depth (AOD) estimated by Copernicus Atmosphere Monitoring Service (CAMS) in Brazil. Theoretical and Applied Climatology 156(2), 116.
| Crossref | Google Scholar |
Lacima A, Petetin H, Soret A, Bowdalo D, Jorba O, Chen Z, et al. (2022) Long-term evaluation of surface air pollution in CAMSRA and MERRA-2 global reanalysis over Europe (2003–2020). Geoscientific Model Development Discussions 16, 2689-2718.
| Crossref | Google Scholar |
Lahoz WA, Schneider P (2014) Data assimilation: making sense of Earth Observation. Frontiers in Environmental Science 2, 16.
| Crossref | Google Scholar |
Li J, Wang T, Li C, Yan H, Alam K, Cui Y, et al. (2024) Can the aerosol pollution extreme events be revealed by global reanalysis products? Science of The Total Environment 923, 171424.
| Crossref | Google Scholar | PubMed |
Molina LT (2021) Introductory lecture: air quality in megacities. Faraday Discussions 226, 9-52.
| Crossref | Google Scholar |
Oliveira MCQD, Drumond A, Rizzo LV (2022) Air pollution persistent exceedance events in the Brazilian metropolis of Sao Paulo and associated surface weather patterns. International Journal of Environmental Science and Technology 19(10), 9495-9506.
| Crossref | Google Scholar |
Paton-Walsh C, Emmerson KM, Garland RM, Keywood M, Hoelzemann JJ, Huneeus N, Buchholz RR, Humphries RS, Altieri K, Schmale J, Wilson SR, Labuschagne C, Kalisa E, Fisher JA, Deutscher NM, van Zyl PG, Beukes JP, Joubert W, Martin L, Mkololo T, Barbosa C, de Fatima Andrade M, Schofield R, Mallet MD, Harvey MJ, Formenti P, Piketh SJ, Olivares G (2022) ‘Key challenges for tropospheric chemistry in the Southern Hemisphere, Elementa.’ (University of California Press) 10.1525/elementa.2021.00050
Randles CA, da Silva AM, Buchard V, Colarco PR, Darmenov A, Govindaraju R, Smirnov A, Holben B, Ferrare R, Hair J, Shinozuka Y, Flynn CJ (2017) The MERRA-2 Aerosol Reanalysis, 1980 onward. Part I: system description and data assimilation evaluation. Journal of Climate 30(17), 6823-6850.
| Crossref | Google Scholar | PubMed |
Ranjan AK, Patra AK, Gorai AK (2021) A review on estimation of particulate matter from satellite-based aerosol optical depth: data, methods, and challenges. Asia-Pacific Journal of Atmospheric Sciences 57(3), 679-699.
| Crossref | Google Scholar |
Salvo A, Geiger FM (2014) Reduction in local ozone levels in urban São Paulo due to a shift from ethanol to gasoline use. Nature Geoscience 7, 450-458.
| Crossref | Google Scholar |
Santos DM, Rizzo LV, Carbone S, Schlag P, Artaxo P (2021) Physical and chemical properties of urban aerosols in São Paulo, Brazil: Links between composition and size distribution of submicron particles. Atmospheric Chemistry and Physics 21, 8761-8773.
| Crossref | Google Scholar |
Schuch D, de Freitas ED, Espinosa SI, Martins LD, Carvalho VSB, Ramin BF, Silva JS, Martins JA, de Fatima Andrade M (2019) A two decades study on ozone variability and trend over the main urban areas of the São Paulo state, Brazil. Environmental Science and Pollution Research 26(31), 31699-31716.
| Crossref | Google Scholar |
Silva FB, Longo KM, de Andrade FM (2017) Spatial and temporal variability patterns of the urban heat island in São Paulo. Environments 4(2), 27.
| Crossref | Google Scholar |
Souto-Oliveira CE, Marques MT, Nogueira T, Lopes FJ, Medeiros JA, Medeiros IM, et al. (2023) Impact of extreme wildfires from the Brazilian Forests and sugarcane burning on the air quality of the biggest megacity in South America. Science of The Total Environment 888, 163439.
| Crossref | Google Scholar | PubMed |
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. Journal of Geophysical Research: Atmospheres 106(D7), 7183-7192.
| Crossref | Google Scholar |
Vara-Vela A, Andrade MF, Kumar P, Ynoue RY, Muñoz AG (2016) Impact of vehicular emissions on the formation of fine particles in the Sao Paulo Metropolitan Area: a numerical study with the WRF-Chem model. Atmospheric Chemistry and Physics 16, 777-797.
| Crossref | Google Scholar |
Vieira EVR, do Rosario NE, Yamasoe MA, Morais FG, Martinez PJP, Landulfo E, Maura de Miranda R (2023) Chemical characterization and optical properties of the aerosol in São Paulo, Brazil. Atmosphere 14(9), 1460.
| Crossref | Google Scholar |
Vormittag EdMPAdA, Cirqueira SSR, Wicher Neto H, Saldiva PHN (2021) Análise do monitoramento da qualidade do ar no Brasil. Estudos Avançados 35(102), 7-30.
| Crossref | Google Scholar |
Wagner A, Bennouna Y, Blechschmidt A-M, Brasseur G, Chabrillat S, Christophe Y, Errera Q, Eskes H, Flemming J, Hansen KM, Inness A, Kapsomenakis J, Langerock B, Richter A, Sudarchikova N, Thouret V, Zerefos C (2021) Comprehensive evaluation of the Copernicus Atmosphere Monitoring Service (CAMS) reanalysis against independent observations. Elementa: Science of the Anthropocene 9(1), 00171.
| Crossref | Google Scholar |
Wang Y, Ma YF, Eskes H, Inness A, Flemming J, Brasseur GP (2020) Evaluation of the CAMS global atmospheric trace gas reanalysis 2003–2016 using aircraft campaign observations. Atmospheric Chemistry and Physics 20(7), 4493-4521.
| Crossref | Google Scholar |
Wargan K, Labow G, Frith S, Pawson S, Livesey N, Partyka G (2017) Evaluation of the ozone fields in NASA’s MERRA-2 reanalysis. Journal of Climate 30(8), 2961-2988.
| Crossref | Google Scholar | PubMed |
Yamasoe MA, do Rosário NME, Barros KM (2017) Downward solar global irradiance at the surface in São Paulo city – the climatological effects of aerosol and clouds. Journal of Geophysical Research 122(1), 391-404.
| Crossref | Google Scholar |