Abstract
Ethiopia is highly susceptible to the effects of climate change and variability. This study evaluated the performances of 37 CMIP6 models against a gridded rainfall product of Ethiopia known as Enhancing National Climate Services (ENACTS) in simulating the observed rainfall from 1981 to 2014. Taylor Skill Score was used for ranking the performance of individual models for mean monthly, June–September, and February–May seasonal rainfall. Comprehensive rating metrics (RM) were used to derive the overall ranks of the models. Results show that the performances of the models were not consistent in reproducing rainfall distributions at different statistical metrics and timeframes. More than 20 models simulated the largest dry bias on high topographic and rainfall-receiving areas of the country during the June–September season. The RM-based overall ranks of CMIP6 models showed that GFDL-CM4 is the best-performing model followed by GFDL-ESM4, NorESM2-MM, and CESM2 in simulating rainfall over Ethiopia. The ensemble of these four Global Climate Models showed the best performance in representing the spatiotemporal patterns of the observed rainfall relative to the ensembles of all models. Generally, this study highlighted the existence of dry bias in climate model projections for Ethiopia, which requires bias adjustment of the models, for impact assessment.
HIGHLIGHTS
Dry rainfall bias is common in most CMIP6 models over Ethiopia in both rainy seasons.
The four top-ranked CMIP6 models for Ethiopia are GFDL-CM4, GFDL-ESM4, NorESM2-MM, and CESM2.
Most CMIP6 models better captured the short seasonal rainfall cycle (FMAM) relative to the long seasonal rainfall cycle (JJAS).
Weighted average method is a good approach in generating ensembles.
INTRODUCTION
Climate variability and change have a substantial impact on the livelihood and economy of people in the Horn of Africa including Ethiopia (Reinman 2012). To address the impact of climate change, the Ethiopian Government implements a National Adaptation Plan (NAP) by developing a Climate Resilient Green Economy (CRGE) strategy as the country's development policy framework. The country aims to pursue further integration of climate change adaptation proactively and iteratively in development policies and strategies (FDRE 2019). It is, therefore, important to have a reliable future climate projection for the country to build a successful NAP, devise efficient CRGE strategies, and prevent setbacks associated with climate change.
The coupled model intercomparison project phase 6 (CMIP6) models show improvements in the spatial resolution and the representation of physical processes relative to the previous CMIP5 models (Eyring et al. 2016). Various studies therefore utilized CMIP6 models to study climate characteristics of observed datasets over different parts of the globe (Zhu & Yang 2020; Ayugi et al. 2021; Babaousmail et al. 2021; Ngoma et al. 2021; Faye & Akinsanola 2022; Makula & Zhou 2022). The above-mentioned authors reported that CMIP6 had both better and poorer performance than earlier versions of CMIP5 in simulating climate characteristics in different regions. Moreover, most of those studies indicate that the performance of models in reproducing the observed climate can vary over regions and across CMIP6 models.
Few studies have evaluated the CMIP5 models in Ethiopia (Jury 2015; Li et al. 2016; Dyer et al. 2020) and they indicated that CMIP5 models are still prone to substantial biases in simulating seasonal and annual rainfall over the country. Few studies have evaluated and utilized CMIP6 models in Ethiopia (Alaminie et al. 2021; Dyer et al. 2022; Fetene et al. 2022). Despite the high importance of assessing the performances of different CMIP6 models in replicating the mean monthly and seasonal rainfall both spatially and temporally, there is no national study that investigates the performance of different CMIP6 models relevant to impact studies of different sectors. This study aimed to fill that knowledge gap using the most reliable gridded dataset of Enhancing National Climate Services (ENACTS) produced specifically for Ethiopia to evaluate the CMIP6 models.
This study evaluates the capability of CMIP6 models in their representation of the spatiotemporal rainfall distribution before employing them in future climate projections for building adaptation capacity and climate resilient strategies at the national scale. It also selects the optimum number of models for generating an ensemble of CMIP6 which can be used for future climate projection impact studies over Ethiopia Spatial and temporal evaluation of CMIP6 rainfall products were conducted for mean monthly, long rainy season during June–September (JJAS), and short rainy season during February–May (FMAM). The two seasons account for about 90% of the total rainfall over the country (Philip et al. 2018).
STUDY AREA, DATASETS, AND METHODS
Study area description
Datasets
ENACTS and CHIRPS
Rainfall measurements from ground-based rain gauge observations in Ethiopia cannot provide adequate and timely information due to insufficient spatial coverage, poor representatives, and many missing records (Dinku et al. 2014). Thus, ENACTS has been developed by a three-track approach that involved the National Meteorological and Hydrological Services (NMHS), the International Research Institute for Climate and Society (IRI), and the University of Reading in order to produce high resolution gridded climate data time series (Dinku et al. 2014).
ENACTS climate dataset derives from two main kinds of data. The first dataset is station rainfall data from the national weather stations managed by NMHS. The Ethiopian NMHS contributed high-quality-controlled ground-based station data from more than 600 rain gauge stations. The second dataset is satellite rainfall estimates from the European Organization for the Exploitation of Meteorological Satellites. TAMSAT satellite rainfall data were calibrated over Ethiopia using over 600 high-quality-controlled station data to generate the best possible satellite rainfall estimates. Station rainfall data were used to evaluate and correct the errors in the spatially complete products, which in turn are used to fill spatial and temporal gaps in station observations. The combined station and satellite products were generated at a daily or decadal time scale for all available stations, and the final products are datasets with over 40 years of rainfall time series produced at nominal 4-km grids across the country. Most global satellite products use 17 stations through Global Telecommunications System (GTS), stations while the ENACTS data incorporate over 600 stations. The result is a distinctive, high-quality, and more reliable dataset than any other long-term, satellite-based time series dataset (Dinku et al. 2014). ENACTS products are a ‘gold standard’ binding of the best local datasets and global data and can be produced sustainably so that new data can be regularly added to the historical archive (Dinku et al. 2018).
ENACTS has a spatial resolution of 0.0375° × 0.0375° from 1981 to 2020 across Ethiopia. Hence, in this study, ENACTS gridded dataset was used as an observation reference in order to evaluate the performance of CMIP6 models in simulating rainfall characteristics for the period 1981–2014 over Ethiopia.
In addition, monthly Climate Hazards Group InfraRed Precipitation with Station (CHIRPS) data for the period 1981–2014 were used as an additional observed dataset to evaluate the climate models. CHIRPS v2.0 is a quasi-global (50°S–50°N) gridded product and has a spatial resolution of 0.05° × 0.05° at daily, pentad, decadal, and monthly temporal resolution (Funk et al. 2015). CHIRPS v2.0 is available online at http://chg.geog.ucsb.edu/data/chirps. A detailed description of CHIRPS satellite rainfall products is available in Funk et al. (2015).
CMIP6 model data
This study used the monthly precipitation data of historical simulation of 37 CMIP6 model outputs from 1981 to 2014 for the same period as observed reference gridded datasets. The rainfall simulated data from 37 CMIP6 models were obtained from Earth System Grid Federation archives (https://esgf-node.llnl.gov/ search/cmip6). The details of CMIP6 models regarding the modeling groups, countries, and horizontal resolution are presented in Table 1.
No. . | Model . | Country . | Resolution (longitude × latitude) . |
---|---|---|---|
1 | ACCESS-CM2 | Australia | 1.9 × 1.3 |
2 | ACCESS-ESM1-5 | Australia | 1.9 × 1.3 |
3 | BCC-CSM2-MR | China | 1.13 × 1.13 |
4 | BCC-ESM1 | China | 2.81 × 2.81 |
5 | CAMS-CSM1-0 | China | 1.13 × 1.13 |
6 | CanESM5 | Canada | 2.81 × 2.81 |
7 | CESM2 | USA | 1.25 × 0.94 |
8 | CESM2-WACCM | USA | 1.25 × 0.94 |
9 | CESM2-WACCM-FV2 | USA | 2.5 × 1.9 |
10 | CMCC-CM2-HR4 | Italy | 1.25 × 0.94 |
11 | CMCC-CM2-SR5 | Italy | 1.25 × 0.94 |
12 | CMCC-ESM2 | Italy | 1.3 × 0.9 |
13 | CNRM-CM6- 1 | France | 1.4 × 1.4 |
14 | CNRM-CM6-1-HR | France | 0.5 × 0.5 |
15 | CNRM-ESM2-1 | France | 1.4 × 1.4 |
16 | E3SM-1-0 | USA | 1.0 × 1.0 |
17 | EC-Earth3 | Europe | 0.7 × 0.7 |
18 | EC-Earth3-Veg | Europe | 0.7 × 0.7 |
19 | EC-Earth3-Veg-LR | Europe | 1.1 × 1.1 |
20 | FGOALS-f3-L | China | 1.25 × 1.0 |
21 | FGOALS-g3 | China | 2.0 × 2.5 |
22 | GFDL-CM4 | USA | 2.50 × 2.00 |
23 | GFDL-ESM4 | USA | 1.25 × 1.00 |
24 | HadGEM3-GC31-LL | UK | 1.9 × 1.3 |
25 | IITM-ESM | India | 1.9 × 1.9 |
26 | INM-CM4-8 | Russia | 2.0 × 1.5 |
27 | INM-CM5-0 | Russia | 2.0 × 1.5 |
28 | IPSL-CM6A-LR | France | 2.5 × 1.3 |
29 | MIROC6 | Japan | 1.41 × 1.41 |
30 | MPI-ESM1-2-HR | Germany | 0.94 × 0.94 |
31 | MPI-ESM1-2-LR | Germany | 1.9 × 1.9 |
32 | MRI-ESM2-0 | Japan | 1.13 × 1.13 |
33 | NESM3 | China | 1.9 × 1.9 |
34 | NorESM2-MM | Norway | 1.25 × 0.94 |
35 | SAM0-UNICON | Republic of Korea | 1.25 × 0.94 |
36 | TaiESM1 | Taiwan | 1.25 × 0.94 |
37 | UKESM1-0-LL | UK | 1.88 × 1.25 |
No. . | Model . | Country . | Resolution (longitude × latitude) . |
---|---|---|---|
1 | ACCESS-CM2 | Australia | 1.9 × 1.3 |
2 | ACCESS-ESM1-5 | Australia | 1.9 × 1.3 |
3 | BCC-CSM2-MR | China | 1.13 × 1.13 |
4 | BCC-ESM1 | China | 2.81 × 2.81 |
5 | CAMS-CSM1-0 | China | 1.13 × 1.13 |
6 | CanESM5 | Canada | 2.81 × 2.81 |
7 | CESM2 | USA | 1.25 × 0.94 |
8 | CESM2-WACCM | USA | 1.25 × 0.94 |
9 | CESM2-WACCM-FV2 | USA | 2.5 × 1.9 |
10 | CMCC-CM2-HR4 | Italy | 1.25 × 0.94 |
11 | CMCC-CM2-SR5 | Italy | 1.25 × 0.94 |
12 | CMCC-ESM2 | Italy | 1.3 × 0.9 |
13 | CNRM-CM6- 1 | France | 1.4 × 1.4 |
14 | CNRM-CM6-1-HR | France | 0.5 × 0.5 |
15 | CNRM-ESM2-1 | France | 1.4 × 1.4 |
16 | E3SM-1-0 | USA | 1.0 × 1.0 |
17 | EC-Earth3 | Europe | 0.7 × 0.7 |
18 | EC-Earth3-Veg | Europe | 0.7 × 0.7 |
19 | EC-Earth3-Veg-LR | Europe | 1.1 × 1.1 |
20 | FGOALS-f3-L | China | 1.25 × 1.0 |
21 | FGOALS-g3 | China | 2.0 × 2.5 |
22 | GFDL-CM4 | USA | 2.50 × 2.00 |
23 | GFDL-ESM4 | USA | 1.25 × 1.00 |
24 | HadGEM3-GC31-LL | UK | 1.9 × 1.3 |
25 | IITM-ESM | India | 1.9 × 1.9 |
26 | INM-CM4-8 | Russia | 2.0 × 1.5 |
27 | INM-CM5-0 | Russia | 2.0 × 1.5 |
28 | IPSL-CM6A-LR | France | 2.5 × 1.3 |
29 | MIROC6 | Japan | 1.41 × 1.41 |
30 | MPI-ESM1-2-HR | Germany | 0.94 × 0.94 |
31 | MPI-ESM1-2-LR | Germany | 1.9 × 1.9 |
32 | MRI-ESM2-0 | Japan | 1.13 × 1.13 |
33 | NESM3 | China | 1.9 × 1.9 |
34 | NorESM2-MM | Norway | 1.25 × 0.94 |
35 | SAM0-UNICON | Republic of Korea | 1.25 × 0.94 |
36 | TaiESM1 | Taiwan | 1.25 × 0.94 |
37 | UKESM1-0-LL | UK | 1.88 × 1.25 |
Evaluation methods
Quantitative statistical measures
The models were first evaluated by observing their ability to capture the annual rainfall cycle and mean seasonal climatology (FMAM and JJAS seasons). The abilities of the CMIP6 models to replicate the climate properties of the study area were assessed using spatial comparisons. Since the resolutions are different for most models and observations, all data were re-gridded to a common grid of 1° × 1° resolution using the bilinear interpolation method to confirm uniform resolution (Makula & Zhou 2022). The performance of each model in simulating rainfall was also evaluated using statistical metrics. Statistical metrics used for this study included mean bias (MB), root-mean-squared error (RMSE), and Pearson correlation coefficient (PCC) for the period 1981–2014. These quantitative metrics have been employed in many studies in Africa (Dyer et al. 2020; Akinsanola et al. 2021; Alaminie et al. 2021; Ngoma et al. 2021; Makula & Zhou 2022) and are summarized in Table 2 along with their equations.
Statistics . | Formula . | Range . | Unit . | Best value . |
---|---|---|---|---|
Mean bias | −∞ to ∞ | mm | 0 | |
Root-mean-squared error | 0 to ∞ | mm | 0 | |
Pearson correlation coefficient | −1 to 1 | none | 1 |
Statistics . | Formula . | Range . | Unit . | Best value . |
---|---|---|---|---|
Mean bias | −∞ to ∞ | mm | 0 | |
Root-mean-squared error | 0 to ∞ | mm | 0 | |
Pearson correlation coefficient | −1 to 1 | none | 1 |
where ‘r’ is the rainfall value acquired from ENACTS; the ‘m’ represents the assessed value obtained from the CMIP6 rainfall dataset; and are mean of ENACTS and models, respectively; σr and σm are the standard deviations of ENACTS and models, respectively; and ‘n’ is the number of data pairs.
Ranking of CMIP6
The spatial performance of CMIP6 models has been further assessed based on the Taylor diagrams and Taylor Skill Score (TSS) (Taylor 2001). Taylor diagrams are used to measure the performance of the models by indicating the spatial similarity of each model with respect to the observations in terms of the PCC, the RMSE, and the standard deviation (SD).
The linear trends of the best-performing models were further assessed using the ‘modifiedmk’ R package. Sen's Slope and P-values are determined using the ‘modifiedmk’ R package. Modified Mann–Kendall (m-MK) test is used to test the significance of the trends.
Generating ensembles of CMIP6
Identification and selection of an ensemble of better-performing GCMs are important in the projection of climate in order to reduce the uncertainty associated with future climate projection (Philip et al., 2018). In this study, seven different ensembles were produced based on RM values:
- I.
Ensembles mean of CMIP6 models with RM values ≥ 0.85
- II.
Ensembles mean of CMIP6 models with RM values ≥ 0.75
- III.
Ensembles mean of CMIP6 models with RM values ≥ 0.65
- IV.
Ensembles mean of CMIP6 models with RM values ≥ 0.55
- V.
Ensembles mean of CMIP6 models with RM values ≥ 0.45
- VI.
Ensembles mean of CMIP6 models with RM values ≥ 0.35
- VII.
Ensembles mean of all 37 CMIP6 models
Finally, seven different ensembles were compared and evaluated against ENACTS using the spatial distribution of rainfall, quantitative statistical measures, and TSS in order to select the optimum number of models for generating ensembles. The weighted average (WA) and simple average (SA) methods were employed to generate different ensembles based on RM values using Equations (3) and (4), respectively. The performance of ensembles was compared relative to two methods (WA and SA).
RESULTS
Mean annual and seasonal cycle of precipitation
Out of the 37 CMIP6 models, 13 (35%) captured the general pattern of the long rainy season (JJAS). On the other hand, 18 models (49%) simulated rainfall peaks in September and October rather than in August during the long rainy season. Thirteen models, except for FGOALS-f3-L and MPI-ESM1-2-LR, simulated more rainfall compared to a seasonal peak during the JJAS season. The pattern of the short rainy season (FMAM) was adequately replicated with reasonable skill by all the 37 CMIP6 models except for three models (CAMS-CSM1-0, CNRM-CM6-1, and CNRM-CM6-1-HR). However, seven models reduced the peak for FMAM while 27 models increased the peak. UKESM1-0-LL and MPI-ESM1-2-LR showed better JJAS seasonal rainfall patterns and peaks with a slight difference of +2 and −2.8 mm/month, respectively. IITM-ESM model represented the FMAM peak value accurately with a slight difference of 0.7 mm to observed peak rainfall, followed by NESM3, IPSL-CM6A-LR, and UKESM1-0-LL.
Spatial distribution of seasonal rainfall
The highest amount of FMAM rainfall was received over the Southwest and Southeast highlands of the country, followed by Central Ethiopia, but it decreased towards to North, Northeast, and Southeast parts of the country (Figure 4). CHIRPS showed better performance in capturing the spatial pattern of rainfall during the FMAM season compared to the JJAS season. However, most of the models could not generally capture such geographical distribution of observed rainfall over the country during the FMAM season compared to the JJAS season. Moreover, more than 30 CMIP6 models were incapable of capturing observed rainfall over the Southwest and Southeast highland parts of the country, where the highest FMAM rainfall is observed. Out of 37 models, 24 (65%) underestimated the maximum observed rainfall, with the highest underestimation by ACCESS-CM2 and FGOALS-f3-L models over Southwest and Southeast highland parts of the country. In contrast, six models (ACCESS-ESM1-5, BCC_CSM2-MR, CESM2-WACCM, CESM2, MIROC6, MRI-ESM2-0) displayed an overestimation of the observed rainfall, with the highest overestimation by MIROC6 models over above-mentioned areas during the FMAM season. BCC-ESM1 and NorESM2-MM were more capable of simulating the spatial variations of the observed FMAM rainfall than other models. On the contrary, ACCESS-CM2, ACCESS-ESM1-5, CAMS-CSM1-0, FGOALS-f3-L, NESM3, SAM0-UNICON, and UKESM1-0-LL were unable to accurately represent the spatial distribution of the observed FMAM rainfall, especially in high rainfall-receiving areas.
The models and CHIRPS were evaluated for bias of FMAM rainfall during 1981–2014 (Figure 6). Out of the 37 models, 21 (57%) showed a dry bias of observed rainfall from 80 to 110 mm over most parts of the country, with higher dry bias by 12 models (ACCESS-CM2, ACCESS-ESM1-5, CAMS-CSM1-0, CMCC-CM2-HR4, E3SM-1-0, EC-Earth3, EC-Earth3-Veg, EC-Earth3-Veg-LR, HadGEM3-GC31-LL, MPI-ESM1-2-HR, NESM3, and UKESM1-0-LL) in rainfall-receiving areas of the country (southern parts of the country) during the FMAM season (Figure 6). On the contrary, 10 models (ACCESS-ESM1-5, BCC-CSM2-MR, CESM2-WACCM, CESM2, CMCC-CM2-HR4, CMCC-CMS-SR5, CMCC-ESM2, MIROC6, MRI-ESM2-0, and NorESM2-MM) (27%) revealed a wet bias of seasonal rainfall in the range 50–197 mm over rainfall-receiving areas of the country. MIROC6 revealed the highest wet bias in most parts of the country with greater than 70 mm to less than 200 mm during the JJAS and FMAM seasons. On the contrary, BCC-ESM1, GFDL-CM4, GFDL-ESM4, INM-CM5-0, and NorESM2-MM performed relatively well in simulating the FMAM rainfall with a lower bias of less than 40 mm in most parts of the country.
Statistical analysis of CMIP6 models
The RMSE results show that CHIRPS exhibited a larger error in JJAS (31.74 mm) compared to mean monthly (22.89 mm) and FMAM (20.65 mm) (Figure 7(b)). The general distribution of RMSE exhibited different variations for different models and time frames. The simulated error in mean monthly rainfall was in the range of 11.88 mm (GFDL-ESM4) to 83.06 mm (CNRM-CM6-1). The simulated rainfall of RMSE distribution for all the models during JJAS and FMAM were in the range of 35.01 mm (GFDL-CM4) to 114.78 mm (CNRM-CM6-1) and 11 mm (GFDL-ESM4) to 74.7 mm (MIROC6), respectively. Most of the models revealed higher RMSE when simulating rainfall for the JJAS season compared to the FMAM season. Three and 30 CMIP6 models reflected higher RMSE values of greater than 50 mm in simulating observed rainfall during the FMAM and JJAS seasons, respectively. GFDL-CM4 showed a lower RMSE value of 11.9 mm in mean monthly rainfall while GFDL-ESM4 revealed low RMSE values of 11 and 35 mm for FMAM and JJAS seasons, respectively.
3.4. Model ranking
TSS was used in the ranking of the CMIP6 models for simulating mean monthly, FMAM, and JJAS rainfall over the country (Table 3). Large variation was found in the TSS of simulated rainfall among different models and time scales (Table 3). TSS values of the models for simulating mean monthly rainfall ranged from 0.27 to 0.89. For FMAM and JJAS seasons, the variations ranged from 0.35 to 0.8 and 0.17 to 0.91, respectively. Thirteen of the models depicted more than 0.8 TSS in mean monthly, 11 of the models in JJAS, and three of the models in FMAM. GFDL-CM4, GFDL-ESM4, NorESM2-MM, NESM3, and CESM2-WACCM were the best-performing models, with the highest TSS values of greater than 0.85 during mean monthly. GFD-LCM4, GFDL-ESM4, NESM3, CESM2, CESM2-WACCM, and Nor-ESM2-MM showed the best JJAS rainfall representation, with TSS exceeding 0.85. During the FMAM season, GFDL-ESM4, GFDL-CM4, and NorESM2-MM adequately replicated the observed seasonal rainfall, with higher TSS values of more than 0.80. CNRM-CM6-1 and CNRM-ESM2-1 were the least skilled models in simulating the observed mean monthly and JJAS rainfall, with lower TSS values of less than 0.4, while CNRM-CM6-1 and FGOALS-f3-L poorly simulated the FMAM rainfall patterns.
Models . | TSS . | RM value (RM_TSS) . | Overall ranks of models based on RM_TSS values . | ||
---|---|---|---|---|---|
Mean monthly . | JJAS . | FMAM . | |||
GFDL-CM4 | 0.89(1) | 0.91(1) | 0.85(2) | 0.96 | 1 |
GFDL-ESM4 | 0.87(2) | 0.91(2) | 0.88(1) | 0.95 | 2 |
NorESM2-MM | 0.87(3) | 0.87(7) | 0.83(3) | 0.88 | 3 |
CESM2 | 0.85(7) | 0.87(5) | 0.78(6) | 0.85 | 4 |
CESM2-WACCM | 0.86(5) | 0.87(6) | 0.76(7) | 0.84 | 5 |
NESM3 | 0.87(4) | 0.90(3) | 0.71(15) | 0.80 | 6 |
E3SM-1-0 | 0.85(6) | 0.90(4) | 0.72(13) | 0.79 | 7 |
MPI-ESM1-2-LR | 0.83(8) | 0.84(8) | 0.74(10) | 0.77 | 8 |
BCC-CSM2-MR | 0.81(12) | 0.79(13) | 0.77(9) | 0.69 | 9 |
EC-Earth3 | 0.81(10) | 0.81(11) | 0.68(18) | 0.65 | 10 |
EC-Earth3-Veg-LR | 0.81(11) | 0.83(9) | 0.67(23) | 0.61 | 11 |
TaiESM1 | 0.76(16) | 0.74(19) | 0.73(11) | 0.60 | 12 |
EC-Earth3-Veg | 0.80(13) | 0.81(12) | 0.67(22) | 0.59 | 13 |
MPI-ESM1-2-HR | 0.82(9) | 0.82(10) | 0.64(28) | 0.58 | 14 |
IPSL-CM6A-LR | 0.74(18) | 0.78(14) | 0.70(16) | 0.57 | 15 |
CMCC-CM2-SR5 | 0.79(14) | 0.77(15) | 0.67(20) | 0.56 | 16 |
CanESM5 | 0.73(22) | 0.71(22) | 0.77(8) | 0.54 | 17 |
INM-CM5-0 | 0.74(20) | 0.61(27) | 0.78(5) | 0.53 | 18 |
CMCC-ESM2 | 0.78(15) | 0.77(16) | 0.66(25) | 0.50 | 19 |
BCC-ESM1 | 0.70(26) | 0.55(30) | 0.79(4) | 0.46 | 20 |
HadGEM3-GC31-LL | 0.75(17) | 0.71(21) | 0.66(24) | 0.44 | 21 |
UKESM1-0-LL | 0.74(19) | 0.70(23) | 0.67(21) | 0.43 | 22 |
CESM2-WACCM-FV2 | 0.68(28) | 0.68(25) | 0.73(12) | 0.42 | 23 |
MRI-ESM2-0 | 0.73(21) | 0.76(17) | 0.65(27) | 0.39 | 24 |
SAM0-UNICON | 0.71(23) | 0.61(28) | 0.72(14) | 0.37 | 25 |
ACCESS-ESM1-5 | 0.70(27) | 0.76(18) | 0.65(26) | 0.36 | 26 |
CMCC-CM2-HR4 | 0.70(25) | 0.60(29) | 0.68(190 | 0.35 | 27 |
FGOALS-g3 | 0.71(24) | 0.72(20) | 0.53(32) | 0.32 | 28 |
INM-CM4-8 | 0.62(30) | 0.44(33) | 0.69(17) | 0.28 | 29 |
MIROC6 | 0.66(29) | 0.66(26) | 0.55(31) | 0.23 | 30 |
IITM-ESM | 0.58(32) | 0.68(24) | 0.44(35) | 0.18 | 31 |
CAMS-CSM1-0 | 0.59(31) | 0.53(31) | 0.52(34) | 0.14 | 32 |
CNRM-CM6-1-HR | 0.46(34) | 0.39(35) | 0.61(29) | 0.12 | 33 |
ACCESS-CM2 | 0.51(33) | 0.42(34) | 0.52(33) | 0.10 | 34 |
CNRM-ESM2-1 | 0.39(36) | 0.31(36) | 0.61(30) | 0.08 | 35 |
FGOALS-f3-L | 0.42(35) | 0.49(32) | 0.35(37) | 0.06 | 36 |
CNRM-CM6-1 | 0.27(37) | 0.17(37) | 0.38(36) | 0.01 | 37 |
Models . | TSS . | RM value (RM_TSS) . | Overall ranks of models based on RM_TSS values . | ||
---|---|---|---|---|---|
Mean monthly . | JJAS . | FMAM . | |||
GFDL-CM4 | 0.89(1) | 0.91(1) | 0.85(2) | 0.96 | 1 |
GFDL-ESM4 | 0.87(2) | 0.91(2) | 0.88(1) | 0.95 | 2 |
NorESM2-MM | 0.87(3) | 0.87(7) | 0.83(3) | 0.88 | 3 |
CESM2 | 0.85(7) | 0.87(5) | 0.78(6) | 0.85 | 4 |
CESM2-WACCM | 0.86(5) | 0.87(6) | 0.76(7) | 0.84 | 5 |
NESM3 | 0.87(4) | 0.90(3) | 0.71(15) | 0.80 | 6 |
E3SM-1-0 | 0.85(6) | 0.90(4) | 0.72(13) | 0.79 | 7 |
MPI-ESM1-2-LR | 0.83(8) | 0.84(8) | 0.74(10) | 0.77 | 8 |
BCC-CSM2-MR | 0.81(12) | 0.79(13) | 0.77(9) | 0.69 | 9 |
EC-Earth3 | 0.81(10) | 0.81(11) | 0.68(18) | 0.65 | 10 |
EC-Earth3-Veg-LR | 0.81(11) | 0.83(9) | 0.67(23) | 0.61 | 11 |
TaiESM1 | 0.76(16) | 0.74(19) | 0.73(11) | 0.60 | 12 |
EC-Earth3-Veg | 0.80(13) | 0.81(12) | 0.67(22) | 0.59 | 13 |
MPI-ESM1-2-HR | 0.82(9) | 0.82(10) | 0.64(28) | 0.58 | 14 |
IPSL-CM6A-LR | 0.74(18) | 0.78(14) | 0.70(16) | 0.57 | 15 |
CMCC-CM2-SR5 | 0.79(14) | 0.77(15) | 0.67(20) | 0.56 | 16 |
CanESM5 | 0.73(22) | 0.71(22) | 0.77(8) | 0.54 | 17 |
INM-CM5-0 | 0.74(20) | 0.61(27) | 0.78(5) | 0.53 | 18 |
CMCC-ESM2 | 0.78(15) | 0.77(16) | 0.66(25) | 0.50 | 19 |
BCC-ESM1 | 0.70(26) | 0.55(30) | 0.79(4) | 0.46 | 20 |
HadGEM3-GC31-LL | 0.75(17) | 0.71(21) | 0.66(24) | 0.44 | 21 |
UKESM1-0-LL | 0.74(19) | 0.70(23) | 0.67(21) | 0.43 | 22 |
CESM2-WACCM-FV2 | 0.68(28) | 0.68(25) | 0.73(12) | 0.42 | 23 |
MRI-ESM2-0 | 0.73(21) | 0.76(17) | 0.65(27) | 0.39 | 24 |
SAM0-UNICON | 0.71(23) | 0.61(28) | 0.72(14) | 0.37 | 25 |
ACCESS-ESM1-5 | 0.70(27) | 0.76(18) | 0.65(26) | 0.36 | 26 |
CMCC-CM2-HR4 | 0.70(25) | 0.60(29) | 0.68(190 | 0.35 | 27 |
FGOALS-g3 | 0.71(24) | 0.72(20) | 0.53(32) | 0.32 | 28 |
INM-CM4-8 | 0.62(30) | 0.44(33) | 0.69(17) | 0.28 | 29 |
MIROC6 | 0.66(29) | 0.66(26) | 0.55(31) | 0.23 | 30 |
IITM-ESM | 0.58(32) | 0.68(24) | 0.44(35) | 0.18 | 31 |
CAMS-CSM1-0 | 0.59(31) | 0.53(31) | 0.52(34) | 0.14 | 32 |
CNRM-CM6-1-HR | 0.46(34) | 0.39(35) | 0.61(29) | 0.12 | 33 |
ACCESS-CM2 | 0.51(33) | 0.42(34) | 0.52(33) | 0.10 | 34 |
CNRM-ESM2-1 | 0.39(36) | 0.31(36) | 0.61(30) | 0.08 | 35 |
FGOALS-f3-L | 0.42(35) | 0.49(32) | 0.35(37) | 0.06 | 36 |
CNRM-CM6-1 | 0.27(37) | 0.17(37) | 0.38(36) | 0.01 | 37 |
Numbers within brackets represent the rank of CMIP6 model for each timeframe with respect to TSS.
The overall ranks were assigned for each model based on the RM values (RM_TSS) to indicate their performance in representing the spatial characteristics of rainfall (Table 3). It is noted that the value of the RM_TSS metric varied between 0.01 (last rank) and 0.96 (first preferred) over the 37 ranks (Table 3). GFDL-CM4, GFDL-ESM4, NorESM2-MM, CESM2, CESM2-WACCM, and NESM3 were occupying the first six positions with RM_TSS values of greater than 0.80 and hence regarded as the most skilful CMIP6 models in terms of TSS. It was observed that GFDL-CM4 and GFDL-ESM4 were the only CMIP6 models which were able to perform well for all timeframes with respect to TSS. On the other hand, CNRM-ESM2-1 FGOALS-f3-L, CNRM-CM6-1, ACCESS-CM2, and CNRM-CM6–1–HR were the last five with RM_TSS values of less than 0.2 and can be regarded as the least performing CMIP6 models for Ethiopia. It is important to note that the finest resolution CMIP6 model (CNRM-CM6–1–HR) showed the least skill for simulating rainfall, signifying that reducing the resolution is not a guarantee for better performance.
Two CMIP6 GCMs (GFDL-CM4 and GFDL-ESM4), which were developed by the same modeling center (Geophysical Fluid Dynamics Laboratory (GFDL), USA), ranked first and second in the overall ranking of the models (Table 3), whereas most of the CMIP6 models showed different performances by ranks even though they were developed by the same modeling center. For example, CESM2, CESM2-WACCM, and CESM2-WACCM-FV2 which were developed by the same institution (National Center for Atmospheric Research, USA) ranked 4th, 5th, and 23rd, respectively. Similarly, differences in rank among models developed by the same modeling centers were also depicted between the models: INM-CM5-0 and INM-CM5-0; BCC-ESM1 and BCC-CSM2-MR; CMCC-CM2-HR4 and CMCC-CM2-SR5; ACCESS-CM2 and ACCESS-ESM1-5; CESM2, CESM2-WACCM and CESM2-WACCM-FV2; MPI-ESM1-2-LR and MPI-ESM1-2-HR; and FGOALS-f3-L and FGOALS-g3 (Table 3).
Comparison of different ensembles
WA and SA methods for generating ensembles were compared and presented in Tables A1, A2 and A3. All ensembles showed better improvement in bias (Table A1) and RMSE (Table A2) in WA relative to the SA method. They revealed minimum bias (<7 mm) and RMSE (<40 mm) in all timeframes in WA-based ensembles relative to SA-based ensembles. The highest TSS value (>0.75) is found in WA-based ensembles relative to SA-based ensembles in all timeframes (Table A3). Thus, WA generated better-performing ensembles compared to SA and was therefore used in comparing ensembles in this study.
Spatially, a high amount of rainfall over the Northwest and West parts of the country was well acquired by all ensembles during the JJAS season (Figure 3). Furthermore, ENSEMB_4 and ENSEMB_8 outperformed all the 37 CMIP6 individual models and other ensembles in capturing the spatial distribution of JJAS (Figure 3) and FMAM (Figure 4) seasonal rainfall. All seven ensembles showed relatively weaker simulation bias ranging between −50 and 50 mm and between −40 and 40 mm in most parts of the country during the JJAS (Figure 5) and FMAM seasons (Figure 6), respectively.
All ensembles showed wet and dry bias in mean monthly and JJAS seasons, respectively (Table 4). Except for ENSEMB_4 and ENSEMB_8, they showed dry bias during the FMAM season. The simulated rainfall of RMSE for all ensembles was in the range between 16.12 mm (ENSEMB_4) and 29.06 mm (ENSEMB_ALL) during the FMAM season and 25.09 mm (ENSEMB_4) and 50.41 mm (ENSEMB_ALL) during the JJAS season. ENSEMB_4 and ENSEMB_8 exhibited lower RMSE values of greater than 30 mm in simulating observed rainfall during the whole period. The error was larger during the JJAS season relative to the FMAM season in all ensembles. TSS values of the ensembles for simulating rainfall in mean monthly were in the range of 0.82–0.88, while during FMAM and JJAS seasons they were 0.78–0.84 and 0.81–0.91, respectively. ENSEMB_4 showed the best performance in representing the spatial distribution of seasonal rainfall with the highest TSS value of greater than 0.84 while ENSEMB_ALL revealed the lowest performance compared to other ensembles.
Ensembles . | TSS . | Bias . | RMSE . | ||||||
---|---|---|---|---|---|---|---|---|---|
Mean monthly . | JJAS . | FMAM . | Mean monthly . | JJAS . | FMAM . | Mean monthly . | JJAS . | FMAM . | |
ENSEMB_4 | 0.88 | 0.9 | 0.84 | 1.81 | −0.16 | 1.49 | 20.23 | 25.09 | 16.12 |
ENSEMB_8 | 0.88 | 0.91 | 0.82 | 0.59 | −2.43 | 0.01 | 22.45 | 27.55 | 19.47 |
ENSEMB_10 | 0.86 | 0.87 | 0.78 | −0.3 | −1.79 | −4.7 | 30.26 | 39.52 | 26.32 |
ENSEMB_16 | 0.86 | 0.88 | 0.78 | −0.72 | −1.08 | −6.96 | 30.52 | 38.93 | 27.65 |
ENSEMB_20 | 0.86 | 0.87 | 0.78 | −0.51 | −1.09 | −6.55 | 31.02 | 39.94 | 27.76 |
ENSEMB_27 | 0.85 | 0.86 | 0.79 | −0.56 | −1.73 | −6.84 | 31.7 | 40.51 | 27.88 |
ENSEMB_ALL | 0.82 | 0.81 | 0.78 | −1.95 | −6.64 | −6.85 | 36.74 | 50.41 | 29.06 |
Ensembles . | TSS . | Bias . | RMSE . | ||||||
---|---|---|---|---|---|---|---|---|---|
Mean monthly . | JJAS . | FMAM . | Mean monthly . | JJAS . | FMAM . | Mean monthly . | JJAS . | FMAM . | |
ENSEMB_4 | 0.88 | 0.9 | 0.84 | 1.81 | −0.16 | 1.49 | 20.23 | 25.09 | 16.12 |
ENSEMB_8 | 0.88 | 0.91 | 0.82 | 0.59 | −2.43 | 0.01 | 22.45 | 27.55 | 19.47 |
ENSEMB_10 | 0.86 | 0.87 | 0.78 | −0.3 | −1.79 | −4.7 | 30.26 | 39.52 | 26.32 |
ENSEMB_16 | 0.86 | 0.88 | 0.78 | −0.72 | −1.08 | −6.96 | 30.52 | 38.93 | 27.65 |
ENSEMB_20 | 0.86 | 0.87 | 0.78 | −0.51 | −1.09 | −6.55 | 31.02 | 39.94 | 27.76 |
ENSEMB_27 | 0.85 | 0.86 | 0.79 | −0.56 | −1.73 | −6.84 | 31.7 | 40.51 | 27.88 |
ENSEMB_ALL | 0.82 | 0.81 | 0.78 | −1.95 | −6.64 | −6.85 | 36.74 | 50.41 | 29.06 |
Table A1 presented the estimated value of RM and overall ranks of 37 models, and seven ensembles with respect to bias, RMSE, PCC, and TSS. Compared to individual CMIP6 models, ENSEMB_4, ENSEMB_10, ENSEMB_20, ENSEMB_8, ENSEMB_27, and ENSEMB_16 were occupying the first six positions in the RM_BIAS values, while NorESM2-MM, BCC-ESM1, CMCC-CM2-HR4, and GFDL-ESM4 attained ranks 7, 8, 9, and 10, respectively. ENSEMB_4, ENSEMB_8, GFDL-ESM4, ENSEMB_10, ENSEMB_16, ENSEMB_20, and ENSEMB_20 were occupying the first seven positions in the RM_RMSE values, while MIROC6, CNRM-ESM2-1, and CNRM-CM6-1 attained the last three ranks in the RM_BIAS and RM_ RMSE values. GFDL-CM4 was occupying the first position in the RM_PCC value followed by GFDL-ESM4, MPI-ESM1-2-LR, and ENSEMB_4, whereas CNRM-CM6-1, CNRM-ESM2-1, and CNRM-CM6-1-HR were occupying the last three positions with the lowest RM_PCC value in that order. Compared to individual CMIP6 models, Table A1 shows that ENSEMB_4, ENSEMB_8, ENSEMB_16, ENSEMB_20, ENSEMB_10, ENSEMB_27, and ENSEMB_ALL ranked the 3rd, 4th, 6th, 7th, 8th, 12th, and 15th positions in the RM_TSS values, respectively.
Evaluation of models with CHIRPS
The performance of the model was further evaluated using CHIRPS as the reference dataset with respect to TSS and the results are presented in Table 5. Thirteen of the models illustrated more than 0.8 TSS in mean monthly, nine models in JJAS, and eight models in FMAM. CanESM5 showed a better skill, with TSS value of 0.9 in simulating mean monthly rainfall followed by BCC-ESM1, GFDL-CM4, NESM3, and MPI-ESM1-2-LR. MPI-ESM1-2-LR revealed the best performance in reproducing the JJAS rainfall pattern, with the highest TSS value of 0.91 followed by NESM3, BCC-CSM2-MR, and MPI-ESM1-2-HR. CanESM5 showed the best skill in representing FMAM rainfall, with TSS value of 0.98 followed by BCC-ESM1, GFDL-CM4, GFDL-ESM4, and NESM3.
Models . | TSS . | RM value based on TSS (RM_TSS) . | Overall ranks of models based on RM_TSS value . | ||
---|---|---|---|---|---|
Mean monthly . | JJAS . | FMAM . | |||
NESM3 | 0.53 | 0.89 | 0.84 | 0.90 | 1 |
GFDL-ESM4 | 0.69 | 0.88 | 0.89 | 0.89 | 2 |
MPI-ESM1-2-LR | 0.60 | 0.91 | 0.80 | 0.88 | 3 |
GFDL-CM4 | 0.70 | 0.87 | 0.90 | 0.86 | 4 |
BCC_CSM2-MR | 0.90 | 0.89 | 0.76 | 0.78 | 5 |
E3SM-1-0 | 0.74 | 0.86 | 0.74 | 0.76 | 6 |
CanESM5 | 0.90 | 0.54 | 0.98 | 0.72 | 7 |
HadGEM3-GC31-LL | 0.68 | 0.75 | 0.77 | 0.70 | 8 |
EC-Earth3-Veg-LR | 0.71 | 0.84 | 0.75 | 0.69 | 9 |
UKESM1-0-LL | 0.34 | 0.72 | 0.80 | 0.69 | 9 |
BCC-ESM1 | 0.90 | 0.48 | 0.96 | 0.68 | 11 |
EC-Earth3 | 0.72 | 0.78 | 0.73 | 0.68 | 11 |
MPI-ESM1-2-HR | 0.61 | 0.89 | 0.67 | 0.67 | 13 |
EC-Earth3-Veg | 0.71 | 0.78 | 0.72 | 0.64 | 14 |
ACCESS-ESM1-5 | 0.90 | 0.80 | 0.82 | 0.61 | 15 |
NorESM2-MM | 0.50 | 0.70 | 0.77 | 0.59 | 16 |
INM-CM5-0 | 0.67 | 0.68 | 0.73 | 0.55 | 17 |
MRI-ESM2-0 | 0.54 | 0.76 | 0.71 | 0.52 | 18 |
CESM2 | 0.89 | 0.73 | 0.70 | 0.50 | 19 |
CESM2-WACCM | 0.88 | 0.71 | 0.70 | 0.48 | 20 |
CMCC-CM2-SR5 | 0.82 | 0.75 | 0.59 | 0.47 | 21 |
CMCC-ESM2 | 0.81 | 0.75 | 0.57 | 0.42 | 22 |
FGOALS-g3 | 0.70 | 0.68 | 0.57 | 0.37 | 23 |
SAM0-UNICON | 0.48 | 0.64 | 0.65 | 0.35 | 24 |
INM-CM4-8 | 0.67 | 0.53 | 0.71 | 0.34 | 25 |
TaiESM1 | 0.48 | 0.68 | 0.63 | 0.32 | 26 |
ACCESS-CM2 | 0.95 | 0.47 | 0.67 | 0.30 | 27 |
CESM2-WACCM-FV2 | 0.86 | 0.63 | 0.66 | 0.27 | 28 |
IITM-ESM | 0.68 | 0.71 | 0.46 | 0.25 | 29 |
MIROC6 | 0.64 | 0.67 | 0.53 | 0.25 | 29 |
CAMS-CSM1-0 | 0.90 | 0.66 | 0.53 | 0.23 | 31 |
CNRM-ESM2-1 | 0.77 | 0.49 | 0.62 | 0.19 | 32 |
CNRM-CM6-1-HR | 0.79 | 0.42 | 0.61 | 0.12 | 33 |
IPSL-CM6A-LR | 0.64 | 0.46 | 0.58 | 0.12 | 33 |
CMCC-CM2-HR4 | 0.85 | 0.46 | 0.52 | 0.10 | 35 |
FGOALS-f3-L | 0.70 | 0.58 | 0.40 | 0.10 | 35 |
CNRM-CM6-1 | 0.81 | 0.28 | 0.33 | 0.00 | 37 |
Models . | TSS . | RM value based on TSS (RM_TSS) . | Overall ranks of models based on RM_TSS value . | ||
---|---|---|---|---|---|
Mean monthly . | JJAS . | FMAM . | |||
NESM3 | 0.53 | 0.89 | 0.84 | 0.90 | 1 |
GFDL-ESM4 | 0.69 | 0.88 | 0.89 | 0.89 | 2 |
MPI-ESM1-2-LR | 0.60 | 0.91 | 0.80 | 0.88 | 3 |
GFDL-CM4 | 0.70 | 0.87 | 0.90 | 0.86 | 4 |
BCC_CSM2-MR | 0.90 | 0.89 | 0.76 | 0.78 | 5 |
E3SM-1-0 | 0.74 | 0.86 | 0.74 | 0.76 | 6 |
CanESM5 | 0.90 | 0.54 | 0.98 | 0.72 | 7 |
HadGEM3-GC31-LL | 0.68 | 0.75 | 0.77 | 0.70 | 8 |
EC-Earth3-Veg-LR | 0.71 | 0.84 | 0.75 | 0.69 | 9 |
UKESM1-0-LL | 0.34 | 0.72 | 0.80 | 0.69 | 9 |
BCC-ESM1 | 0.90 | 0.48 | 0.96 | 0.68 | 11 |
EC-Earth3 | 0.72 | 0.78 | 0.73 | 0.68 | 11 |
MPI-ESM1-2-HR | 0.61 | 0.89 | 0.67 | 0.67 | 13 |
EC-Earth3-Veg | 0.71 | 0.78 | 0.72 | 0.64 | 14 |
ACCESS-ESM1-5 | 0.90 | 0.80 | 0.82 | 0.61 | 15 |
NorESM2-MM | 0.50 | 0.70 | 0.77 | 0.59 | 16 |
INM-CM5-0 | 0.67 | 0.68 | 0.73 | 0.55 | 17 |
MRI-ESM2-0 | 0.54 | 0.76 | 0.71 | 0.52 | 18 |
CESM2 | 0.89 | 0.73 | 0.70 | 0.50 | 19 |
CESM2-WACCM | 0.88 | 0.71 | 0.70 | 0.48 | 20 |
CMCC-CM2-SR5 | 0.82 | 0.75 | 0.59 | 0.47 | 21 |
CMCC-ESM2 | 0.81 | 0.75 | 0.57 | 0.42 | 22 |
FGOALS-g3 | 0.70 | 0.68 | 0.57 | 0.37 | 23 |
SAM0-UNICON | 0.48 | 0.64 | 0.65 | 0.35 | 24 |
INM-CM4-8 | 0.67 | 0.53 | 0.71 | 0.34 | 25 |
TaiESM1 | 0.48 | 0.68 | 0.63 | 0.32 | 26 |
ACCESS-CM2 | 0.95 | 0.47 | 0.67 | 0.30 | 27 |
CESM2-WACCM-FV2 | 0.86 | 0.63 | 0.66 | 0.27 | 28 |
IITM-ESM | 0.68 | 0.71 | 0.46 | 0.25 | 29 |
MIROC6 | 0.64 | 0.67 | 0.53 | 0.25 | 29 |
CAMS-CSM1-0 | 0.90 | 0.66 | 0.53 | 0.23 | 31 |
CNRM-ESM2-1 | 0.77 | 0.49 | 0.62 | 0.19 | 32 |
CNRM-CM6-1-HR | 0.79 | 0.42 | 0.61 | 0.12 | 33 |
IPSL-CM6A-LR | 0.64 | 0.46 | 0.58 | 0.12 | 33 |
CMCC-CM2-HR4 | 0.85 | 0.46 | 0.52 | 0.10 | 35 |
FGOALS-f3-L | 0.70 | 0.58 | 0.40 | 0.10 | 35 |
CNRM-CM6-1 | 0.81 | 0.28 | 0.33 | 0.00 | 37 |
Based on comparative rating metrics of RM_TSS values (Table 5), six models (NESM3, GFDL-ESM4, MPI-ESM1-2-LR, GFDL-CM4, BCC_CSM2-MR, and E3SM-1-0) outperformed the other individual models against CHIRPS, with RM_TSS value greater than 0.8 in that order. CNRM-CM6-1 revealed the lowest RM_TSS value (0.00) preceded by FGOALS-f3-L, CMCC-CM2-HR4, IPSL-CM6A-LR, and CNRM-CM6-1-HR. Some models such as NESM3, GFDL-CM4, and GFDL-ESM4 showed higher RM_TSS value, with greater than 0.8 in both CHIRPS (Table 5) and ENACTS (Table 3) even though their position is different. The fine resolution model, CNRM-CM6-1-HR, showed the least performance in simulating rainfall from CHIRPS and ENACTS datasets.
Trend analysis
Table 6 presents mean rainfall, Sen's Slope, and P-values for the four top-ranking models including ENACTS and CHIRPS during 1981–2014. The four top-ranking models are GFDL-ESM4, GFDL-CM4, NorESM2-MM, and ENSEMB_4. ENACTS depicts the insignificant increasing trend in rainfall at the rate of 0.10 and 0.59 mm/year for mean monthly and FMAM seasons at a 5% significance level, respectively. On the contrary, the JJAS rains exhibit an insignificant trend with a decrease at the rate of −0.49 mm/year. The CHIRPS trend has a smaller magnitude than ENACTS trend in mean monthly and both seasons. According to CHIRPS, significant and insignificant positive trends were observed at the rate of 0.33 and 0.14 mm/year at JJAS and mean monthly rains, respectively. An insignificant negative trend at the rate of −0.11 mm/year was depicted at FMAM rains. All four models were consistent in showing positive insignificant trends over Ethiopia for mean monthly and both seasons and they have the lowest absolute trend magnitudes compared to ENACTS in both seasons. GFDL-ESM4, GFDL-CM4, NorESM2-MM, and ENSEMB_4 slightly overestimated the observed rainfall for JJAS rainfall while they moderately overestimated for FMAM and mean monthly.
Model . | Mean monthly . | JJAS . | FMAM . | ||||||
---|---|---|---|---|---|---|---|---|---|
RF mm . | Sen's Slope mm/year . | P-value . | RF mm . | Sen's Slope mm/year . | P-value . | RF mm . | Sen's Slope mm/year . | P-value . | |
ENACTS | 58.72 | 0.10 | 0.45 | 111.69 | −0.49 | 0.20 | 41.61 | 0.59 | 0.08 |
CHIRPS | 46.87 | 0.09 | 0.14 | 75.23 | 0.33 | 0.02* | 40.67 | −0.11 | 0.43 |
GFDL-CM4 | 77.24 | 0.09 | 0.29 | 114.89 | 0.27 | 0.15 | 70.15 | 0.11 | 0.64 |
GFDL-ESM4 | 79.21 | 0.06 | 0.79 | 113.51 | 0.09 | 0.61 | 63.27 | 0.08 | 0.81 |
NorESM2-MM | 73.27 | 0.16 | 0.18 | 119.17 | 0.17 | 0.53 | 62.85 | 0.23 | 0.51 |
ENSEMB_4 | 77.51 | 0.18 | 0.09 | 114.73 | 0.19 | 0.19 | 67.40 | 0.20 | 0.17 |
Model . | Mean monthly . | JJAS . | FMAM . | ||||||
---|---|---|---|---|---|---|---|---|---|
RF mm . | Sen's Slope mm/year . | P-value . | RF mm . | Sen's Slope mm/year . | P-value . | RF mm . | Sen's Slope mm/year . | P-value . | |
ENACTS | 58.72 | 0.10 | 0.45 | 111.69 | −0.49 | 0.20 | 41.61 | 0.59 | 0.08 |
CHIRPS | 46.87 | 0.09 | 0.14 | 75.23 | 0.33 | 0.02* | 40.67 | −0.11 | 0.43 |
GFDL-CM4 | 77.24 | 0.09 | 0.29 | 114.89 | 0.27 | 0.15 | 70.15 | 0.11 | 0.64 |
GFDL-ESM4 | 79.21 | 0.06 | 0.79 | 113.51 | 0.09 | 0.61 | 63.27 | 0.08 | 0.81 |
NorESM2-MM | 73.27 | 0.16 | 0.18 | 119.17 | 0.17 | 0.53 | 62.85 | 0.23 | 0.51 |
ENSEMB_4 | 77.51 | 0.18 | 0.09 | 114.73 | 0.19 | 0.19 | 67.40 | 0.20 | 0.17 |
Bold and * indicate significant trend at 5% significant level.
DISCUSSION
The main focus of this study was to evaluate the performance of CMIP6 models in reproducing the mean monthly and seasonal observed rainfall over Ethiopia from 1981 to 2014 using ENACTS data. ENACTS reveals a bimodal rainfall pattern with two peaks at the national scale. This is due to the north–south movement of the Intertropical Convergence Zone (ITCZ), or tropical rain belt which oscillates from 15°S to 15°N throughout the year (Nicholson 2018). More than 30 models showed greater performance in capturing the short rainy seasonal cycle (FMAM) relative to the long rainy seasonal cycle (JJAS). This is because the majority of the models (22) cannot simulate rainfall peaks in August during the JJAS season as they produce peak rainfall in September and October. This might be related to the large East African rainfall pattern that has two rainfall seasons in MAM and OND. However, the JJAS seasonal rainfall is of great significance to the rain-fed agricultural economy of the Northwest part of Ethiopia. This season is responsible for 80–95% of the production of food crops in the country (Gissila et al. 2004). This emphasizes the need for improved simulation of the long rainy seasonal cycle of JJAS rainfall.
Majority of CMIP6 models showed a better spatial correlation in the long rainy JJAS season relative to the short rainy FMAM season over most parts of the country. This may be due to higher rainfall variability during FMAM compared to JJAS (Bekele et al. 2017) which is often difficult to capture in climate models (Kamruzzaman et al. 2022). However, more than 15 and 20 CMIP6 models simulated dry bias over most parts of the country in JJAS and FMAM seasons, respectively. This was more pronounced in high topographic and rainfall-receiving areas (Northwest and Central parts of the country). The dry bias that is reported in the previous version of CMIP5 models over the study area (Jury 2015; Li et al. 2016; Dyer et al. 2020) has not been significantly improved in CMIP6 models. Similar dry bias is reflected in the long rainy season in CMIP5 and CMIP6 models in East Africa (Ayugi et al. 2021). This might be related to weak Hadley circulation (Chadwick et al. 2013) and associated with large-scale dynamics and southward shifts of the ITCZ over East Africa (Yang et al. 2015). Dyer et al. (2022) demonstrate that biases in CMIP5 and CMIP6 are related to Southern Ocean warm biases and Western Indian Ocean warm biases, respectively.
Moreover, the fine resolution model, CNRM-CM6-1-HR, highly underestimated the observed JJAS seasonal rainfall, whereas medium and coarse resolution models such as BCC-ESM1, E3SM-1-0, GFDL-CM4, and NESM3 showed a better simulation of JJAS rainfall over the study area. This indicates that the underestimation of observed rainfall in this study may not be improved by simply increasing the models' horizontal resolution. This is mainly due to the challenges related to the complex topography, especially in the East African region, which is related to model physical parameterization (Ayugi et al. 2021). Once the representation of microphysical parameterizations is improved, increases in model resolution are also likely to contribute to more accurate simulation of precipitation especially in mountainous regions (Champion et al. 2011).
Our results found a wide range of model skills in their ability to simulate observed rainfall at different time scales and statistical metrics, that lead to different rankings of the models. Based on the overall ranks of CMIP6 models using RM_TSS values (RM_TSS), GFDL-CM4 showed the best performance followed by GFDL-ESM4, NorESM2-MM, CESM2, CESM2-WACCM, NESM3, E3SM-1-0, MPI-ESM1-2-LR, and BCC-CSM2-MR in simulating rainfall over Ethiopia with respect to ENACTS (Table 5). Generally, the best-performing models, considering all the metrics values (RM_BIAS, RM_RMSE, RM_PCC, and RM_TTS), were observed for the GFDL-ESM4, NorESM2-MM, and ENSEMB_4, while the worst performance was observed for the CNRM-CM6-1, CNRM-ESM2-1, and CNRM-CM6-1-HR models. These results are consistent with other previous studies over Ethiopia (Dyer et al. 2020; Alaminie et al. 2021), Uganda (Ngoma et al. 2021), North Africa (Babaousmail et al. 2021), and East Africa (Makula & Zhou 2022). The failure of individual models to capture the observed spatial patterns of precipitation might be due to the complex dynamics of the temporal and spatial properties of precipitation over the East African region, differences in initial and boundary conditions (Makula & Zhou 2022), and poor representation of convective schemes and model parametrization (Ongoma et al., 2019).
CHIRPS showed the best performance in representing the reference observed rainfall (ENACTS) based on spatial analysis, quantitative statistics metrics, and TSS. Furthermore, CMIP6 models were compared with respect to CHIRPS based on TSS in order to see the performance of models with regard to different observed datasets. Some models such as NESM3, GFDL-CM4, and GFDL-ESM4 were the most skillful models while CNRM-CM6-1, CNRM-ESM2-1, and CNRM-CM6-1-HR were the poorest models in both CHIRPS and ENACTS. However, most models revealed different skills with respect to ENACTS and CHIRPS, even though some models showed similar performance in both gridded datasets. Thus, the relative performances of the CMIP6 models may depend on the choice of the reference gridded dataset as the different gridded observation datasets revealed substantial differences (Akinsanola et al. 2021; Faye & Akinsanola 2022).
Various studies use the different numbers of top-ranked GCMs for generating ensembles since there is no standard on the selection of the maximum number of the top-ranked of GCMs (Alaminie et al. 2021; Babaousmail et al. 2021; Makula & Zhou 2022; Ngoma et al. 2022). We found that the WA method showed better skill compared to the SA method in reducing errors as it gives more weight to higher performing models. Overall, all ensembles outperformed most individual models in representing temporal and spatial rainfall distribution. Six ensembles (ENSEMB_4, ENSEMB_8, ENSEMB_10, ENSEMB_16, ENSEMB_20, and ENSEMB_27) showed better performance in most of the statistical metrics relative to the weighted average of all models. Nevertheless, the weighted average of the four top-ranked models (ENSEMB_4) can best reproduce spatial and temporal rainfall variations while the weighted average of all models (ENSEMB_ALL) revealed the least skill. Moreover, ENSEMB_4 was the only ensemble that shows the most skills in all metrics compared to other ensembles and individual models. This indicated the importance of choosing models for climate change impact assessment for a given region. A similar result was observed in the earlier version of CMIP5 model over Ethiopia (Jury 2015; Dyer et al. 2020). Dyer et al. (2020) showed that the ensemble averages of 30 CMIP5 models revealed weak bimodal seasonality in the Awash basin due to large discrepancies in individual models. An average of better models may provide more confident guidance than an average of all models as it is prone to dry bias (Jury 2015). Kang et al. (2014) stated that the predictive capabilities of the ensembles do not significantly improve after a certain number of GCMs. A model that cannot properly represent the current climate system may not add any improvement to the ensemble.
CONCLUSION AND RECOMMENDATION
This study evaluated the performance of 37 CMIP6 models in reproducing observed rainfall at the national level for the period 1981–2014. Most of the CMIP6 models revealed lower performance over high topographic and rainfall-receiving areas of Northwest and Central parts of the country, while they performed well in low topographic and rainfall-receiving areas of the Southeast. This indicates that the effect of high topography on precipitation is still a challenge in climate modeling. Overall, CMIP6 models revealed a similarly poor capability with producing a dry bias simulation of rainfall as CMIP5 models over the study area. The four top-ranked models (GFDL-CM4 GFDL-ESM4, NorESM2-MM, and CESM2) and weighted average of those models (ENSEMB_4) can be applied in the projection of climate using CMIP6 under different Shared Socioeconomic Pathways (SSPs) for Ethiopia.
The findings of this study give important information to model producers and end-users of the datasets. More research and detailed studies are still crucial for improving the poor performance of models that lead to sources of dry biases since the future projection of its likelihood will be of great importance to the country's economy. This calls for the modeling groups to improve the simulation of the seasonal rainfall through a better understanding of the complex dynamics of the seasonal rainfall over Ethiopia. For climate model output users, caution is needed when employing the models in seasonal climate change studies as dry bias exists in seasonal rainfall simulation, and hence bias correction is important before employing models for future climate projection and impact studies.
Despite the robust findings of this study, there are some limitations. One of the limitations is the choice of the two seasonal rainfall (FMAM and JJAS) for different regions of the country since large seasonal variations are observed in different regions of Ethiopia. Thus, this study recommends evaluating the different CMIP6 models based on different homogeneous rainfall zones in the country to make further detailed studies of the performance of models. The results offer useful information about the different performances of CMIP6 models over the whole country and can serve as a reference for the new phase of climate models over the continent.
ACKNOWLEDGEMENTS
The authors acknowledge the provisions of a conducive working environment and structural and technological support for conducting the research by Water and Land Resource Center. We are grateful to the data centers: World Climate Research Programme and institutions that made their results available to the public. We acknowledge Ethiopia's National Meteorological and Hydrological Services (NMHS) for giving the gridded observational dataset (ENACTS). We also extend our gratitude to the anonymous reviewers for providing their valuable comments.
DATA AVAILABILITY STATEMENT
All relevant data are available from an online repository or repositories: http://wlrc-eth.org/wp-content/uploads/2023/08/CMIP6-Models-ENACTA-Data.zip.
CONFLICT OF INTEREST
The authors declare there is no conflict.