AI Training Datasets
The E3SM project and the Allen Institute for AI (Ai2) work together to develop datasets for AI and machine learning applications. Ai2 has processed E3SMv2 and E3SMv3 simulation output to make it publicly accessible and easier to use for research.
Dataset Details
E3SMv2: 73-year EAMv2 simulation (F2010, perpetual 2010 forcing, repeating annual SST cycle from the 2005-2014 average). 6-hourly outputs, split into 42 years for training, 10 years for validation, and 10 years for testing. For more details, see Duncan et al. 2024.
E3SMv3: 51-year EAMv3 AMIP-style simulation (1970-2020, F2010 with AMIP SSTs, constant 2010 CO2). Includes multiple ENSO cycles and a global warming trend. For more details, see Wu et al. 2025.
SCREAMv1: Simple Cloud-Resolving E3SM Atmosphere Model version 1 training data (coming soon)
Tip
Check the archive_content text file to see which files are included in each tar archive. You can then download only the files you need.
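As a rough illustration of the tip above, the following Python sketch filters a file listing by a glob pattern to decide which files are worth downloading. The layout of the archive_content file is not described here, so the sketch assumes one entry per line with the file path as the last whitespace-separated token (which also covers `tar -tvf` style listings, provided paths contain no spaces). The function name `select_files` and the example file names are hypothetical.

```python
import fnmatch


def select_files(manifest_text, pattern):
    """Return paths from a listing whose names match a glob pattern."""
    selected = []
    for line in manifest_text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        # Assumption: the file path is the last token on each line.
        path = tokens[-1]
        # fnmatchcase keeps matching case-sensitive on every platform.
        if fnmatch.fnmatchcase(path, pattern):
            selected.append(path)
    return selected


# Hypothetical listing, for illustration only; real file names will differ.
example_manifest = """\
train/1970010100.nc
train/1970010106.nc
validation/2011010100.nc
"""

print(select_files(example_manifest, "validation/*"))
```

In practice, the text of the real archive_content file would be read from disk and passed in place of `example_manifest`, with the pattern adjusted to the actual naming scheme.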