Clusters of atmopsheric and oceanic variables and teleconnections that are candidate drivers for Tropical Cyclogenesis
This project provides the dataset employed for the development of a machine learning framework designed to detect and interpret Tropical Cyclone Genesis (TCG) activity across six major tropical ocean basins: North Atlantic, Northeast Pacific, Northwest Pacific, North Indian, South Indian, and South Pacific.
The dataset includes pre-processed environmental and climatic variables relevant to TCG dynamics, aggregated at the basin level with monthly resolution from January 1980 to December 2022. All data are derived from the ERA5 reanalysis dataset, with a spatial resolution of 2.5° × 2.5°. ERA5 reanalysis data were accessed through the DKRZ data pool, made available by DKRZ Data Management. The atmospheric and oceanic variables provided are absolute vorticity at 850 hPa, maximum potential intensity (MPI), mean sea level pressure (MSLP), relative humidity at 700 hPa, sea surface temperature (SST), relative vorticity at 850 hPa, vertical wind shear between 850 and 200 hPa, and vertical velocity at 500 hPa. Several of these variables are derived from ERA5 primary variables and represent physically meaningful diagnostics used widely in tropical cyclone research. To reduce spatial dimensionality, each variable has been clustered within each basin using the K-means algorithm, and the area-weighted mean value of each cluster is reported as a time series.
Additionally, the dataset includes monthly values of a suite of large-scale climate indices known to influence tropical cyclone activity: Atlantic Meridional Mode (AMM), Niño3.4, North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO), Pacific-North American Pattern (PNA), Southern Oscillation Index (SOI), Tropical Northern Atlantic Index (TNA), Tropical Southern Atlantic Index (TSA), and the Western Pacific Index (WP).
Lastly, for each basin, the dataset contains monthly counts of tropical cyclogenesis events, enabling evaluation of predictive models and interpretability methods.
This dataset is intended to support research in seasonal TCG detection, and it enables reproducibility of the methods developed in the associated study.
Simple
- Date (Publication)
- 2025-10-22
- Edition
-
1
- Citation identifier
- CLINT_TC
- Citation identifier
- doi:10.26050/WDCC/CLINT_TC
- Name
-
NetCDF
- Keywords
-
-
machine learning
-
- Use limitation
-
CC-BY-4.0: Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/
- Language
-
eng; USA
- Begin date
- 1980-01-01
- End date
- 2022-12-31
- Supplemental Information
-
info:eu-repo/grantAgreement/EC/H2020/101003876/BE//CLImate INTelligence: Extreme events detection, attribution and adaptation design using machine learning
- Distribution format
-
-
NetCDF
()
-
NetCDF
()
- Transfer size
- 28
- OnLine resource
- https://www.wdc-climate.de/ui/entry?acronym=CLINT_TC
- Hierarchy level
- collection
- Attribute description
- sea_surface_temperature
- Descriptor
-
Sea surface temperature is usually abbreviated as "SST". It is the temperature of sea water near the surface (including the part under sea-ice, if any), and not the skin temperature, whose standard name is surface_temperature. For the temperature of sea water at a particular depth or layer, a data variable of sea_water_temperature with a vertical coordinate axis should be used. [CF-Standard Name]; unit: degC
- Attribute description
- air_pressure_at_mean_sea_level
- Descriptor
-
Air pressure at sea level is the quantity often abbreviated as MSLP or PMSL. Air pressure is the force per unit area which would be exerted when the moving gas molecules of which the air is composed strike a theoretical surface of any orientation. "Mean sea level" means the time mean of sea surface elevation at a given location over an arbitrary period sufficient to eliminate the tidal signals. [CF-Standard Name]; unit: hPa
- Attribute description
- eastward_derivative_of_eastward_wind
- Descriptor
-
not filled; unit: m/s
- Attribute description
- eastward_derivative_of_northward_wind
- Descriptor
-
not filled; unit: m/s
- Attribute description
- relative_humidity
- Descriptor
-
relative_humidity [CF-Standard Name]; unit: %
- Attribute description
- number_of_cyclogenesis_events_in_basin
- Descriptor
-
not filled; unit: 1
- Attribute description
- northward_derivative_of_northward_wind
- Descriptor
-
not filled; unit: m/s
- Attribute description
- atmosphere_upward_relative_vorticity
- Descriptor
-
atmosphere_upward_relative_vorticity [CF-Standard Name]; unit: s-1
- Attribute description
- northward_wind_shear
- Descriptor
-
not filled; unit: m/s
- Attribute description
- eastward_wind_shear
- Descriptor
-
not filled; unit: m/s
- Attribute description
- wind_speed_shear
- Descriptor
-
not filled; unit: m/s
- Attribute description
- northward_derivative_of_eastward_wind
- Descriptor
-
not filled; unit: m/s
- Attribute description
- atmosphere_upward_absolute_vorticity
- Descriptor
-
not filled; unit: s-1
- Attribute description
- model_level_number
- Descriptor
-
[CF-Standard Name]; unit: hPa
- Attribute description
- downward_air_velocity
- Descriptor
-
not filled; unit: m/s
- File identifier
- wdc-climate.de:5311644 XML
- Metadata language
-
eng; USA
- Hierarchy level
- collection
- Hierarchy level name
-
CLINT_TC
- Date stamp
- 2025-10-02T13:09:18
- Metadata standard name
-
ISO 19115
- Metadata standard version
-
ISO 19115-2:2009
Overviews
Spatial extent
Provided by
My GeoNetwork catalogue