• My GeoNetwork catalogue
  •   Search
  •   Map
  •   Sign in

Clusters of atmopsheric and oceanic variables and teleconnections that are candidate drivers for Tropical Cyclogenesis

This project provides the dataset employed for the development of a machine learning framework designed to detect and interpret Tropical Cyclone Genesis (TCG) activity across six major tropical ocean basins: North Atlantic, Northeast Pacific, Northwest Pacific, North Indian, South Indian, and South Pacific.

The dataset includes pre-processed environmental and climatic variables relevant to TCG dynamics, aggregated at the basin level with monthly resolution from January 1980 to December 2022. All data are derived from the ERA5 reanalysis dataset, with a spatial resolution of 2.5° × 2.5°. ERA5 reanalysis data were accessed through the DKRZ data pool, made available by DKRZ Data Management. The atmospheric and oceanic variables provided are absolute vorticity at 850 hPa, maximum potential intensity (MPI), mean sea level pressure (MSLP), relative humidity at 700 hPa, sea surface temperature (SST), relative vorticity at 850 hPa, vertical wind shear between 850 and 200 hPa, and vertical velocity at 500 hPa. Several of these variables are derived from ERA5 primary variables and represent physically meaningful diagnostics used widely in tropical cyclone research. To reduce spatial dimensionality, each variable has been clustered within each basin using the K-means algorithm, and the area-weighted mean value of each cluster is reported as a time series.

Additionally, the dataset includes monthly values of a suite of large-scale climate indices known to influence tropical cyclone activity: Atlantic Meridional Mode (AMM), Niño3.4, North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO), Pacific-North American Pattern (PNA), Southern Oscillation Index (SOI), Tropical Northern Atlantic Index (TNA), Tropical Southern Atlantic Index (TSA), and the Western Pacific Index (WP).

Lastly, for each basin, the dataset contains monthly counts of tropical cyclogenesis events, enabling evaluation of predictive models and interpretability methods.

This dataset is intended to support research in seasonal TCG detection, and it enables reproducibility of the methods developed in the associated study.

Simple

Date (Publication)
2025-10-22
Edition

1

Citation identifier
CLINT_TC
Citation identifier
doi:10.26050/WDCC/CLINT_TC
Principal investigator
  Politecnico di Milano - Filippo Dainelli
not filled
Author
  Politecnico di Milano - Filippo Dainelli
not filled
Publisher
  WDCC
Point of contact
  Politecnico di Milano - Filippo Dainelli
not filled
Name

NetCDF

Keywords
  • machine learning

Use limitation

CC-BY-4.0: Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/

Language

eng; USA

N
S
E
W
thumbnail


Begin date
1980-01-01
End date
2022-12-31
Supplemental Information

info:eu-repo/grantAgreement/EC/H2020/101003876/BE//CLImate INTelligence: Extreme events detection, attribution and adaptation design using machine learning

Distribution format
  • NetCDF ()

Transfer size
28
OnLine resource
https://www.wdc-climate.de/ui/entry?acronym=CLINT_TC
Hierarchy level
collection
Attribute description
sea_surface_temperature
Descriptor

Sea surface temperature is usually abbreviated as "SST". It is the temperature of sea water near the surface (including the part under sea-ice, if any), and not the skin temperature, whose standard name is surface_temperature. For the temperature of sea water at a particular depth or layer, a data variable of sea_water_temperature with a vertical coordinate axis should be used. [CF-Standard Name]; unit: degC

Attribute description
air_pressure_at_mean_sea_level
Descriptor

Air pressure at sea level is the quantity often abbreviated as MSLP or PMSL. Air pressure is the force per unit area which would be exerted when the moving gas molecules of which the air is composed strike a theoretical surface of any orientation. "Mean sea level" means the time mean of sea surface elevation at a given location over an arbitrary period sufficient to eliminate the tidal signals. [CF-Standard Name]; unit: hPa

Attribute description
eastward_derivative_of_eastward_wind
Descriptor

not filled; unit: m/s

Attribute description
eastward_derivative_of_northward_wind
Descriptor

not filled; unit: m/s

Attribute description
relative_humidity
Descriptor

relative_humidity [CF-Standard Name]; unit: %

Attribute description
number_of_cyclogenesis_events_in_basin
Descriptor

not filled; unit: 1

Attribute description
northward_derivative_of_northward_wind
Descriptor

not filled; unit: m/s

Attribute description
atmosphere_upward_relative_vorticity
Descriptor

atmosphere_upward_relative_vorticity [CF-Standard Name]; unit: s-1

Attribute description
northward_wind_shear
Descriptor

not filled; unit: m/s

Attribute description
eastward_wind_shear
Descriptor

not filled; unit: m/s

Attribute description
wind_speed_shear
Descriptor

not filled; unit: m/s

Attribute description
northward_derivative_of_eastward_wind
Descriptor

not filled; unit: m/s

Attribute description
atmosphere_upward_absolute_vorticity
Descriptor

not filled; unit: s-1

Attribute description
model_level_number
Descriptor

[CF-Standard Name]; unit: hPa

Attribute description
downward_air_velocity
Descriptor

not filled; unit: m/s

File identifier
wdc-climate.de:5311644 XML
Metadata language

eng; USA

Hierarchy level
collection
Hierarchy level name

CLINT_TC

Date stamp
2025-10-02T13:09:18
Metadata standard name

ISO 19115

Metadata standard version

ISO 19115-2:2009

Point of contact
  Politecnico di Milano - Filippo Dainelli
not filled
 
 

Overviews

Spatial extent

N
S
E
W
thumbnail


Keywords


Provided by

logo
Access to the portal
Read here the full details and access to the data.

Associated resources

Not available


  •   About
  •   Github
  •