Wind-dependent Variables: Predict Wind Speeds of Tropical Storms

Hosted by Radiant Earth Foundation

Radiant MLHub

Radiant MLHub is an open repository of training data for geospatial machine learning. It hosts open training datasets generated by Radiant Earth Foundation's team as well as training data catalogs contributed by Radiant Earth's partners. All the data are hosted in a cloud-friendly format, and the API allows for easy discovery and download. Anyone can use Radiant MLHub to access existing training data or to share their own training data for broader impact.

You can use the Radiant MLHub API to easily access the data for this competition in your programming environment. Alternatively, you can download the data locally by navigating to the Data Download tab.

Data structure

The Radiant MLHub API is a STAC-compliant API that serves metadata about label items and source imagery, along with links to download them.

A SpatioTemporal Asset Catalog (STAC) is a standardized specification for organizing metadata, making it easy to search for images or labels that match spatial, temporal, or other criteria. At the root level of the STAC API is a list of collections of items. In the Radiant MLHub API, each collection contains items for either the source imagery or the labels of a dataset. Each item describes a piece of source imagery or a set of labels and links to the downloadable assets related to it. Properties found in these item descriptions include spatial extent, temporal extent, band descriptions in the case of optical imagery, label types and label properties in the case of labels, and other information such as Digital Object Identifiers (DOIs) and example citations.

To learn more about the Radiant MLHub API, check out this blog post on Accessing and Downloading Training Data on the Radiant MLHub API.
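To make the item structure described above concrete, here is a minimal sketch that summarizes an item once it has been parsed into a Python dictionary. The top-level field names (id, collection, properties, links, assets) follow the STAC Item specification, and "label:type" comes from the STAC label extension. The example item itself, including its id, datetime, and URLs, is made up for illustration and is not taken from the competition data.

```python
from typing import Any, Dict


def summarize_item(item: Dict[str, Any]) -> Dict[str, Any]:
    """Pull the main pieces of a STAC item into a small summary dict."""
    properties = item.get("properties", {})
    return {
        "id": item.get("id"),
        "collection": item.get("collection"),
        "datetime": properties.get("datetime"),
        "property_names": sorted(properties),
        # Distinct relation types found in the item's links array.
        "link_rels": sorted(
            {link["rel"] for link in item.get("links", []) if "rel" in link}
        ),
        # Names of the downloadable assets attached to the item.
        "asset_keys": sorted(item.get("assets", {})),
    }


# Hypothetical, heavily trimmed label item. Every value below is invented.
example_item = {
    "type": "Feature",
    "id": "hypothetical_label_item",
    "collection": "nasa_tropical_storm_competition_train_labels",
    "properties": {"datetime": "2000-01-01T00:00:00Z", "label:type": "vector"},
    "links": [{"rel": "source", "href": "https://example.invalid/source-item"}],
    "assets": {"labels": {"href": "https://example.invalid/labels.json"}},
}

for field, value in summarize_item(example_item).items():
    print(f"{field}: {value}")
```

Real items returned by the API will carry more fields than this trimmed example, so treat the summary as a starting point for exploring an item rather than a complete schema.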

Authenticating with the API

To access the Radiant MLHub API, you must authenticate with an API key. Every request to the API must include a query parameter named "key" whose value is your API key. For example, a request to the /collections endpoint would end in /collections?key= followed by your API key.

You can obtain an API key by creating a free account on Radiant Earth Foundation's dashboard and navigating to the “API Keys” tab.
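The query-parameter authentication described above can be sketched with the Python standard library alone. This is only an illustration: the helper name with_api_key is hypothetical, and API_ROOT is a placeholder, since the real API root URL is not given on this page. Substitute the root URL from the Radiant MLHub documentation and your own key.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_api_key(url: str, api_key: str) -> str:
    """Return `url` with its `key` query parameter set to `api_key`.

    Other query parameters are preserved; an existing `key` is replaced.
    """
    parts = urlsplit(url)
    query = [
        (name, value)
        for name, value in parse_qsl(parts.query, keep_blank_values=True)
        if name != "key"
    ]
    query.append(("key", api_key))
    return urlunsplit(parts._replace(query=urlencode(query)))


# Placeholder values (assumptions, not the real API root or a real key).
API_ROOT = "https://example.invalid/mlhub/v1"
API_KEY = "YOUR_API_KEY"

print(with_api_key(f"{API_ROOT}/collections", API_KEY))
```

The resulting URL can be passed to any HTTP client. Building it with urlencode rather than string concatenation keeps the key correctly escaped and avoids duplicating the parameter when the URL already carries a query string.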

Accessing the data

There are three collections that contain data for this competition:

  • nasa_tropical_storm_competition_train_source contains the training images (JPEG files) and metadata (JSON files, one per image)
  • nasa_tropical_storm_competition_train_labels contains the training labels (JSON files, one per image)
  • nasa_tropical_storm_competition_test_source contains the test images (JPEG files) and metadata (JSON files, one per image)

To download the tropical storm images, metadata, and labels, first crawl the nasa_tropical_storm_competition_train_labels collection and download the labels file located within each label item. Then, to download the associated image, loop through the label item's links array and follow the links to the source imagery items, which have a "rel" type of "source". Within the assets dictionary of each source imagery item you will find a link to the image (under the key "image") and a link to the metadata JSON (under the key "features").

An example notebook which implements these steps can be found here.
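As a rough illustration of the crawl described above (it is not the example notebook's code), the sketch below walks from one label item to its source imagery item and collects the download links. The "source" rel type and the "image" and "features" asset keys come from the text above. The "labels" asset key, the helper names, and the fake items are assumptions made for this example, and fetch_json stands in for whatever function you use to make an authenticated GET request and parse the JSON response.

```python
from typing import Callable, Dict, List

IMAGE_KEY = "image"        # asset key for the storm image
FEATURES_KEY = "features"  # asset key for the metadata JSON


def source_item_urls(label_item: dict) -> List[str]:
    """Return the href of every link whose "rel" is "source"."""
    return [
        link["href"]
        for link in label_item.get("links", [])
        if link.get("rel") == "source" and "href" in link
    ]


def source_asset_urls(source_item: dict) -> Dict[str, str]:
    """Return the image and metadata links found in a source item's assets."""
    assets = source_item.get("assets", {})
    return {
        key: assets[key]["href"]
        for key in (IMAGE_KEY, FEATURES_KEY)
        if key in assets
    }


def collect_downloads(
    label_item: dict,
    fetch_json: Callable[[str], dict],
    label_asset_key: str = "labels",  # assumed asset key for the labels file
) -> Dict[str, str]:
    """Gather the label, image, and metadata links for one label item."""
    downloads: Dict[str, str] = {}
    label_asset = label_item.get("assets", {}).get(label_asset_key)
    if label_asset is not None:
        downloads["labels"] = label_asset["href"]
    # Follow each "source" link to its source imagery item.
    for url in source_item_urls(label_item):
        downloads.update(source_asset_urls(fetch_json(url)))
    return downloads


# Fake in-memory items so the sketch runs without network access.
fake_source_item = {
    "assets": {
        "image": {"href": "https://example.invalid/abc/image.jpg"},
        "features": {"href": "https://example.invalid/abc/features.json"},
    }
}
fake_label_item = {
    "assets": {"labels": {"href": "https://example.invalid/abc/labels.json"}},
    "links": [
        {"rel": "collection", "href": "https://example.invalid/collection"},
        {"rel": "source", "href": "https://example.invalid/source/abc"},
    ],
}
fake_api = {"https://example.invalid/source/abc": fake_source_item}

print(collect_downloads(fake_label_item, fake_api.__getitem__))
```

The sketch assumes one source item per label item, matching the one-label-per-image layout of the collections. If a label item linked to several source items, later links would overwrite earlier ones in the returned dictionary. Link hrefs may also be relative in a real catalog, in which case they need to be resolved against the item's own URL before fetching.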