TissueNet: Detect Lesions in Cervical Biopsies

The objective of this challenge is to detect the most severe epithelial lesions of the uterine cervix. The challenge brings together thousands of microscopic slides from different medical centers across France. #health

€25,000 in prizes
Completed oct 2020
540 joined

Submission format

Rather than submitting your predicted labels, you'll package everything needed to do inference and submit that for containerized execution. The runtime repository contains the complete specification for the runtime.

All submission for inference must run on Python 3.8. No other languages or versions of Python are supported.

What to submit

Your final submission should be a zip archive named with the extension .zip (for example, submission.zip). The root level of the submission.zip file must contain a main.py which performs inference on the test images and writes the predictions to a file named submission.csv in the same directory as main.py. You can see an example of this submission setup in the runtime repository.

Here's an example:

submission root directory
├── assets          # Example of configuration and weights for the trained model
│   ├── model.json
│   └── weights.h5
└── main.py         # Inference script

Note: be sure that when you unzip the submission main.py exists in the folder where you unzip. This file must be present at the root level of the zip archive. There should be no folder that contains it.

During code execution, your submission will be unzipped and run in our cloud compute cluster. The script will have access to the following directory structure:

submission root directory
├── data
│   ├── submission_format.csv
│   ├── test_metadata.csv
│   ├── ...
│   ├── <all of the test images as pyramidal tifs>
│   ├── ...
│   ├── JNOHPOSX.tif
│   └── VVGJOSTC.tif
├── main.py
└── <additional assets included in the submission archive>

Submission checklist

  • Submission includes main.py in the root directory of the zip. There can be extra files with more code that is called (see assets folder in the example).
  • Submission contains any model weights that need to be loaded. There will be no network access.
  • Script loads the data for inference from the data folder in the root directory. All images for inference are in the root level of the data folder. This folder is read-only.
  • Script writes submission.csv to the root directory when inference is finished. This file must match the submission format exactly.
  • Use the versions of submission_format.csv and test_metadata.csv from the data folder provided to the container. Do not include these files with your submission, and do not read them from other locations. We need to be able to run your submission on other images by replacing these metadata files. See the problem description for details on the test set metadata.

Testing your submission locally

If you'd like to replicate how your submission will run online, you can test the submission locally before submitting. This is a great way to work out any bugs locally and ensure that your model will run quickly enough.

Submission format

The format for the submission file with predictions is a CSV file with an index column called filename and a column for each possible class. Your predictions must be integers. A 1 in a column represents a positive prediction for that class. As each image only has one true class, all other columns for that row should be 0. Your submission.csv must match this format exactly.

For example, if your predictions for the first five rows look like this:

filename 0 1 2 3
VVGJOSTC.tif 1 0 0 0
JNOHPOSX.tif 0 1 0 0
D3DM9NWP.tif 0 0 1 0
3HC8TO0L.tif 1 0 0 0
3YLJ2UIK.tif 0 0 0 1

Your submission.csv file would look like:

filename,0,1,2,3
VVGJOSTC.tif,1,0,0,0
JNOHPOSX.tif,0,1,0,0
D3DM9NWP.tif,0,0,1,0
3HC8TO0L.tif,1,0,0,0
3YLJ2UIK.tif,0,0,0,1

Runtime

Your code is executed within a container that is defined in our runtime repository. The limits are as follows:

  • Your submission must be written in Python and run Python 3.8 using the packages defined in the runtime repository.
  • The submission must complete execution in 16 hours or less. We expect most submissions will complete much more quickly and computation time per participant will be monitored to prevent abuse.
  • The container runtime has access to a single GPU. All of your code should run within the GPU environments in the container, even if actual computation happens on the CPU. (CPU environments are provided within the container for local debugging only).
  • The container has access to 5 vCPUs powered by an Intel Xeon E5-2690 chip and 48GB RAM.
  • The container has 1 Tesla K80 GPU with 12GB of memory.
  • The container will not have network access. All necessary files (code and model assets) must be included in your submission.
  • The container execution will not have root access to the filesystem.

The GPUs for executing your inference code are a shared resource across competitors. We request you are conscientious in your use of them. Please add progress information to your logs and cancel jobs that will run longer than the time limit. Canceled jobs won't count against your submission limit, and this means more available resources to score submissions that will complete on time.

As discussed, one of the hardest challenges when working with WSI data is that the data is extremely large and the relevant areas are relatively small. One of your goals should be to make model inference execute as quickly and efficiently as possible. Progress here will be an enormous contribution to the use of ML in Pathology.

Requesting package installations

Since the docker container will not have network access, all packages must be pre-installed. We are happy to add packages as long as they do not conflict and can build successfully.

Packages must be available through conda for Python 3.8.

To request an additional package be added to the docker image, follow the instructions in the runtime repository.