Differential Privacy Temporal Map Challenge: Sprint 1 (Prescreened Arena) Hosted By NIST PSCR

Competition complete · $29,000 in prizes

Woohoo! This competition has come to a close!

Many thanks to the participants for all of their hard work and commitment to using data for good!

Differential Privacy Temporal Map Challenge


The goal of this challenge is to develop algorithms that preserve data utility as much as possible while guaranteeing individual privacy is protected. The challenge features a series of coding sprints to apply differential privacy methods to temporal map data, where each record is tied to a location and each individual may contribute to a sequence of events.

Why

Large data sets containing personally identifiable information (PII) are exceptionally valuable resources for research and policy analysis in fields that support America's first responders, such as emergency planning and epidemiology.

Temporal map data is of particular interest to the public safety community for applications such as optimizing response times and personnel placement, natural disaster response, epidemic tracking, demographic analysis, and civic planning. Yet the ability to track a person's location over a period of time raises particularly serious privacy concerns.

The Solution

Sprint 1 featured data on 911 calls in Baltimore made over the course of a year. Participants needed to build de-identification algorithms for generating privatized datasets that reported monthly incident counts for each type of incident by neighborhood.
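To make the data product concrete, the released object is a table of incident counts per (neighborhood, month, incident type) cell. A minimal pandas sketch, using a hypothetical schema (the actual Baltimore 911 dataset's column names may differ):

```python
import pandas as pd

# Hypothetical schema for illustration -- not the real dataset's columns.
calls = pd.DataFrame({
    "neighborhood":  ["Fells Point", "Fells Point", "Canton"],
    "month":         ["2019-01", "2019-02", "2019-01"],
    "incident_type": ["DISORDERLY", "NARCOTICS", "DISORDERLY"],
})

# The released product: incident counts per (neighborhood, month,
# incident type) cell; a privatization algorithm adds noise to each cell.
counts = (calls
          .groupby(["neighborhood", "month", "incident_type"])
          .size()
          .rename("count")
          .reset_index())
```

Each cell count is what the de-identification algorithms must perturb before release.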

The temporal sequence aspect of the problem is especially challenging because a single individual may contribute to many events (up to 20). This increases the sensitivity of the counting queries and, with it, the amount of noise that must be added.
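The sensitivity/noise relationship can be seen directly in the standard Laplace mechanism, where the noise scale is sensitivity divided by the privacy parameter epsilon. A minimal sketch (not any competitor's actual algorithm):

```python
import numpy as np

def laplace_count(true_count, sensitivity, epsilon, rng=None):
    """Release a single count under epsilon-differential privacy.

    Laplace mechanism: noise scale = sensitivity / epsilon. If one
    person can contribute up to 20 events, every released count must
    absorb 20x the noise of a one-event-per-person count at the
    same epsilon.
    """
    rng = np.random.default_rng() if rng is None else rng
    return true_count + rng.laplace(loc=0.0, scale=sensitivity / epsilon)
```

This is why bounding each person's contribution (as the winners did) pays off: halving the sensitivity halves the noise at the same privacy level.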

The Results

Many techniques from the differential privacy (DP) literature are not designed to handle such high sensitivity. To overcome this, the winning competitors took a variety of creative approaches:

  • Subsampling: use at most k records from each person, reducing the sensitivity from 20 to k.
  • Preprocessing: shrink the data space by eliminating infrequent incident codes.
  • Post-processing: clean up the noisy output with optimization, smoothing, and denoising strategies (several clever approaches were used; see the solution descriptions in the post below).
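The subsampling step above can be sketched as a simple preprocessing pass. This is a minimal illustration with a hypothetical person-identifier column, not any winner's actual implementation:

```python
import pandas as pd

def cap_contributions(calls, person_col="caller_id", k=5, seed=0):
    """Keep at most k records per person, bounding sensitivity at k.

    Shuffling first makes the k kept records a uniform random sample
    of each person's events rather than simply their earliest ones.
    """
    shuffled = calls.sample(frac=1.0, random_state=seed)
    return shuffled.groupby(person_col).head(k)
```

After this pass, the counting queries have sensitivity k instead of 20, so the Laplace noise needed at a given epsilon shrinks proportionally.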

These solutions were evaluated using a "Pie Chart Evaluation Metric", designed to measure how faithfully each privatization algorithm preserves the most significant patterns in the data within each map/time segment. The first-place winner combined several of these techniques and tailored their algorithm to each required privacy level, ultimately achieving the highest utility score on the privatized data.