Responding to natural disasters, such as earthquakes, floods, and wildfires, is a laborious task performed by on-the-ground emergency responders and analysts. Social media has emerged as a low-latency data source to quickly understand disaster situations. While most studies on social media are limited to text, images offer more information for understanding disaster and incident scenes. However, no large-scale image datasets for incident detection exists. In this work, we present the Incidents Dataset, which contains 446,684 images annotated by humans that cover 43 incidents across a variety of scenes. We employ a baseline classification model that mitigates false-positive errors and we perform image filtering experiments on millions of social media images from Flickr and Twitter. Through these experiments, we show how the Incidents Dataset can be used to detect images with incidents in the wild.
The large-scale Incidents Dataset consists of 446,684 scene-centric class-positive images (annotated by humans) related to natural disasters, types of damage or specific events that can require human attention or assistance, like traffic jams or car accidents. We use the term incidents to refer to the 43 categories covered by our dataset. 49 places are used to add diversity to the images. An additional set of 697,464 class-negative images are part of the dataset and used to train our final model to mitigate false-positive predictions. See the paper for more details.
Below we show some example images from the dataset:
We examine how our incident detection model, trained with class-negative loss, performs in different real-world scenarios using millions of images collected from two popular social media platforms: Twitter and Flickr.
Upload an image to see the incident and place prediction!