Social Sensing and Big Data Computing for Rapid Flood Mapping

Rapid flood mapping is crucial for giving emergency responders better situational awareness during an event. However, traditional approaches normally require months of processing and quality assurance before the final flood extent and water depth are mapped and the losses and damages tallied. For example, the official flood-inundation maps for the 2015 floods in South Carolina were first released on February 22, 2016 by the United States Geological Survey (USGS), four months after the flooding event.

Using the 2015 South Carolina floods as the study case, we developed a novel approach to mapping floods in near-real time by leveraging Twitter data in geospatial processes. Specifically, we first analyzed the spatiotemporal patterns of flood-related tweets using quantitative methods to better understand how Twitter activity relates to flood phenomena. We then developed a kernel-based flood mapping model that maps the flooding possibility for the study area based on water height points derived from tweets and stream gauges.

Spatiotemporal Patterns of Flood-related Tweets

By analyzing the number of flood-related tweets and stream gauge height within the study area, we found that people tend to tweet more about floods when the flooding magnitude increases during the flooding event.

The figure below shows the temporal pattern analysis: (a) Number of flood-related tweets and daily maximum gauge height from the gauge station 02169500 during the flood period; (b) Cross-correlation analysis result between the two variables.
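
The cross-correlation in panel (b) can be illustrated with a minimal sketch. The function name, the sign convention for the lag, and the daily values below are assumptions made for illustration only; they are not the study's data or code.

```python
import numpy as np

def lagged_cross_correlation(x, y, max_lag):
    """Pearson correlation between x[t] and y[t + lag] for each lag.

    A positive lag means y trails x by that many time steps.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim != 1 or x.shape != y.shape:
        raise ValueError("x and y must be 1-D series of equal length")
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = x[:len(x) - lag], y[lag:]
        else:
            a, b = x[-lag:], y[:len(y) + lag]
        result[lag] = float(np.corrcoef(a, b)[0, 1])
    return result

# Made-up daily series, NOT the observations from gauge station 02169500.
gauge_height = np.array([2.0, 2.5, 4.0, 7.5, 9.0, 8.0, 6.0, 4.5, 3.5, 3.0])
tweet_count = np.array([5, 8, 20, 60, 90, 70, 40, 25, 15, 10])

ccf = lagged_cross_correlation(gauge_height, tweet_count, max_lag=3)
for lag, r in ccf.items():
    print(f"lag {lag:+d} days: r = {r:.3f}")
```

A strong correlation at or near lag zero would be consistent with tweet volume rising and falling together with the gauge height.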

For the spatial pattern, we found that people closer to the flooded area tend to tweet more about the flood. The figure below illustrates the spatial pattern analysis: (a) Flood-related tweets and the inundated area within the study area; (b) Percentage of flood-related tweets at different distances from the inundated area.
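
The distance breakdown in panel (b) amounts to binning tweets by their distance to the inundated area. The sketch below assumes those distances have already been computed in a GIS; the band edges and distance values are hypothetical.

```python
import numpy as np

def percent_by_distance(distances_km, band_edges_km):
    """Share of tweets (in percent) falling in each distance band.

    Distances outside the outermost edges are ignored by np.histogram.
    """
    distances = np.asarray(distances_km, dtype=float)
    counts, _ = np.histogram(distances, bins=band_edges_km)
    total = counts.sum()
    if total == 0:
        return np.zeros(len(counts))
    return 100.0 * counts / total

# Hypothetical distances from each tweet to the nearest inundated cell.
distances_km = [0.0, 0.2, 0.4, 0.9, 1.1, 1.5, 2.5, 3.0, 4.2, 7.5]
band_edges_km = [0, 1, 2, 5, 10]

pct = percent_by_distance(distances_km, band_edges_km)
for lo, hi, p in zip(band_edges_km[:-1], band_edges_km[1:], pct):
    print(f"{lo}-{hi} km: {p:.0f}%")
```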

Kernel-based Flood Mapping Model

The model takes the following data as inputs: water height points (WHPs), flood-related tweets, and a digital elevation model (DEM). Flood-related tweets are used to create a density surface, which serves as a weighting factor based on the identified spatial patterns of Twitter activity. The model consists of two steps: 1) generating a Flood Possibility Index (FPI) surface for each WHP using a kernel-based approach that considers distance and elevation, and 2) generating the final FPI map from all FPI surfaces.
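
The two steps can be sketched roughly as follows. This is an illustration under stated assumptions, not the published model: the Gaussian distance kernel, the exponential elevation penalty, the cell-wise maximum used to merge surfaces, the 0.5 to 1 tweet-density weight, and all parameter names and values are hypothetical choices.

```python
import numpy as np

def fpi_surface(dem, whp_row, whp_col, water_height, bandwidth_cells, elev_scale):
    """Step 1 (assumed form): FPI surface for a single water height point."""
    rows, cols = np.indices(dem.shape)
    dist = np.hypot(rows - whp_row, cols - whp_col)
    # Assumed Gaussian decay with distance from the WHP.
    distance_term = np.exp(-0.5 * (dist / bandwidth_cells) ** 2)
    # Water surface elevation at the WHP: ground elevation plus water height.
    water_surface = dem[whp_row, whp_col] + water_height
    # Cells at or below the water surface are not penalised;
    # higher cells decay exponentially (assumed form).
    excess = np.clip(dem - water_surface, 0.0, None)
    elevation_term = np.exp(-excess / elev_scale)
    return distance_term * elevation_term

def fpi_map(dem, whps, tweet_density, bandwidth_cells=3.0, elev_scale=1.0):
    """Step 2 (assumed form): merge all surfaces and apply the tweet weight."""
    dem = np.asarray(dem, dtype=float)
    tweet_density = np.asarray(tweet_density, dtype=float)
    surfaces = [
        fpi_surface(dem, r, c, h, bandwidth_cells, elev_scale)
        for r, c, h in whps
    ]
    combined = np.max(surfaces, axis=0)  # assumed merge rule
    peak = tweet_density.max()
    density_norm = tweet_density / peak if peak > 0 else np.zeros_like(tweet_density)
    weight = 0.5 + 0.5 * density_norm  # assumed weighting, in [0.5, 1]
    return combined * weight

# Toy inputs: a 5 x 5 sloping DEM, two WHPs (row, col, water height),
# and a uniform tweet density surface.
dem = np.add.outer(np.arange(5), np.arange(5)) * 0.5
whps = [(1, 1, 0.5), (3, 2, 1.0)]
tweet_density = np.ones((5, 5))

fpi = fpi_map(dem, whps, tweet_density)
print(np.round(fpi, 2))
```

In this sketch the index stays between 0 and 1 and peaks at the WHP cells; a real implementation would work in map units on the study area's raster grid.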

The following figure shows the cell-by-cell comparison between our model output (the FPI map) and the USGS inundation map using four categories: matched (flooded; both the FPI map and the USGS map agree a cell was flooded), matched (not flooded; both maps agree a cell was not flooded), overestimated (the FPI map shows a cell was flooded while the USGS map does not), and underestimated (the USGS map shows a cell was flooded while the FPI map does not). The two maps agree on a majority of cells (83.4%, blue and light blue), indicating that the proposed approach can provide a consistent and comparable estimate of the flood situation in near-real time, which is essential for improving situational awareness during a flooding event and supporting decision making.
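
The four categories follow directly from two boolean rasters on the same grid. The sketch below assumes the FPI map has already been converted to flooded / not flooded (the threshold used for that is not specified here), and the small arrays are made up for illustration.

```python
import numpy as np

def compare_maps(fpi_flooded, usgs_flooded):
    """Count cells in each agreement category and the overall agreement (%)."""
    fpi = np.asarray(fpi_flooded, dtype=bool)
    usgs = np.asarray(usgs_flooded, dtype=bool)
    if fpi.shape != usgs.shape:
        raise ValueError("both maps must share the same grid")
    counts = {
        "matched_flooded": int(np.sum(fpi & usgs)),
        "matched_not_flooded": int(np.sum(~fpi & ~usgs)),
        "overestimated": int(np.sum(fpi & ~usgs)),
        "underestimated": int(np.sum(~fpi & usgs)),
    }
    matched = counts["matched_flooded"] + counts["matched_not_flooded"]
    agreement_pct = 100.0 * matched / fpi.size
    return counts, agreement_pct

# Made-up 2 x 3 rasters (1 = flooded, 0 = not flooded).
fpi_flooded = np.array([[1, 1, 0],
                        [0, 1, 0]])
usgs_flooded = np.array([[1, 0, 0],
                         [1, 1, 0]])

counts, agreement_pct = compare_maps(fpi_flooded, usgs_flooded)
print(counts)
print(f"agreement: {agreement_pct:.1f}%")
```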


This model has been applied to the Hurricane Harvey flooding in Houston, TX. The map below shows Harvey social vulnerability and flooding depths in Harris County.

Kernel-based Flood Mapping Model Integrating Social Media and Remote Sensing Imagery

Further studies have been carried out to improve this model by incorporating post-event remote sensing images as another data source. Details of the improved model can be found in Huang X., Wang C., Li Z. (2018a, 2018b).


