Massive Data Institute

At the McCourt School of Public Policy

For much of my time at Georgetown, I worked at the Massive Data Institute at the McCourt School of Public Policy, on their Environmental Impact Data Collaborative Team. The EIDC works with researchers in government, academia, and industry to solve climate problems using big data to identify the most at risk communities who may be eligible for climate funding.

One of my major projects was building a tool to move data between geographic scales (i.e. congressional districts to counties). When dealing with digital shapefiles that come in bizarre and gerrymandered shapes, many nonprofits and academics need to subdivide or join data according to new shape boundaries. To do this, we developed a tool in Python that can intake data at one geographic scale, calculate the population density weighted partitions of one spatial unit, and output the data at a new geographic scale.

The original article outlinining this can be found here: MDI’s Spatial Toolkit

Use cases:

  1. Moving between counties and congressional districts when calculating Green Jobs in California
  2. Moving between water utility districts and census tracts to identify communities who could receive funding to address unsafe drinking water

Spatial Misalignment between CA Districts and Counties

Spatial Misalignment Between CA Districts and Counties