15.11.2012 Views

icegov2012 proceedings

icegov2012 proceedings

icegov2012 proceedings

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2. OVERVIEW OF DISTRICT MAPPING<br />

The mapping of the Sample Electoral Districts in New York State<br />

creates a number of districts on coordinate plain by taking into<br />

account a number of user controllable weights on variables. As<br />

shown in Figure 1 various data are collated and mapped to create<br />

districts. The mapping algorithm requires that the data that is<br />

being used have the ability to be in a coordinate plain. The<br />

original data that was being used contained 13 million data points,<br />

in which the location was an Address. Dan Goldberg provides a<br />

service, geocoding, which is able to convert the data from<br />

addresses to latitude and longitude. We expounded on the data by<br />

later adding, racial make up, economic factors and assessed house<br />

values.<br />

The mapping uses a Recursive Bisection algorithm, which creates<br />

rectangles on a coordinate plain according to user-weighted<br />

variables. Recursive Bisection is a graph, in the demo the graph is<br />

a set of data points on a coordinate plain, portioning method in<br />

which equal points are assigned to either side of a bisector, in the<br />

demo we modify the algorithm to take into account minority and<br />

political affiliation. A total of two and half million plus data<br />

points are used to construct this mapping. Which is lower than the<br />

total amount of registered voters because of diminishing returns<br />

from using more data points. The Recursive Bisection Algorithm<br />

being used in the Mapping uses the address of registered New<br />

York State voters; the addresses are converted into a Latitude and<br />

Longitude format that is used in a coordinate system. Each<br />

coordinate has a certain amount of data connected to it. Including<br />

County data on racial makeup of the county, GDP Per Capita and<br />

unemployment the point itself stores the political affiliation and<br />

cost of the house, if available.<br />

Figure 3. An early user interface utilizing Google Maps<br />

The output is then shown in an interface like that in Figure 3.<br />

Data for a district is outputted when a user clicks on a district.<br />

Allowing for the user to explore what the algorithm has created.<br />

To evaluate the demo, according to the United States census there<br />

are roughly 19.5 million citizens of New York State. The Demo<br />

uses about 13 million data points. Which would mean the<br />

majority of all New York State citizens are taken into account<br />

even with migration. The Algorithm produces only rectangular<br />

shapes, which may fail to take into account political and natural<br />

barriers. However, because the algorithm group’s data points<br />

together these shapes usually avoid doing so. The rectangular<br />

shapes may be very oblong, but this is not necessarily good or<br />

521<br />

bad, for example, many of the runs of the algorithm produced a<br />

long, slender district in the north east portion of New York, which<br />

may seemingly be a obstruction to proper political management,<br />

until the geographic location of the northern portion of the<br />

Hudson and Champlain Valley’s are taken into account.<br />

3. DATA<br />

Economic, Voter Registration, and Minority data are accessible at<br />

varying granularities for the State of New York. The mapping of<br />

Electoral Districts utilizes voter registration data collected from<br />

the New York State Board of Elections. The address of registered<br />

voters in the state of New York were transformed to latitude and<br />

longitude points by Dan Goldberg 2 at the Spatial Sciences<br />

Institute, USC Dana and David Dornsife College of Letters arts<br />

and Sciences, University of Southern California. Economic data is<br />

also utilized in the mapping an index is created from the county<br />

unemployment and GDP Per Capita. The US census has the data<br />

for governments minority data, it also has political boundaries and<br />

shape files. This creates areas where the racial makeup of a point<br />

can be guessed. These averages can later be taken into account.<br />

4. CONCLUSIONS AND FUTURE WORK<br />

We demonstrated an open data approach to creating Electoral<br />

Districts by using Recursive Bisection in Mapping. Data points<br />

where the registered voters of New York. Economic and Racial<br />

Data added to scope of controllable variables in the Mapping<br />

A number of extensions to the mapping algorithm and the sources<br />

of data used are ongoing. The leveraging of semantic<br />

technologies, such as a owl ontology and storage of the voter<br />

registration data. Average personal and business income and<br />

religious affiliation would increase the base of data being drawn<br />

on. If the mapping toke into account current political boundaries it<br />

would be further improved, this could be done using census<br />

boundaries and shape files for towns and counties.<br />

5. ACKNOWLEDGEMENTS<br />

The authors would like to thank Jim Hendler, Tetherless World<br />

Constellation at Rensselaer Polytechnic Institute, Dan Goldberg 3 ,<br />

University of Southern California, and Francine Berman,<br />

Rensselaer Polytechnic Institute and Aaron Tobais, Rensselaer<br />

Polytechnic Institute.<br />

6. REFERENCES<br />

[1] Vickrey, William. On the Prevention of Gerrymandering.<br />

Poltical Science Quarterly, 76, 1 (Mar. 1961), 1<br />

2 https://webgis.usc.edu/<br />

3 Dan W. Goldberg PHD, Spatial Sciences Institute, University of<br />

Southern California, dwgoldbe@usc.edu

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!