15.11.2012 Views

icegov2012 proceedings

icegov2012 proceedings

icegov2012 proceedings

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

5.1.4 Precedence Plots<br />

Based on computation of AA and SS for a particular district, the<br />

structure in the lower part of the subordination schematic can be<br />

used to prepare a ‘precedence plot’ for visualization (Figure 5).<br />

The precedence plot is a plot for AA (ascribed advantage) as Yaxis<br />

and SS (subordinate status) as the X-axis. The prominent<br />

position declines from top (upper-left) to toe (lower-right).<br />

Primary prominence varies vertically showing that there is a<br />

larger percentage of ascribed advantage (greater severity) with<br />

increasing height. Horizontal variation on a given level shows<br />

clarity of comparison. Farther to the right is greater clarity as<br />

more definite (less severe) with a larger percentage of subordinate<br />

status versus indefinite instances among the couplets where<br />

ascribed advantage (greater severity) is lacking. In other words,<br />

more indefinite instances constitute increased lack of clarity<br />

(incomparability in the usual parlance of partial ordering).<br />

Scheme 3 in [8] Function Facilities gives an R function named<br />

PrecPlot which accepts the output of the ProdOrdr function and<br />

produces a precedence plot.<br />

5.1.5 Representative Ranks<br />

Representative ranks show descriptive order statistics of indicator<br />

rankings for each district. Representative ranks are concerned<br />

with the rank distribution for any particular district. The rank<br />

numbers received by a given district across all criteria can be<br />

placed in a single array and sorted in ascending order. The<br />

minimum, median, and maximum ranks for the district are<br />

conveniently informative.<br />

5.2 Upper Level Set (ULS) Scan statistics<br />

By definition, a hotspot is a zone characterized by a high response<br />

rate, it makes intuitive sense that ULS method search for a hotspot<br />

begin with a cell with the highest response rate and annex to it<br />

other cells with high response rates. The need to bring a definite<br />

order for the search leads one to the concept of the ULS scan tree<br />

or just the ULS tree. Denoting for cell a the response rate ga =<br />

ya/na, a ∈ T over the connected region R, we define the ULS tree<br />

as follows:<br />

Let G = {ga | a ∈ T}<br />

Suppose r1 > r2 > … > rm are all distinct members of G.<br />

Define for i = 1, 2, …, m,<br />

Ti = { a ∈ T | ga = ri },<br />

and the upper level sets as unions of Ti’s,<br />

Ui = T1 ∪ T2 ∪ … ∪ Ti<br />

= { a ∈ T | ga ≥ ri }.<br />

Let Ci = set of connected components of Ui, i = 1, 2, …, m, and<br />

the reduced parameter space ΩULS = C1 ∪ … ∪ Cm.<br />

The ULS tree = (ΩULS) is a tree whose nodes are members<br />

of Ω ULS. The root of the ULS tree is Um = R. Let Z be a node<br />

belonging to the ULS tree. If all cells belonging to Z have the<br />

same response rate, then Z is a leaf node. Thus, members of C1 are<br />

leaf nodes. If not all cells in Z have the same rates then let e =<br />

min{ ga | a ∈ Z} and connected components of Z – {a ∈ Z: ga =<br />

e} are members of Z. [13]<br />

Since our search for a hotspot is based on cellular response rates,<br />

we assign a unique level to each node of the ULS tree in terms of<br />

its response rate and then scan the tree by level starting with nodes<br />

level 1 down. Members of C1 are level 1 nodes. The root is the<br />

level m node. In general, for 1 ≤ i ≤ m, the level of a zone Z is i if<br />

and only if min{ ga | a ∈ Z } = ri. Since each cell is introduced in<br />

229<br />

a unique node in the ULS tree, the ULS tree has at most N nodes.<br />

Hence |ΩULS| ≤ N, the equality holding if and only if m = N. [13]<br />

Several geometric properties should be satisfied by a collection of<br />

cells from tessellation of study area before it could be considered<br />

as a candidate for a hotspot cluster<br />

1) the union of the cells should comprise a geographically<br />

connected subset of the region R. Such collections of cells<br />

will be referred to as zones and the set of all zones is denoted<br />

by Ω. A zone Z∈Ω is a collection of cells that are connected.<br />

2) the zone should not be excessively large. This restriction is<br />

generally achieved by limiting the search for hotspot to zones<br />

that do not comprise more than fifty percent of the region.<br />

[14]<br />

6. RESULTS AND DISCUSSION<br />

6.1 Correlation Analysis and Descriptive<br />

Statistics<br />

Paired scatter plots make a simple way to see and get information<br />

about the strength of the relationship between two indicators.<br />

Scatter plots of pairs of infant health indicators are shown in<br />

Figure 2. If the observation points tend to be in a straight line,<br />

there is a correlation (relationship) between two variables<br />

(indicators). Table 2 shows the correlation coefficient between<br />

two indicators. Through this table it can be seen how strong the<br />

relationship between the two indicators is. According to Figure 2,<br />

there are tendencies of linear relationship between indicators infd<br />

and pov, infd and lbw, and pov and lbw. Table 2 shows that the<br />

correlation between infant death and poverty is 0.567. It can be<br />

said that there is a positive statistical association between those<br />

two indicators because it has p-value of 1.621e-10 (less than<br />

0.05). In statistics, if a parameter estimation (in this case is the<br />

coefficient correlation) has p-value of less than 0.05, it means the<br />

error probability of making conclusion is less than 5 percent. This<br />

is also true regarding infant death and low birth weight (r =<br />

0.4163; p-value = 7.425e-06); and for poverty and low birth<br />

weight (r = 0.399; p-value < 0.0001). There are positive<br />

relationships between infant death (infd) and absence of health<br />

personnel (abhp) and also between infd and average education<br />

shortfall (avedsf), even though both relationships are weaker. The<br />

relationship between infant death and births without health<br />

personnel (abhp) is stronger (r = 0.319; p-value = 0.0007531)<br />

than the relationship between infant death and average education<br />

shortfall (r = 0.290; p-value = 0.002375).<br />

Variability of the infant health indicators is shown by Figure 3a<br />

and 3b. Figure 3a shows boxplots of infm, pov (in thousands), and<br />

lbw. Thus, lbw is more variable than the other two indicators and<br />

contains outliers. Figure 3b shows boxplots of births without<br />

health personnel (%) and education shortfall.<br />

4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!