Download PDF - Ivie
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
3. Data Description and Empirical Specification<br />
Estimation is carried out on the US Census data for the year 2000, specifically its fivepercent<br />
sample “5% IPUMS data” (a 1-in-20 national random nationwide population), which<br />
provides -- in addition to standard samples of heterosexual individuals -- detailed demographic,<br />
labor and income information on the largest sample of gay and lesbian partners. Unmarried “heads”<br />
and “unmarried partners” and a random sample (twenty percent) of married “heads” and “spouses”<br />
were extracted from the Census using the variables “relationship to household head” and “marital<br />
status’. Records in these files were subsequently matched by means of the household identification<br />
code “serial” to create a single observation for each couple. Using the variable “sex”, couples with<br />
the head and the partner sharing the same gender were then identified as same-sex couples, gay and<br />
lesbian, and those with opposite gender as heterosexual couples. Individuals with imputed values<br />
for sex, marital status, and relationship to household head were excluded from our samples. This<br />
procedure is crucial to our purposes, to extract actual same-sex couples from the 2000 US Census.<br />
As documented in Black et al. (2006), Jepsen (2007), and in subsection 4.2 below, this method<br />
prevents heterosexual couples from being identified as homosexual, which could occur on account<br />
of a 2000 Census recoding error.<br />
In the Census, gays and lesbians are identified by their cohabiting relationship, a household<br />
being recorded as a same-sex union if the “relationship to head” is specified as “unmarried partner”,<br />
so that single gays or lesbians cannot be recovered. This limitation is of less concern to us since our<br />
analysis is exclusively of couples. However, most economic studies on homosexuals use Census<br />
data, from either 1990 or 2000. Others (e.g., Black et al., 2003; Blandford, 2003) use data from the<br />
General Social Survey (GSS), in which one can identify single gays and lesbians. Nevertheless, the<br />
GSS’s homosexual sample comprises far fewer observations (around three hundred total) thanthe<br />
Census data, and sexual orientation in the GSS is inferred from self-reported sexual activity,<br />
14