Michael Hay - Cornell University
Michael Hay - Cornell University
Michael Hay - Cornell University
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Michael</strong> <strong>Hay</strong><br />
Contact<br />
Education<br />
Research Interests<br />
4130 Upson Hall mhay@cs.cornell.edu<br />
<strong>Cornell</strong> <strong>University</strong> (413) 835-5618<br />
Ithaca, NY 14853<br />
<strong>University</strong> of Massachusetts Amherst<br />
Ph.D., Computer Science, 2010<br />
Dissertation: Enabling Accurate Analysis of Private Network Data<br />
Committee: Gerome Miklau, David Jensen, Don Towsley, Andrew Papachristos<br />
<strong>University</strong> of Massachusetts Amherst<br />
M.S., Computer Science, 2007<br />
GPA: 4.0 / 4.0<br />
Dartmouth College<br />
A.B. cum laude, Computer Science, 1998<br />
GPA: 3.6 / 4.0<br />
Data management and data mining, with a focus on privacy. Other interests: machine learning,<br />
social network analysis, security.<br />
Research Experience<br />
Computing Innovation Fellow, <strong>Cornell</strong> <strong>University</strong> 2010 - Present<br />
Mentor: Johannes Gehrke.<br />
Researching privacy-preserving data publishing. Developed a novel technique for generating<br />
a synopsis of a sensitive dataset that is both accurate and safe for public release. Currently<br />
investigating alternative definitions of privacy that are robust but not as conservative as<br />
differential privacy, thereby offering the promise of greater utility and broader application.<br />
Research Assistant, <strong>University</strong> of Massachusetts Amherst 2002 - 2010<br />
Co-advisors: David Jensen and Gerome Miklau.<br />
Thesis work: Investigated privacy implications of publishing anonymized network data<br />
and developed techniques for safe publication. Other work: Designed machine learning<br />
algorithms for various tasks—extraction, deduplication, classification—in network data.<br />
Teaching Experience<br />
Instructor, <strong>Cornell</strong> <strong>University</strong> Spring 2012<br />
CS 1330 Privacy in Bits: How Digital Technology is Reshaping Privacy<br />
A new course that I proposed and designed. Offered as a first-year writing seminar coordinated<br />
by <strong>Cornell</strong>’s Institute for Writing in the Disciplines. Through student writing and<br />
research, course investigates digital technologies and their apparent conflict with privacy.<br />
Instructor, <strong>Cornell</strong> <strong>University</strong> Summer 2011<br />
CS 2110 Object-Oriented Programming and Data Structures<br />
Responsible for all aspects of this “CS2” course of 43 students. Supervised two teaching<br />
assistants and three undergraduate graders. Used a combination of lecture and in-class<br />
exercises—both brief “think-pair-share” activities and worksheet-based group exercises.<br />
Instructor, <strong>University</strong> of Massachusetts Amherst Summer 2006<br />
CS 121 Introduction to Problem Solving with Computers<br />
Responsible for all aspects of this “CS1” course. Used a hybrid of lecture and laboratory.
Teaching Assistant, <strong>University</strong> of Massachusetts Amherst Spring 2006<br />
CS 121 Introduction to Problem Solving with Computers<br />
Responsible for teaching weekly section (roughly 40 students) and holding office hours in<br />
computer laboratory. Assistant to Professor Robert Moll.<br />
Guest Lecturer, <strong>Cornell</strong> <strong>University</strong><br />
CIS 3000 Introduction to Computer Game Design, Fall 2010, Fall 2011<br />
Guest Lecturer, <strong>University</strong> of Massachusetts Amherst<br />
CS 745 Advanced Database Systems, Fall 2008<br />
CS 121 Introduction to Problem Solving with Computers, Fall 2006<br />
CS 383 Artificial Intelligence, Fall 2006<br />
CS 591Y Knowledge Discovery and Data Mining, Spring 2005<br />
Ski Instructor, Various resorts 1992-1996<br />
Taught all ages and skill levels.<br />
Teaching Competencies<br />
Any lower-level courses (CS0, CS1, CS2)<br />
Upper-level courses: algorithms, artificial intelligence, databases<br />
Advanced topics: machine learning, data mining, social network analysis, privacy<br />
Math courses: discrete mathematics, probability, statistics<br />
Writing seminars<br />
Professional Experience<br />
Summer Intern, Adverplex, Cambridge, MA Summer 2008<br />
Applied machine learning algorithms to identify relevant keywords for search engine marketing<br />
campaigns.<br />
Program Manager, Second Nature, Boston, MA 2000 - 2002<br />
Created online portal to connect academic institutions with resources on environmental<br />
sustainability. Managed IT infrastructure.<br />
Volunteer Intern, The Nature Conservancy, Boulder, CO 1999 - 2000<br />
Volunteered part-time to research tracts of land for potential acquisition.<br />
Software Engineer, Kenan Systems, Denver, CO 1998 - 2000<br />
Developed software for customer care and data migration. Consulted clients on software<br />
configuration and live production.<br />
Trip Leader, Putney Student Travel, Arusha, Tanzania Summer 1998<br />
Led a five week community service trip for high school students.<br />
Awards Postdoctoral Fellowship, NSF/CRA Computing Innovation Fellow, 2010-2012<br />
Dissertation Award, ACM Special Interest Group on Knowledge Discovery & Data Mining, 2011<br />
Dissertation Award, U. of Massachusetts Amherst, CS Department, sponsored by Yahoo, 2011<br />
Best Student Paper Award, IEEE International Conference on Data Mining, 2009<br />
KDD Cup Winner, ACM International Conference on Knowledge Discovery & Data Mining, 2003<br />
Citation for Meritorious Scholarship, Dartmouth College, 1996<br />
<strong>Michael</strong> <strong>Hay</strong> Curriculum Vitae (2 of 4)
Publications<br />
Journal Articles<br />
<strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, David Jensen, Don Towsley, and Chao Li. Resisting structural<br />
re-identification in anonymized social networks. International Journal on Very Large Data Bases<br />
(VLDB Journal), 2010.<br />
Conference Papers<br />
Xiaokui Xiao, Gabriel Bender, <strong>Michael</strong> <strong>Hay</strong>, and Johannes Gehrke. iReduct: Differential privacy<br />
with reduced relative errors. In ACM International Conference on the Management of Data<br />
(SIGMOD), 2011.<br />
<strong>Michael</strong> <strong>Hay</strong>, Vibhor Rastogi, Gerome Miklau, and Dan Suciu. Boosting the accuracy of differentially<br />
private histograms through consistency. In Proceedings of the VLDB Endowment<br />
(PVLDB), 2010.<br />
Chao Li, <strong>Michael</strong> <strong>Hay</strong>, Vibhor Rastogi, Gerome Miklau, and Andrew McGregor. Optimizing<br />
linear counting queries under differential privacy. In ACM Symposium on Principles of Database<br />
Systems (PODS), 2010.<br />
<strong>Michael</strong> <strong>Hay</strong>, Chao Li, Gerome Miklau, and David Jensen. Accurate estimation of the degree<br />
distribution of private networks. In IEEE International Conference on Data Mining (ICDM),<br />
2009.<br />
Vibhor Rastogi, <strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, and Dan Suciu. Relationship privacy: output<br />
perturbation for queries with joins. In ACM Symposium on Principles of Database Systems<br />
(PODS), 2009.<br />
<strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, David Jensen, Donald F. Towsley, and Philipp Weis. Resisting<br />
structural re-identification in anonymized social networks. In Proceedings of the VLDB Endowment<br />
(PVLDB), 2008.<br />
Ben Wellner, Andrew McCallum, Fuchun Peng, and <strong>Michael</strong> <strong>Hay</strong>. An integrated, conditional<br />
model of information extraction and coreference for citation matching. In Conference on Uncertainty<br />
in Artificial Intelligence (UAI), 2004.<br />
Jennifer Neville, David Jensen, Lisa Friedland, and <strong>Michael</strong> <strong>Hay</strong>. Learning relational probability<br />
trees. In ACM International Conference on Knowledge Discovery and Data Mining (KDD),<br />
2003.<br />
David Jensen, Jennifer Neville, and <strong>Michael</strong> <strong>Hay</strong>. Avoiding bias when aggregating relational<br />
data with degree disparity. In International Conference on Machine Learning (ICML), 2003.<br />
Other Scholarship<br />
<strong>Michael</strong> <strong>Hay</strong>, Kun Liu, Gerome Miklau, Jian Pei, and Evimaria Terzi. Privacy-aware Data Management<br />
in Information Networks (Tutorial). In ACM International Conference on the Management<br />
of Data (SIGMOD), 2011.<br />
<strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, and David Jensen. Privacy-Aware Knowledge Discovery: Novel<br />
Applications and New Techniques, chapter Enabling accurate analysis of private network data.<br />
CRC Press, 2010.<br />
Amy McGovern, Lisa Friedland, <strong>Michael</strong> <strong>Hay</strong>, Brian Gallagher, Andy Fast, Jen Neville, and<br />
David Jensen. Exploiting relational structure to understand publication patterns in high-energy<br />
physics. ACM SIGKDD Explorations, 2003.<br />
Service Program Committees<br />
World Wide Web Conference (WWW), 2012<br />
<strong>Michael</strong> <strong>Hay</strong> Curriculum Vitae (3 of 4)
ACM International Conference on the Management of Data (SIGMOD), 2012<br />
IEEE International Conference on Data Engineering (ICDE), 2011<br />
International Conference on Machine Learning (ICML), 2011<br />
Privacy Aspects of Data Mining Workshop at the IEEE International Conference on Data<br />
Mining (ICDM), 2011<br />
Journal reviewing<br />
ACM Transactions on Database Systems (TODS), 2010<br />
Data Mining and Knowledge Discovery (DMKD), 2010<br />
Transactions on Data Privacy, 2010<br />
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2008<br />
Proposal reviewing<br />
National Science Foundation, Information & Intelligent Systems Division, 2011<br />
Professional Memberships<br />
Association for Computing Machinery (ACM)<br />
ACM Special Interest Groups:<br />
Computer Science Education (SIGCSE)<br />
Knowledge Discovery and Data Mining (SIGKDD)<br />
Management of Data (SIGMOD)<br />
<strong>Michael</strong> <strong>Hay</strong> Curriculum Vitae (4 of 4)