17.01.2013 Views

Michael Hay - Cornell University

Michael Hay - Cornell University

Michael Hay - Cornell University

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Michael</strong> <strong>Hay</strong><br />

Contact<br />

Education<br />

Research Interests<br />

4130 Upson Hall mhay@cs.cornell.edu<br />

<strong>Cornell</strong> <strong>University</strong> (413) 835-5618<br />

Ithaca, NY 14853<br />

<strong>University</strong> of Massachusetts Amherst<br />

Ph.D., Computer Science, 2010<br />

Dissertation: Enabling Accurate Analysis of Private Network Data<br />

Committee: Gerome Miklau, David Jensen, Don Towsley, Andrew Papachristos<br />

<strong>University</strong> of Massachusetts Amherst<br />

M.S., Computer Science, 2007<br />

GPA: 4.0 / 4.0<br />

Dartmouth College<br />

A.B. cum laude, Computer Science, 1998<br />

GPA: 3.6 / 4.0<br />

Data management and data mining, with a focus on privacy. Other interests: machine learning,<br />

social network analysis, security.<br />

Research Experience<br />

Computing Innovation Fellow, <strong>Cornell</strong> <strong>University</strong> 2010 - Present<br />

Mentor: Johannes Gehrke.<br />

Researching privacy-preserving data publishing. Developed a novel technique for generating<br />

a synopsis of a sensitive dataset that is both accurate and safe for public release. Currently<br />

investigating alternative definitions of privacy that are robust but not as conservative as<br />

differential privacy, thereby offering the promise of greater utility and broader application.<br />

Research Assistant, <strong>University</strong> of Massachusetts Amherst 2002 - 2010<br />

Co-advisors: David Jensen and Gerome Miklau.<br />

Thesis work: Investigated privacy implications of publishing anonymized network data<br />

and developed techniques for safe publication. Other work: Designed machine learning<br />

algorithms for various tasks—extraction, deduplication, classification—in network data.<br />

Teaching Experience<br />

Instructor, <strong>Cornell</strong> <strong>University</strong> Spring 2012<br />

CS 1330 Privacy in Bits: How Digital Technology is Reshaping Privacy<br />

A new course that I proposed and designed. Offered as a first-year writing seminar coordinated<br />

by <strong>Cornell</strong>’s Institute for Writing in the Disciplines. Through student writing and<br />

research, course investigates digital technologies and their apparent conflict with privacy.<br />

Instructor, <strong>Cornell</strong> <strong>University</strong> Summer 2011<br />

CS 2110 Object-Oriented Programming and Data Structures<br />

Responsible for all aspects of this “CS2” course of 43 students. Supervised two teaching<br />

assistants and three undergraduate graders. Used a combination of lecture and in-class<br />

exercises—both brief “think-pair-share” activities and worksheet-based group exercises.<br />

Instructor, <strong>University</strong> of Massachusetts Amherst Summer 2006<br />

CS 121 Introduction to Problem Solving with Computers<br />

Responsible for all aspects of this “CS1” course. Used a hybrid of lecture and laboratory.


Teaching Assistant, <strong>University</strong> of Massachusetts Amherst Spring 2006<br />

CS 121 Introduction to Problem Solving with Computers<br />

Responsible for teaching weekly section (roughly 40 students) and holding office hours in<br />

computer laboratory. Assistant to Professor Robert Moll.<br />

Guest Lecturer, <strong>Cornell</strong> <strong>University</strong><br />

CIS 3000 Introduction to Computer Game Design, Fall 2010, Fall 2011<br />

Guest Lecturer, <strong>University</strong> of Massachusetts Amherst<br />

CS 745 Advanced Database Systems, Fall 2008<br />

CS 121 Introduction to Problem Solving with Computers, Fall 2006<br />

CS 383 Artificial Intelligence, Fall 2006<br />

CS 591Y Knowledge Discovery and Data Mining, Spring 2005<br />

Ski Instructor, Various resorts 1992-1996<br />

Taught all ages and skill levels.<br />

Teaching Competencies<br />

Any lower-level courses (CS0, CS1, CS2)<br />

Upper-level courses: algorithms, artificial intelligence, databases<br />

Advanced topics: machine learning, data mining, social network analysis, privacy<br />

Math courses: discrete mathematics, probability, statistics<br />

Writing seminars<br />

Professional Experience<br />

Summer Intern, Adverplex, Cambridge, MA Summer 2008<br />

Applied machine learning algorithms to identify relevant keywords for search engine marketing<br />

campaigns.<br />

Program Manager, Second Nature, Boston, MA 2000 - 2002<br />

Created online portal to connect academic institutions with resources on environmental<br />

sustainability. Managed IT infrastructure.<br />

Volunteer Intern, The Nature Conservancy, Boulder, CO 1999 - 2000<br />

Volunteered part-time to research tracts of land for potential acquisition.<br />

Software Engineer, Kenan Systems, Denver, CO 1998 - 2000<br />

Developed software for customer care and data migration. Consulted clients on software<br />

configuration and live production.<br />

Trip Leader, Putney Student Travel, Arusha, Tanzania Summer 1998<br />

Led a five week community service trip for high school students.<br />

Awards Postdoctoral Fellowship, NSF/CRA Computing Innovation Fellow, 2010-2012<br />

Dissertation Award, ACM Special Interest Group on Knowledge Discovery & Data Mining, 2011<br />

Dissertation Award, U. of Massachusetts Amherst, CS Department, sponsored by Yahoo, 2011<br />

Best Student Paper Award, IEEE International Conference on Data Mining, 2009<br />

KDD Cup Winner, ACM International Conference on Knowledge Discovery & Data Mining, 2003<br />

Citation for Meritorious Scholarship, Dartmouth College, 1996<br />

<strong>Michael</strong> <strong>Hay</strong> Curriculum Vitae (2 of 4)


Publications<br />

Journal Articles<br />

<strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, David Jensen, Don Towsley, and Chao Li. Resisting structural<br />

re-identification in anonymized social networks. International Journal on Very Large Data Bases<br />

(VLDB Journal), 2010.<br />

Conference Papers<br />

Xiaokui Xiao, Gabriel Bender, <strong>Michael</strong> <strong>Hay</strong>, and Johannes Gehrke. iReduct: Differential privacy<br />

with reduced relative errors. In ACM International Conference on the Management of Data<br />

(SIGMOD), 2011.<br />

<strong>Michael</strong> <strong>Hay</strong>, Vibhor Rastogi, Gerome Miklau, and Dan Suciu. Boosting the accuracy of differentially<br />

private histograms through consistency. In Proceedings of the VLDB Endowment<br />

(PVLDB), 2010.<br />

Chao Li, <strong>Michael</strong> <strong>Hay</strong>, Vibhor Rastogi, Gerome Miklau, and Andrew McGregor. Optimizing<br />

linear counting queries under differential privacy. In ACM Symposium on Principles of Database<br />

Systems (PODS), 2010.<br />

<strong>Michael</strong> <strong>Hay</strong>, Chao Li, Gerome Miklau, and David Jensen. Accurate estimation of the degree<br />

distribution of private networks. In IEEE International Conference on Data Mining (ICDM),<br />

2009.<br />

Vibhor Rastogi, <strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, and Dan Suciu. Relationship privacy: output<br />

perturbation for queries with joins. In ACM Symposium on Principles of Database Systems<br />

(PODS), 2009.<br />

<strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, David Jensen, Donald F. Towsley, and Philipp Weis. Resisting<br />

structural re-identification in anonymized social networks. In Proceedings of the VLDB Endowment<br />

(PVLDB), 2008.<br />

Ben Wellner, Andrew McCallum, Fuchun Peng, and <strong>Michael</strong> <strong>Hay</strong>. An integrated, conditional<br />

model of information extraction and coreference for citation matching. In Conference on Uncertainty<br />

in Artificial Intelligence (UAI), 2004.<br />

Jennifer Neville, David Jensen, Lisa Friedland, and <strong>Michael</strong> <strong>Hay</strong>. Learning relational probability<br />

trees. In ACM International Conference on Knowledge Discovery and Data Mining (KDD),<br />

2003.<br />

David Jensen, Jennifer Neville, and <strong>Michael</strong> <strong>Hay</strong>. Avoiding bias when aggregating relational<br />

data with degree disparity. In International Conference on Machine Learning (ICML), 2003.<br />

Other Scholarship<br />

<strong>Michael</strong> <strong>Hay</strong>, Kun Liu, Gerome Miklau, Jian Pei, and Evimaria Terzi. Privacy-aware Data Management<br />

in Information Networks (Tutorial). In ACM International Conference on the Management<br />

of Data (SIGMOD), 2011.<br />

<strong>Michael</strong> <strong>Hay</strong>, Gerome Miklau, and David Jensen. Privacy-Aware Knowledge Discovery: Novel<br />

Applications and New Techniques, chapter Enabling accurate analysis of private network data.<br />

CRC Press, 2010.<br />

Amy McGovern, Lisa Friedland, <strong>Michael</strong> <strong>Hay</strong>, Brian Gallagher, Andy Fast, Jen Neville, and<br />

David Jensen. Exploiting relational structure to understand publication patterns in high-energy<br />

physics. ACM SIGKDD Explorations, 2003.<br />

Service Program Committees<br />

World Wide Web Conference (WWW), 2012<br />

<strong>Michael</strong> <strong>Hay</strong> Curriculum Vitae (3 of 4)


ACM International Conference on the Management of Data (SIGMOD), 2012<br />

IEEE International Conference on Data Engineering (ICDE), 2011<br />

International Conference on Machine Learning (ICML), 2011<br />

Privacy Aspects of Data Mining Workshop at the IEEE International Conference on Data<br />

Mining (ICDM), 2011<br />

Journal reviewing<br />

ACM Transactions on Database Systems (TODS), 2010<br />

Data Mining and Knowledge Discovery (DMKD), 2010<br />

Transactions on Data Privacy, 2010<br />

IEEE Transactions on Knowledge and Data Engineering (TKDE), 2008<br />

Proposal reviewing<br />

National Science Foundation, Information & Intelligent Systems Division, 2011<br />

Professional Memberships<br />

Association for Computing Machinery (ACM)<br />

ACM Special Interest Groups:<br />

Computer Science Education (SIGCSE)<br />

Knowledge Discovery and Data Mining (SIGKDD)<br />

Management of Data (SIGMOD)<br />

<strong>Michael</strong> <strong>Hay</strong> Curriculum Vitae (4 of 4)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!