The National Academies: Advisers to the Nation on Science, Engineering, and Medicine
NATIONAL ACADEMY OF SCIENCES NATIONAL ACADEMY OF ENGINEERING INSTITUTE OF MEDICINE NATIONAL RESEARCH COUNCIL
Current Operating Status
CNSTAT HOMEPAGE

WHAT'S NEW

ABOUT CNSTAT

CNSTAT MEMBERS

CURRENT PROJECTS

COMPLETED PROJECTS

MEETINGS

PUBLICATIONS

STAFF AND CONTACT INFORMATION

OTHER SITES OF INTEREST

LOCAL SEARCH


Current Projects

Workshop on Confidentiality of and Access to Data Research Files


_____________________________________________________________________________

Background

The Committee on National Statistics (CNSTAT), in consultation with the Institute of Medicine, convened a two-day workshop on October 14-15, 1999 to identify ways to advance the often conflicting goals of exploiting the research potential of micro-data while maintaining the appearance and reality of preserving confidentiality. Special attention was given to longitudinal data that have been linked to administrative records, because such data have greater research potential but are also more vulnerable to disclosures. The workshop brought together data producers from federal agencies and research organizations; data users, including academic researchers; and experts in statistical disclosure limitation techniques, confidentiality policies and administrative and legal procedures. Norman Bradburn, National Opinion Research Center, chaired the workshop.

The purpose of this workshop was to review the benefits and risks of providing public use research data files and to explore alternative procedures for restricting access to such data, especially longitudinal survey data that have been linked to administrative records. The workshop discussions addressed the impact of measures designed to reduce disclosure risk on each group involved—survey respondents, data producers, and data users. Workshop participants identified best current practices and suggested improvements to maximize the social return on investments in such databases, while fully complying with legal and ethical requirements.

The workshop discussion accomplished several goals:

  • reviewed current practices and concerns of federal agencies and other data producing organizations;
  • reviewed the types of research that are enhanced, or only made possible, using linked longitudinal data;
  • provided an overview of administrative arrangements to preserve confidentiality;
  • identified ways to foster data accessibility in secondary analysis; and
  • assessed the utility of statistical methods for limiting disclosure risk.

The Committee prepared a report covering workshop proceedings, which will include presentation and discussion summaries, and suggestions for future research. This report will be published in September 2000.

Plans are underway for developing follow-on activities to this workshop that could include convening a panel that would oversee one or more workshops and would then write a report with recommendations. It seems likely that future work would shift the focus away from what agencies are currently doing--which, after the first workshop, is fairly clear--and toward looking in more detail at: (1) cutting edge statistical techniques for manipulating data in ways that preserve important statistical properties and allow for broader general data release; (2) new, less burdensome ways (e.g., internet, remote access etc.) of providing researchers with access to restricted data sets; (3) researcher reaction to these potential new directions; and (4) the issue of licensing coupled with graduated civil and criminal penalties for infringement. Future efforts would focus more carefully on longitudinal microdata.

This activity is sponsored by the National Institute on Aging, the Agency for Health Care Policy and Research, the Bureau of Labor Statistics, the National Library of Medicine, the Office of the Assistant Secretary for Planning and Evaluation of the U.S. Department of Health and Human Services, the Office of Planning and Evaluation of the U.S. Department of Health and Human Services, and the Social Security Administration.


_____________________________________________________________________________

Workshop Papers

Prior to the workshop, the papers that were prepared and presented at the workshop were available electronically from this website. However, so that the authors may incorporate revisions or publish their papers elsewhere, we have removed the papers. Copies of the versions of the workshop papers that were distributed at the workshop are still available in hard copy. Please call the Committee office, 202-334-3096, to request a copy of any or all of these papers.

  • How Policy Variables Influence the Timing of Social Security Disability Insurance Applications” Richard V. Burkhauser, Department of Policy Analysis and Management, Cornell University; J.S. Butler, Department of Policy Analysis and Management, Cornell University; Robert R. Weathers II, Center for Policy Research, Syracuse University. August 1999.
  • Confidential Data Files Linked to the National Longitudinal Survey of Youth, 1979 Cohort: A Case Study” Rachel A. Gordon, Department of Sociology and Institute of Government and Public Affairs, University of Illinois at Chicago.
  • Protecting Confidentiality of Linked Datasets: Don't Throw the Baby out with Bathwater". J. Michael Dean and Lenora Olson, University of Utah School of Medicine.
  • The Value of Linked Data in Aging Research”: Summary Paper from a Conference held May 19, 1999. Richard Woodbury, National Bureau of Economic Research; Alan Gustman, Dartmouth College and National Bureau of Economic Research; Lee Lillard, University of Michigan; Olivia Mitchell; University of Pennsylvania and National Bureau of Economic Research; Robert Willis, University of Michigan
  • Regulating Access to Research Data Files: Ethical Issues” Mary Ann Baily, Institute for Ethics, American Medical Association
  • Human Subject Protections in Research Utilizing Data Files” Jeffery Cohen and Thomas Puglisi, Office of Protection from Research Risks, National Institutes of Health, U.S. Department of Health and Human Services
  • A Survey of Statistical Disclosure Limitation (SDL) Practices of Organizations that Distribute Public Use Microdata” Thomas Jabine, Consultant, Committee on National Statistics; Alice Robbin, Florida State University; Heather Koball, Committee on National Statistics
  • Multiple Imputation in the Survey of Consumer Finances” Arthur B. Kennickell, Board of Governors of the Federal Reserve System
  • Risk of Statistical Confidentiality Disclosure: A Preliminary Comparison of Masked and Synthetic Data Release” Sallie Keller-McNulty, Los Alamos National Laboratory, and George Duncan, Carnegie Mellon University
  • Review of Data Licensing Agreements at U.S. Government Agencies and Research Organizations” Paul Massell, U.S. Census Bureau
  • National Center for Health Statistics Approaches to the Release of Microdata: Data Perturbation and the Research Data Center. John Horm, National Center for Health Statistics
  • National Center for Education Statistics: Data Licensing Systems
  • Marilyn McMillen, National Center for Education Statistics


_____________________________________________________________________________

Staff

  • Christopher Mackie, Study Director
  • Thomas B. Jabine, Consultant

_____________________________________________________________________________

Publications

RSS News Feed | Subscribe to e-newsletters | Feedback | Back to Top