Big data has created complex new challenges to data privacy. One advantage of administrative big data is the enhanced feasibility of large scale record linkage. How can we make more data available to inform decision making without creating “Big Brother”? How can we inform this needed revolution in privacy protection without cutting back access to data?
In this webinar, Cavan Capps and Micah Altman will review their comprehensive analysis of an ACS use case that can be used to inform key decisions on how to protect data privacy while leveraging the latest data technologies. The results suggest that a multi-tiered access system to the data may be warranted in the future, potentially including traditional tabulations and regressions protected by Differential Privacy or variants of Secure Multi-party Computing (SMC) in software or in hardware using SGX, among other options. The webinar will discuss some of the strengths and weaknesses of the tools mentioned above and propose how such an infrastructure might be constructed.
Finally, the webinar will provide an update on our work continuing work to examine the practical use of SGX-SMC and software based SMC for data collection and integrating shared confidential data from different sources. This enables data sharing while maintaining individual privacy of individual during any analysis. Differential privacy will be used to ensure that any outputs remain confidential.
Micah Altman, Head Research Scientist, MIT Libraries
Cavan Capps, Big Data Lead, U.S. Census Bureau
Did you know that there are at least three sources of unemployment statistics in the United States? In this APDU webinar you’ll learn about the three primary data sources—Current Population Survey (CPS), Local Area Unemployment Statistics (LAUS), and American Community Survey (ACS)—and how they differ. Then we’ll explore how to access the official national and state unemployment statistics, based on CPS.
Garrett Schmitt, Senior Economist, Bureau of Labor Statistics
Mortality data are in the news on a daily basis. Accurate data is key to tracking the spread of COVID-19. However, there are important nuances that data users need to know:
- How are mortality data collected?
- When are data released?
- Where can you access the data?
- What are the differences between provisional and final mortality data?
Register for this APDU webinar today to learn more about mortality data from the CDC.
Robert N. Anderson, Ph.D., Chief, Mortality Statistics Branch, National Center for Health Statistics
The coronavirus pandemic is unprecedented in recent history. To meet this challenge, the U.S. Census Bureau, in collaboration with other federal statistical agencies, has developed two new experimental surveys: a Household Pulse Survey and a Small Business Pulse Survey. This webinar will provide an overview of the household survey, including the following:
- What type of information is collected in the household survey?
- What is the data collection method?
- What levels of geography are being reported?
- Where can someone find the data?
Jason Fields, Senior Researcher for Demographic Programs, U.S. Census Bureau
Have you ever looked for data from ACS or CPS, but published tables did not have exactly what you needed? Have you ever wished you could create quick crosstabs without writing code? The Census Bureau is developing a new microdata access feature on data.census.gov that allows users to create custom crosstabulations from ACS and CPS. Register for this webinar today to learn more about this feature and to submit feedback to the Census Bureau about how microdata access could be improved.
Tyson Weister, Survey Statistician, U.S. Census Bureau
In July 2019, the Census Bureau released the Business Formation Statistics (BFS), a new data product that tracks trends in business applications and formations at the state, regional and national levels.
The BFS consists of four business application series and eight business formation series. It’s unique because it relies on administrative data from the IRS, specifically the data on applications for an Employer Identification Number (EIN) via IRS Form SS-4, to determine the number of business applications submitted in a quarter, and how many result in businesses with employees.
The BFS also includes projections for business formations in the near future. The BFS began in 2012 as a research project in the Census Bureau’s Center for Economic Studies and was first released in beta form in February 2018. It’s the culmination of research efforts by the Census Bureau, Board of Governors of the Federal Reserve System, Federal Reserve Bank of Atlanta, University of Maryland and the University of Notre Dame. This webinar will provide an overview of BFS and demonstrate how to access BFS data available on the Census website.
Jason Jindrich, Survey Statistician, US Census Bureau
Jeff McHugh, Chief, New and Emerging Indicators Programs, US Census Bureau
Rebecca Hutchinson, Big Data Lead, Economic Indicators Division, US Census Bureau
APDU, C2ER, and LMI Institute Members: Free
The Census Bureau is introducing a new framework to protect individual data in the Decennial Census: “Differential Privacy”. This has implications for the reliability and availability of invaluable federal statistics – decreasing accuracy for small areas and small sub-population counts and reducing the scope of various data products in exchange for improved privacy protections.
This webinar from the Association of Public Data Users will provide a background on Disclosure Avoidance, details on the policy decisions leading to Differential Privacy and its subsequent implementation, and comparisons of recently released data comparing previously available 2010 Census data with data demonstrating the impact of Differential Privacy. Register today to learn more.
Beth Jarosz, Senior Research Associate, Population Reference Bureau
Kathryn Pettit, Principal Research Associate, Urban Institute
David Van Riper, Director of Spatial Analysis, IPUMS
APDU, C2ER, and LMI Institute Members: Free
Five years ago, staff at the Bureau of Labor Statistics had to read and manually code hundreds of thousands of written descriptions of work-related injuries and illnesses each year. Today, more than two thirds of these codes are now assigned by a deep neural network, which evaluations suggest is substantially more accurate on average than trained human workers.
In this webinar, Alex Measure will discuss how BLS addressed some of the many challenges inherent in this transition. Attendees will learn:
- how BLS built these new computer systems
- how they decided when and how to use them
- how to evaluate their performance
- how BLS monitors and maintain them to continually improve performance.
Alex Measure, Economist, Bureau of Labor Statistics
APDU, C2ER, and LMI Institute Members: Free
When the U.S. Commission on Evidence-Based Policymaking issued its unanimous recommendations to Congress in 2017, it called for the exploration of new approaches that promote data access and privacy preservation at the same time. This webinar discusses an application of one such technology – multi-party computation – in a real-world setting to assess the applicability of the approach in public agencies.
A demonstration project in Allegheny County, Pennsylvania applied privacy-preserving approaches to generate responses to policy-relevant questions about mental health services, homelessness services, and other public health policies. This demonstration project offers a compelling example of how the technologies can be deployed—which can advance consideration of the approach within agencies at all levels of government. Register today to learn how this new technology could impact the data you rely on.
Nick Hart, Ph.D., CEO, Data Coalition Fellow, Bipartisan Policy Center
In this demonstration, Treasury’s Bureau of the Fiscal Service will provide a general overview of the Data Lab and its guiding principles and values. The Data Transparency team will discuss the Data Lab’s relationship to USAspending.gov and our commitment to open data, human-centered design, and agile development. We will examine some Data Lab analyses, such as the recently launched Your Guide to America’s Finances, and review our methodology. Importantly, we will also explain how attendees, and the public at large, can leverage USAspending.gov’s open data to gain insights into public issues they care about. We welcome questions and feedback throughout the discussion.
The Data Lab is a platform designed to help generate public understanding of government spending through interactive data visualizations. It creates tools and visualizations that seek to answer interesting but complicated questions, and shows the power of the data available on USAspending.gov.
Justin Marsico, Product Manager, Research & Analytics for Data Transparency, Bureau of the Fiscal Service, U.S. Department of the Treasury