The Democratization of Big Data

Already a major technology trend, 2012 promises to be a watershed for "big data." A shorthand term for the proliferation of large datasets, big data also refers to the expansion of analytic techniques for teasing meaning from the vast archives of information produced by the digital world. The New York Times' Steve Lohr declared we have entered the "age of big data" in a recent article that compared it with another revolutionary research tool -- the microscope.

4 minute read

February 27, 2012, 7:40 AM PST

By Robert Goodspeed @rgoodspeed


Already a major technology trend, 2012 promises to be a watershed for "big data." A shorthand term for the proliferation of large datasets, big data also refers to the expansion of analytic techniques for teasing meaning from the vast archives of information produced by the digital world. The New York Times' Steve Lohr declared we have entered the "age of big data" in a recent article that compared it with another revolutionary research tool -- the microscope.

As I observed last year, big data is beginning to filter into the urban planning world. Here are a few examples of the intersection of cities and big data (two from this PlaceMatters blog post):

What do all these exciting examples of big data have in common? If you have modest technical skills and work for a local government or community-based organization, you probably do not have access to the data and skills necessary to replicate the projects.

Inequalities of data access are not new in planning. Sixteen years ago David Sawicki and William Craig argued in a Journal of the American Planning Association article titled "The Democratization of Data" that the most important ingredients to expanded access to the first generation of data wasn't advances of computing power or analysis skills, but the rise of data intermediaries that worked with community groups in low-income communities to ensure they had access to quality data and skills. Whether nonprofits, local governments, or university-led projects, these intermediaries helped equalize access to data in the public sphere.

However, as the size of datasets has increased, so have the skills necessary to manage and analyze the data. No longer is mastery of a few desktop applications sufficient for analysis, since wrangling today's large datasets requires database servers and analysts skilled at statistical and algorithmic data mining techniques. Although government datasets may have been the original big data, many of the new datasets are provided by corporations, introducing a morass of ethical and practical challenges. Frequently collected at the individual level, negotiating access requires navigating privacy and security concerns. Even when companies provide public access, extracting and using their data requires programming skills to tap application programming interfaces (APIs) or manipulate unusual data formats.

Finally, lurking beneath the big data hype are problematic unstated assumptions about the nature of truth. In the 1980s, the so-called quantitative-qualitative debate raged across several social science fields among scholars arguing the merits of various research methods. Some researchers stressed the need to collect empirical evidence and rely solely on quantitative analysis for research. Others argued social science required qualitative analysis such as interviews and observation to understand society. Although the debate is different today, important differences of opinion remain.

We should be cautious about claims that big data will necessarily answer important or relevant research or policy questions. Are cell phone traces sufficient to intuit travel behavior, or are surveys or interviews required to understand how people make choices? Can postings to social networking websites provide as much insight as a windshield survey, or an in-depth interview of community residents? The big data hype also runs counter to important developments in social science that stress the role of experiments and counterfactual reasoning, instead of relying on ever-more-complicated statistical models to explain the world.

What are some practical steps that big data could take to expand access by community-based organizations? A start might be to provide data in formats and sizes (perhaps through summary versions) that they can be analyzed in common software packages, such as ArcMap, Excel, and Google Earth. Data providers should provide documentation about the source, variables, and assumptions used to collect and process the data. Existing data intermediaries should explore the new datasets, and strategically expand their expertise where it seems appropriate. Although the proliferation of broadband and Internet-connected smartphones has reduced the prominence of the "digital divide," we must take steps now to reduce the emergence of a new "data divide" between sophisticated analysts and communities seeking to plan for their futures.


Robert Goodspeed

Robert Goodspeed is an Assistant Professor of Urban Planning at the University of Michigan. He holds a PhD from the MIT Department of Urban Studies and Planning and previously worked for the Boston Metropolitan Area Planning Council. See his academic website for more on his teaching and research.

portrait of professional woman

I love the variety of courses, many practical, and all richly illustrated. They have inspired many ideas that I've applied in practice, and in my own teaching. Mary G., Urban Planner

I love the variety of courses, many practical, and all richly illustrated. They have inspired many ideas that I've applied in practice, and in my own teaching.

Mary G., Urban Planner

Cover CM Credits, Earn Certificates, Push Your Career Forward

Logo for Planetizen Federal Action Tracker with black and white image of U.S. Capitol with water ripple overlay.

Planetizen Federal Action Tracker

A weekly monitor of how Trump’s orders and actions are impacting planners and planning in America.

June 11, 2025 - Diana Ionescu

Metrorail train pulling into newly opened subterranean station in Washington, D.C. with crowd on platform taking photos.

Congressman Proposes Bill to Rename DC Metro “Trump Train”

The Make Autorail Great Again Act would withhold federal funding to the system until the Washington Metropolitan Area Transit Authority (WMATA), rebrands as the Washington Metropolitan Authority for Greater Access (WMAGA).

June 2, 2025 - The Hill

Large crowd on street in San Francisco, California during Oktoberfest festival.

The Simple Legislative Tool Transforming Vacant Downtowns

In California, Michigan and Georgia, an easy win is bringing dollars — and delight — back to city centers.

June 2, 2025 - Robbie Silver

Man in teal shirt opening door to white microtransit shuttle with cactus graphics and making inviting gesture toward the camera.

Albuquerque’s Microtransit: A Planner’s Answer to Food Access Gaps

New microtransit vans in Albuquerque aim to close food access gaps by linking low-income areas to grocery stores, cutting travel times by 30 percent and offering planners a scalable model for equity-focused transit.

June 13 - U.S. Department Of Transportation

Group of people at table set ouf with picnic food on street during a neighborhood block party.

This City Will Pay You to Meet Your Neighbors

A North Kansas City grant program offers up to $400 for residents to throw neighborhood block parties.

June 13 - The Kansas City Star

Crowd gathered with protest signs on April 5, 2025 on steps of Minnesota state capitol protesting Trump cuts to social security and other federal programs.

Commentary: Our Silence Will Not Protect Us

Keeping our heads down and our language inoffensive is not the right response to the times we’re in. Solidarity and courage is.

June 13 - Shelterforce Magazine