Statistics and Data Science Subject Guide
- Get Started
- Find Data
- Evaluate Data
- Find Articles
- Find Books
- Cite Sources with Zotero
- Other Library Resources
Librarian

1510 St. Olaf Avenue
St. Olaf College
Northfield, MN 55057
Finding Datasets: Questions to Consider
When looking for data, begin by considering some key questions. This will help you strategize regarding where to search and make your research process more efficient.
- What is your research topic or question?
- What are the characteristics of the data you need?
- Unit of analysis: individuals, households, companies, players, teams, countries, states, nations, etc.
- Geography: parcels in a city, countries in a region, birds in a forest. etc.
- Time period: e.g. 1980-2006
- Frequency: annual, quarterly, etc.
- Who is likely to collect data on this topic? Consider:
- Specific researchers
- Government agencies
- NGOs/IGOs
- Think tanks and research organizations
- Where are these data likely to be indexed or published?
- Compendia, portals, and indexes: When data are likely to be compiled or reported, these tools allow you to search by topic and discover data and data producers. Examples include: Data Planet, Social Explorer, Data.gov
- When data are likely to be shared by the researchers who produced them, they are likely to deposit the data in repositories. Examples include: ICPSR, Dryad, Figshare
- What existing publications might use the data you need? Finding books, articles, or other research publications addressing your topic of interest can help you look "backwards" to find data. After all, researchers need to cite the datasets they used... and you might be able to use those datasets, too!
"Finding Datasets: Questions to Consider" is adapted from the "Data Reference Worksheet" created by Kristin Partlo and Danya Leebaw, used under CC-BY-SA 4.0. "Finding Datasets: Questions to Consider" is licensed under CC-BY-SA 4.0 by Audrey Gunn.
Places to Find Data and Datasets
This is just a small sample of the many places you can search for data and datasets. If you're not finding what you need, consider exploring our complete list of St. Olaf's paid databases for finding data/statistics. Another option is to search for relevant articles using one of our research databases, then look at the datasets cited in those articles.
-
Data Citation Index This link opens in a new windowData Citation Index fully indexes a significant number of the world's leading data repositories of critical interest to the scientific community, including over two million data studies and datasets.
-
Data.govFind datasets produced by U.S. government agencies. Includes a vast array of federal data, as well as data from some state and local governments.
-
Sage Data This link opens in a new windowFormerly Data-Planet, Sage Data has data sets in many subject areas, including crime and law enforcement, federal assistance, military deployment and deaths, economics, health, and demographics. Includes Data Planet Statistical Datasets, Data-Planet EASI Market Planner, and Data-Planet China Data Center.
-
DryadDryad is a data repository used by researchers to share the data they collect. Its strengths are in the natural sciences, but it accepts data from any and all fields of research.
-
FigshareFigshare is a repository used by researchers to share data, as well as other research materials like images or code.
-
Google Dataset SearchLike regular Google, but for finding datasets. A word of caution: many datasets online are hidden behind paywalls, and data in some fields (like business) can be extremely expensive to access. After running your search, you can use a filter labeled "Free," located below the search bar, to limit your results to free (non-paywalled) data.
-
Harvard Dataverse"The Harvard Dataverse Repository is a free data repository open to all researchers from any discipline, both inside and outside of the Harvard community, where you can share, archive, cite, access, and explore research data."
-
Inter-university Consortium for Political and Social Research (ICPSR) This link opens in a new windowICPSR maintains a data archive of more than 500,000 files of research in the social sciences. It hosts 16 specialized collections of data in education, aging, criminal justice, substance abuse, terrorism, and other fields.
-
Social Explorer This link opens in a new windowProvides quick and easy access to current and historical census data and demographic information. The web interface lets users create maps and reports to better illustrate, analyze and understand demography and social change.
Finding Government Data
One of the best tricks for finding government data is searching Google by domain suffix. Here's how it works:
Let's say you want to find data from the Minnesota state government about wolves. You know that websites run by the state of Minnesota typically end in the domain suffix .mn.us. Therefore, your Google search will look like this:
site:.mn.us wolf data
By typing "site:" and then the name of the domain (or domain suffix) you want to search, you will only find results from the site(s) you specified. This works for all kinds of sites: site:.gov will give you results from websites ending in .gov (often U.S. federal government websites, but includes other levels of U.S. government, too), while site:northfieldmn.gov will give you results from the City of Northfield's website.
This trick is especially useful when finding government data for two reasons. First, many different government agencies publish data, and it can be messy to try and figure out which agency will have the data you need. This strategy lets you search across many different agencies within the same government, all at once. Second, government domain suffixes are generally restricted, meaning not just anybody can get a website with that suffix. You can be fairly confident that the results you find will be from the government in question, though it's always a good idea to double check.
- Last Updated: Sep 3, 2025 3:07 PM
- URL: https://libraryguides.stolaf.edu/sds
- Print Page