Data

Welcome to the data page, please use the table of contents to navigate. If you are looking for data, whether you know what you want or not, this directory is a good place to start. The data here is for Not all resources on this page are free, especially in the Finance section. If you are a student, check with your universities library, as you may already have access to some paid data resources. All listings are in alphabetical order. Titles (in blue) of each item in this directory are hyperlinked to their sources, simply clink on one to be redirected to that site.


Contents


Here are some great data sites which cover many topics. You never know what you might find.

Amazon Web Services – Amazon Web Services has a lot of open data which can be analyzed on your computer or in the cloud using services like AWS EC2. To check out the datasets you will need to create an AWS account (which is free). There is also AWS Marketplace which has a variety of data products from reputable companies (Not all these data products are free).

DataHub – Great place to find data on a various topics.

Data World Open Data – Data world also has a large collection of data which cover topics from economics to censuses to environment to energy. The main website is here.

FiveThirtyEight – FiveThirtyEight is a website which writes pieces on politics, sports, economics, culture, and health/science. Along with the site does political and sports forecasts. A nice feature of the site is they post all their data and forecast data. The data is both here and on Github in the data repository. Also, ESPN (FIveThirtyEight’s owner) produces a wide range of stats available.

Github – Github is a software building community which also has some data sets shared by users.

Google Dataset Search – If you know what you want, and have some key words for your search, I would suggest using this tool provided by google to find your data.

Google Trends – When something is hard to measure, like popularity of something, a potential alternative to measuring popularity is using Google Trends. The problem with this is instead of giving you raw statistics, it gives you an index of everything.

Governments:

A quick google search can be done to see if your municipality, state/province or country has open data

Kaggle – Kaggle is a great resource for everything data science. You will just need to create a free account to download dataset.

NOAA – nearly 100,000 datasets mostly related to weather, climate and the environment.

ProPublica – ProPublica is a journalism website, but they also publish all their data on their Data Store.

Reddit – Reddit has a sub reddit which is specifically for datasets.

Socrata – Socrata has an open data store which cover a wide range of topics.

SportsData – Real time and historical sports data. To see what is available use the Data Dictionary

statmuse – StatMuse, an artificial intelligence company, is the leader in search for sports information, stories and trivia. Coverage includes stats, scores, schedules, standings, fantasy, bios, recaps and more — from live games all the way back to the inaugural seasons — for the NBA, NFL, MLB and NHL.

Tableau Public Sample Data – has some great sample data sets to work with, as well as Web data connectors. These web connectors allow you to do things like analyze your own social media data.

UCI Machine Learning Repository – maintains 497 data sets as a service to the machine learning community.


Agricultural Economics

Center for International Earth Science Information Network (CIESIN) – Provides data on the changing relationship between human beings and the environment. Make sure to also take a look at the See Socioeconomic Data and Applications Center.

Centre for PRTR Data – Includes PRTR data from some OECD countries.

Environmental Data Explorer – Portal to a wealth of international and regional data on many aspects of the environment including agriculture, air quality, biodiversity, coral reefs, ecosystems, endangered and threatened species, emissions, erosion, forests, human settlements, land use, ozone, pesticides, and weather.

Gemstar Global Water Database – Surface and ground water quality data sets collected from the GEMS/Water Global Network, including more than 3,000 stations, close to four million records, and over 100 parameters.

FluxNet – Long term measurements of energy, water vapor, carbon dioxide, and energy from a variety of global ecosystems.

Intergovernmental Panel on Climate Change Socio-economic Centre. Data related to population, economic development, technology and natural resources for use in climate impact assessments.

International Energy Agency. Detailed international data on coal, electricity, natural gas, oil, renewables, carbon dioxide emissions from fuel combustion, energy prices and taxes, energy technology research and development, world energy statistics and balances, and forecasts from energy policies.

World Bank Climate Change Data – Large collection of climate change data from the world bank, from forest areas to CO2 emissions to urban populations.

World Bank - Environment – Various international, regional, and country data on various aspects of the environment.

World Input-Output Database – Time series of world input-output tables and international supply and use tables; national input-output tables and national supply and use tables; socio-economic accounts; and environmental accounts for 27 EU countries and 13 other major countries in the world for 1995 to 2009.

World Resources Institute – Provides an overview of environmental trends. Includes many data tables. Also includes datasets


Behavioural Economics

The General Social Survey – Contains a standard ‘core’ of demographic, behavioral, and attitudinal questions, plus topics of special interest. Many of the core questions have remained unchanged since 1972 to facilitate time-trend studies as well as replication of earlier findings.

Health and Retirement Study – Longitudinal panel study that surveys a representative sample of more than 26,000 Americans over the age of 50 every two years., The HRS explores the changes in labor force participation and the health transitions that individuals undergo toward the end of their work lives and in the years that follow.

Inter-university Consortium for Political and Social Research – Consortium of institutions working together to acquire and preserve social science data. Maintained at University of Michigan, ICPSR receives, processes, and distributes data on social phenomena in countries worldwide.

Odum Dataverse – A repository for all Louis Harris public opinion data and Roper Center nonproprietary poll data, arranged by title, principal investigator, and subject.

Panel Study of Income Dynamics – Nationally representative sample of over 18,000 individuals living in 5,000 families in the United States that has been collected continuously over time, including data covering employment, income, wealth, expenditures, health, marriage, childbearing, child development, philanthropy, education, and numerous other topics.

The World Values Survey – Global research project that explores people’s values and beliefs, their stability or change over time and their impact on social and political development of the societies in different countries of the world.


Crime Economics

City crime – Some cities make all their crime data public (except for serious crimes because of privacy), for example:

Some countries have rich crime data available for all areas


Development Economics

Africa Development Indicators – From the World Bank. Data on Africa, containing over 1,600 indicators, covering 53 African countries from 1961 to present.

AID Data – Microlevel aid data from around the world, articles on aid data from peer reviewed journals, and “aid data raw” which allows researchers to subset data by donor, recipient, and purpose for multiple years.

Balance of Payment Statistics (from IMF) – Balance of payments and international investment position data of individual countries and regions for the balance of payments from 1967 to the present. Useful for foreign direct investment.

CIA World Factbook – Good source for preliminary analysis on a country of interest.

FAOSTAT – Data on agriculture, food supply, food security, prices, commodities, forest products, and fisheries from the FAO.

International Food Policy Research Instute – Data on a variety of agricultural and resource topics, including hunger and poverty, social capital, livestock and more. To get to the data go to Publications & Tools > Datasets

Human Development Report Statistics – Statistics from the United Nations Development Programme (UNDP). Links to the text of national human development reports from over 250 countries and regions.

OECD iLibrary – Leading International Organization for Foreign Aid data. Includes International Development Statistics as well as statistical databases on External Debt Statistics and Geographical Distribution of Financial Aid Flows to Developing Countries.

Replication Data Wiki – This wiki serves as a database of empirical studies, the availability of replication material for them and of replication studies.

UNCTADStat – From the United Nations Conference on Trade & Development.

UN Data – Wide range of economic, social, cultural, and demographic indicators: population, environment, health, economics, technology, trade, refugees, and more.

U.S. Bureau of Economic Analysis – Detailed data on foreign direct investment involving the United States.

World Bank Open Data – Database with many variables and countries included. Be careful, there a lot of missing values for developing countries. Microdata is also available.

World Development Indicators – Time series data from 1960 for 207 countries in the areas of population, labour, education, economics, the environment and much more.

World Income Inequality Database – Information for countries on income inequalities at both cross-country and time series levels over the period 1950-98, with a focus on the period since 1980. See also the University of Texas Income Inequality Project.


Demographics

The American Community Survey (US Census Bureau) – Began in 2005, annual survey replacing the census long (15%) sample. Consider using IPUMS.

American Factfinder (U.S. Census Bureau) – Database for online source for accessing population, housing, economic and geographic data from the from the U.S. Census, including the Census of Population and Housing and the American Community Survey.

Current Population Survey – Includes many supplemental surveys on various topics such as child support, tobacco, voting, computer use, identity theft.

Health Nutrition and Population Statistics – Health, nutrition and population statistics gathered from international sources.

Human Mortality Database – Mortality and population data for researchers interested in the history of human longevity. From UC Berkeley and the Max Planck Institute for Demographic Research.

Integrated Public Use Microdata Series (IPUMS) – From the Minnesota Population Center. Provides census and survey data from around the world integrated across time and space. Includes the following: U.S. census micro-data with enhanced documentation. Data includes decennial censuses from 1790 to 2010 and American Community Surveys (ACS) from 2000 to the present. Now includes preliminary complete count data for several censuses up to 1940. Micro-data from the monthly U.S. labor force survey, the Current Population Survey (CPS), from 1962 to the present. Demographic and employment data as well as special topics such as fertility, tobacco use, volunteer activities, voter registration, computer and internet use, food security, etc.. and Census microdata for over 90 countries worldwide.

United States Census Bureau – The principal source of periodic U.S demographic data. Major programs include the Census of Population and Housing available via American Factfinder and the American Community Survey.

UN Demographic Yearbook System – Statistics on population, births, deaths, marriage and divorce annually as well as economic activity, educational attainment, housing, ethnicity, language, and foreign-born and foreign populations by country from 1948.


Economic History

Archival Federal Reserve Economic Data (ALFRED) – “Economic data time travel” from the St. Louis Federal Reserve. Includes a section on International Data.

EHNet Databases – Historical data sets from the Economic History Association: global financial data, wages, bond trading, early securities prices, developing country exports, historical labor statistics, and much more.

European State Finance Database – Collaborative research project for the collection and dissemination of data on European fiscal history across the medieval, early modern & modern periods. NOTE - this is available on the UK Data Archive. See the entry for their catalog and how to register on this page.

FRASER – FRASER is a digital library of U.S. economic, financial, and banking history—particularly the history of the Federal Reserve System.

Global Financial Data – Extremely long runs of historical data on U.S. stock prices and macroeconomic data for the U.S. and foreign countries.

Global Commodity Prices Database – Data lists can be viewed by geographic market, currency, and commodity, and can be queried. Data is from 1260 to 1914.

Global Price and Income History Group Prices, wages, income, and measures of economic well-being for countries and cities around the world before 1950. Focuses primarily on the middle ages.

Historic Public Debt Database – Comprehensive database from the IMF on gross government debt-to-GDP ratios spanning an exceptionally long time period.

IMF eLibrary – Includes data on exchange and interest rates, balance of payments, government finance, national accounts, aggregate trade, prices, foreign reserves, and more. From the International Monetary Fund (IMF). From 1946 to present.

Integrated Public Use Microdata Series (IPUMS) International – From the Minnesota Population Center. Micro-data for censuses from the United States back to 1850 as well as other nations.

Jorda-Schularick-Taylor Macrohistory Database – Covers 17 advanced economies since 1870 annually, comprised 25 real and nominal variables with over 90 percent of advanced-economy output and 50 percent of world output.

League of Nations Statistical Yearbooks From Northwestern University – Digital League of Nations statistical yearbooks from 1926-1944. Includes data on population, commerce, public finance, currency, production, prices and more.

Maddison Project Database – Project from colleagues of Angus Maddison to continue his work on measuring long-term economic performance for different regions, time periods and topics.

NBER Macroeconomic History Data – Covers pre-WWI and interwar economies: production, construction, employment, money, prices, foreign trade, and government activity. Some international coverage.

Total Economy Database – From the Conference Board. Annual data covering GDP, population, employment, hours, labor quality, capital services, labor productivity, and total factor productivity for over 120 countries in the world.

TRADHIST – Bilateral Trade and Gravity Data set with more than 1.9 million bilateral trade observations for the 188 years from 1827 to 2014. Includes bilateral nominal trade flows, country level aggregate nominal exports and imports, nominal GDP, exchange rates, and bilateral factors known to favor or hamper trade, including geographical distance, colonial and linguistic links and bilateral tariffs. The dataset is used to estimate the evolution of trade costs over two centuries.

World Trade Historical Database – Annual series of trade by polity from 1800 to 1938 by country, continent and world totals.


Environmental/Energy Economics

Biking Data (Biking is a growing area in environmental economics) is published in some cities. Specifically these are bike sharing programs:

Center for International Earth Science Information Network – Provides data and other information on the changing relationship between human beings and the environment. See also the Socioeconomic Data and Applications Center.

Centre for PRTR Data – Includes PRTR (Pollutant Release and Transfer Registers; USA version is the Toxic Release Inventory data from selected OECD countries.

Gemstar Global Water Database – Surface and ground water quality data sets collected from the GEMS/Water Global Network, including more than 3,000 stations, close to four million records, and over 100 parameters.

FluxNet – Long term measurements of energy, water vapor, carbon dioxide, and energy from a variety of global ecosystems.

Intergovernmental Panel on Climate Change – Data on population, economic development, technology and natural resources for climate assessment.

International Energy Agency – International data on coal, electricity, natural gas, oil, renewables, emissions, energy prices and taxes, energy R&D, world energy statistics, and forecasts.

JODI oil – Joint Organizations Data Initiative . International JODI database providing current data on oil produced in 30 countries.

NOAA – nearly 100,000 datasets mostly related to weather, climate and the environment.

Primary, Final and Useful Energy Database – Historical database of energy use by country or region, energy level, sector, energy carrier, and end-use type for 20 countries, for the 20th century.

RICE Model – Examines alternative outcomes for emissions, climate change, and damages under different policy scenarios using the RICE model (Regional Integrated model of Climate and the Economy).

United Nations Environmental Data Explorer – Portal to a wealth of international and regional data on many aspects of the environment including agriculture, air quality, biodiversity, coral reefs, ecosystems, endangered species, emissions, erosion, forests, human settlements, land use, ozone, pesticides, and weather.

World Bank Climate Change Data – Large collection of climate change data from the world bank, from forest areas to CO2 emissions to urban populations.

World Bank. Data and Statistics. Environment. – Various international, regional, and country data on various aspects of the environment.

World Input-Output Database – Time series of world input-output tables and international supply and use tables; national input-output tables and national supply and use tables; socio-economic accounts; and environmental accounts for 27 EU countries and 13 other major countries in the world for 1995 to 2009.

World Resources Institute – Flagship publication is World Resources Report which provides an overview of environmental trends. Includes many data tables. Also includes datasets.


Finance

Amadeus – Pan-European financial database containing information on approximately 20 million companies from 43 countries, including all the EU countries and Eastern Europe. Up to 10 years of detailed information and consolidated statements provided when available. From Bureau Van Dyck

CEIC Global – Database of over 400,000 time series covering a growing list of countries in Asia, Europe, and the Americas. Data items include: National Accounts, Industrial, Sales, Construction-Property, Domestic and Foreign Trade, Stock Markets, Banking, Inflation, Monetary, Forex, Investment, and more.

Factset – A comprehensive platform used to analyze financial data from global equity and fixed income markets, and public and private companies. Requires registration, UCB Students Staff and Faculty only.

Global Financial Data – Extremely long runs of historical data on U.S. stock prices and macroeconomic data for the U.S. and foreign countries. Long-term Financial, Interest Rate, United States Daily Stock Market, Global Stock Market, Total Return, Annual Data, and Dow Jones Industrial Average Intraday.

Infogroup – United States Historical Business Data. Geo-coded records of millions of US businesses with basic information on each entity, such as contact information, industry, revenues, employees, and other data annually from 1997-2018. Data files are compressed and software such as 7-Zip is required to unzip the archives. ​The following files for using the data can also be accessed from the link above.

Yahoo Finance API – great for free basic stock market data. for alternatives see here.


Health Economics

California Health Interview Survey – The largest state health survey in the United States conducted every two years on public health topics and access to health care for Californians.

CDC Wonder – AIDS cases by metropolitan and rural area, natality, cancer statistics, environmental data, mortality data, infant deaths, tuberculosis data, STD morbidity, vaccine adverse reporting, and population estimates.

Chronic Disease and Health Promotion Indicators Data – Portal of raw data covering multiples factors relating to Chronic disease, its causes. and its prevention.

IPUMS Demographic Health Surveys – Facilitates analysis of the Demographic and Health Surveys, administered in low- and middle-income countries since the 1980’s with consistently coded variables on the health and well-being of women, children, and births. Additional surveys are available from USAID.

Health Nutrition and Population Statistics – Health, nutrition and population statistics gathered from international sources. From the World Bank.

HCUPNet – Provides access to national health statistics and information on hospital inpatient and emergency department utilization.

Global Health Observatory – Health and health-related epidemiological and statistical information available from the World Health Organization. See their Global Health Expenditures Database.

OECD Health Statistics – Health and health systems data across OECD countries. Indicators include: health status, health care resources and utilization, expenditures & financing, social protection, pharmaceuticals, and demographic and economic variables.

National Center for Health Statistics – Website for the nation’s principal health statistics agency, compiling statistical information to guide actions and policies to improve the health of Americans.

National Vital Statistics System – Listing of surveys and some fast statistics for births, deaths, marriages, divorces, and fetal deaths for the United States.

Integrated Health Interview Series – Harmonized data and documentation for the U.S. National Health Interview Survey (NHIS)

National Health Care Surveys – Survey data to answer questions of interest to health care policy makers, public health professionals, and researchers.

National Longitudinal Study of Adolescent Health – Study of adolescents in the United States followed into adulthood. Data on social, economic, psychological and physical well-being, the family, neighborhood, community, school, friendships, peer groups, etc.

Robert Wood Johnson Data Hub – Tracks state-level data and allows users to customize and visualize facts and figures on key health and health care topics. Health insurance coverage estimates from the Current Population Survey’s Annual Social and Economic Supplement and the American Community Survey.


Labour

American Factfinder (U.S. Census Bureau) Database for online source for accessing population, housing, economic and geographic data from the from the U.S. Census, including the Census of Population and Housing and the American Community Survey.

CHASSMicro Data Surveys From the University of Toronto social sciences data center. Canadian Labour Force Surveys here.

CRDC – Provides universities, governments and other approved researchers ready access to a vast array of social, economic and health confidential microdata in secure computer facilities located on university campuses across the country. You must request this data and there is an approval process.

Integrated Public Use Microdata Series (IPUMS) – From the Minnesota Population Center. Provides census and survey data from around the world integrated across time and space. Includes the following:

  • IPUMS USA – U.S. census micro-data with enhanced documentation. Data includes censuses from 1790 to 2010 and American Community Surveys (ACS) from 2000 to the present. Now includes preliminary complete count data for several censuses up to 1940.

  • IPUMS CPS – Micro-data from the monthly U.S. labor force survey, the Current Population Survey (CPS), from 1962 to the present. Demographic and employment data as well as special topics such as fertility, tobacco use, volunteer activities, voter registration, computer and internet use, food security, etc..

Longitudinal Employer Household Dynamics – Several data products that characterize workforce dynamics for specific groups. The Quarterly Workforce Indicators (QWI), LEHD Origin-Destination Employment Statistics (LODES), Job-to-Job Flows (J2J), and Post-Secondary Employment Outcomes (PSEO) are available for public use.

  • Quarterly Workforce Indicators – Employment, job creation, earnings, and other measures of employment based on detailed firm characteristics (geography, industry, age, size) and worker demographics (sex, age, education, race, ethnicity) at the state, metropolitan, county, and Workforce Investment Board (WIB) areas.

  • LEHD Origin-Destination Employment Statistics (LODES) – Detailed spatial distributions of workers’ employment and residential locations and the relation between the two at the Census Block level as well as detail on age, earnings, industry distributions, and local workforce indicators.

  • Job-to-Job Flows – Statistics on worker reallocation in the United States constructed from the LEHD data. The initial release of national data distinguishes hires and separations associated with job change from hires from and separations to non-employment.

  • Post-Secondary Outcomes – Experimental tabulations with earnings and employment outcomes for college and university graduates by degree level, degree major, and post-secondary institution.

Neighborhood Change Database – Contains US tract-level data from the 1970, 1980, 1990, 2000 and 2010 decennial censuses with variables and tract boundaries that are consistently defined across census years with details such as population, household, and housing characteristics, income, poverty status, education level, employment, housing costs, immigration, and other variables.

United States Census Bureau – The principal source of periodic U.S demographic data. Major programs include the Census of Population and Housing available via American Factfinder and the American Community Survey.

United States Bureau of Labor Statistics – Data on inflation and prices, unemployment, productivity, wages, and much more. The BLS manages the Current Population Survey (see also IPUMS version) and the National Longitudinal Surveys.


International/Macro/Monetary Economics

Amadeus – European financial database containing information on approximately 20 million companies from 43 countries. Up to 10 years of detailed information and consolidated statements provided when available.

CEPII – Collection of world trade database with accompanying trade models built by CEPII, a French international trade research center. All databases are free except BACI for which users have to demonstrate their organization has a UN Comtrade subscription (UC Berkeley subscribes).

Central Banks – central banks can be a great source of data, but keep in mind of which countries are more likely to have corrupted data. Here is a link for the Bank of Canada

The Economist – data on major economic and financial indicators. Also see the big mac index data.

European Central Bank – The central bank for Europe’s single currency, the Euro. This website contains statistics on inflation, GDP, the unemployment rate and labor productivity, amongst other topics.

FRED – Over 650,000 US and International economic time-series.

Global Financial Data – Long runs of historical data (over 6,000 series) and macroeconomic trends for over 150 countries: financials, interest rates, exchange rates and more. Includes online encyclopedias that describe each dataset.

Global Consumption Database – From the World Bank. One-stop source of data on household consumption patterns in developing countries

OECD iLibrary – Data from the Organization for Economic Co-operation and Development (OECD). Includes data on economics and finance, trade, telecommunications, development, foreign aid, migration, and other categories.

Statistics Canada – Time series data for the Canadian economy and its people: national accounts, prices, trade, demographics, historical census data, elections, agricultural, education, energy, and more. Also see their main website.

Total Economy Database – From the Conference Board. Annual data covering GDP, population, employment, hours, labor quality, capital services, labor productivity, and total factor productivity for over 120 countries.

UN Data – From the United Nations. Wide range of economic, social, cultural, and demographic indicators: population, environment, health, economics, technology, trade, refugees, and more.

UN Comtrade – Detailed trade data from 130 countries, detailed by commodity and partner country.

UNCTADStat – Includes data on Foreign Direct Investment, remittances, merchandise trade, the information and creative economies, and maritime transport, and trade openness indicators.

UNIDO Statistics Data Portal – International statistical data on industries and mining from the United Nations Industrial Development Organization.

World Bank Data Catalog – Catalog of international economic, financial, and socio-economic data sets from the World Bank. Among the most useful of the many datasets offered is World Development Indicators featuring time series data from 1960 for 207 countries in the areas of population, labour, education, economics, the environment and much more.

World Competitiveness Online – Country profiles and quantitative competitiveness rankings on criteria related to domestic and international economics, government efficiency, talent and training, business efficiency, and infrastructure. About 1/3 of the data are taken from privately administered surveys.

World Inequality Database – On the historical evolution of the world distribution of income and wealth, both within countries and between countries.

World Input-Output Database – Time series of world input-output tables and international supply and use tables; national input-output tables and national supply and use tables; socio-economic accounts; and environmental accounts for 27 EU countries and 13 other major countries.

Wharton Research Data Service – Business data research service from The Wharton School providing access to financial, economic, and company data.


Political Economy

Canadian Elections

Center for Systemic Peace – Sponsor of the Integrated Network for Societal Conflict Research which has data on armed conflict and interventions, political regime characteristics, armed conflict, state fragility, and much more.

CIRI Human Rights Data Project – Data measuring national government respect for internationally recognized human rights instruments from 1981-2011.

Comparative Constitutions Data – Investigates the sources and consequences of constitutional choices by collecting and analyzing data on the characteristics of constitutions for independent states since 1789.

Correlates of War – Data on wars, non-state wars, and interstate disputes dating back to the 19th century. Associated or correlated measures include membership in international organizations, religion, alliances, colonial dependencies, diplomatic exchanges, and more

Database of Political Institutions – Data on the political institutions and regime characteristics of countries from 1975.

Democracy Times Series Data – Contains data on the social, economic and political characteristics of 191 nations with over 600 variables from 1971.

Followthemoney.org – Provides free access to federal and state level campaign contributions. Data can also be accessed through an API or downloaded.

Global Terrorism Databases – Open-source databases with information on terrorist events around the world from 1970. Includes data on domestic as well as international terrorist incidents.

International Military Intervention – Records events involving “the movement of regular troops or forces (airborne, seaborne, shelling, etc) of one country inside another, in the context of some political issue or dispute.”

Inter-university Consortium for Political and Social Research – Consortium of institutions working to acquire and preserve social science data. Maintained at University of Michigan.

LobbyView – Via MIT, allows tracking of lobby funds going back to the late 1990’s for many organizations, companies, cities, etc. Track by organization or bill number. Also provides access to lobbying disclosure forms.

Political Terror Scale – Codes countries on human-rights and the rule of law, based on reports from Amnesty International and the U.S. State Department.

RAND Database of Worldwide Terrorism Incidents – Records terrorist incidents that occurred from 1968-2009.

Stockholm International Peace Research Institute – Data on security, peace operations, military expenditures, arms transfers, arms embargoes, and arms exports.

State Fragility Index and Matrix – Annual state fragility, effectiveness, and legitimacy indices and component indicators for world countries with populations greater than 500,000.​

US elections


North American Football

CFL Database – Complete CFL database on the Canadian football league going back to its conception. This includes franchise, financial, stadium, player, league, and game information.

Football Outsiders – Football analytics and other advanced stats at both the player and team level. The stats go back to 1986. For definitions of what each stat is, use the glossary.

The Football Database – Database on football. Complete history of the NFL, AAF, and CFL.

NextGenStats – Another football analytics website with data back to the 2016. For explanations on the stats posted, use the glossary. The site also has visualization tools.

The NFL Big Data Bowl – Data on Kaggle from the NFL Big Data Bowl.

NFL Savant – NFLsavant.com is a web site dedicated to providing advanced NFL statistics in a simple to use interface. Also has excellent, downloadable play by play data.

NFL Website – Full of information and stats for both past and present.

Pro Football Reference – Football Stats and Histories. The complete source for current and historical NFL/AFL players, teams, scores and leaders. Data can also by copied into excel files. There is also a college football database here. For explanations on the stats posted and terms used, see the glossary.


Baseball

Baseball Prospectus – Site has Interesting statistical analysis, articles, projections, and fantasy information.

Baseball Reference – Devoted to tracking statistics for baseball teams and players from around the world. Data can also by copied into excel files. For explanations on the stats used, see their glossary.

Chadwick Baseball Bureau – Great source of free baseball data.

Fan Graphs – Both a blog and baseball database. Good scouting information about players and teams is on the site. There are also great player profiles and tools. For explanations on the stats used, see their glossary.

MLB Trade Rumours – A database which tracks the transaction side of the MLB, mainly trades, signings and roster moves.

Retrosheet – Retrosheet is a very rich baseball database that has free data downloads.

For a more extensive list of other resources for baseball, use Baseball Guru.


Basketball

Basketball Reference – Basketball Stats and Histories. The complete source for current and historical NBA/ABA players, teams, scores and leaders. Data can also by copied into excel files. There is also a college basketball database here. For information on any stats on the site, use the glossary.

Cleaning The Glass – The stats site aims to provide advanced statistics and tools that are more accurate, easier to use, and hard to find elsewhere to help you do your own analysis.

NBA – Official site of the NBA. Full of all recent stats.

NBAstuffer – Lots of basketball analytics data and referee data.

82games – has in-depth stats for NBA teams and players, designed to accurately highlight true performance levels. The site also posts its data.


Cricket

CricSheet – The site provides ball-by-ball data for Men’s and Women’s Test Matches, One-day internationals, Twenty20 Internationals, some other international T20s, and various club competitions such as all Indian Premier League seasons, and some Big Bash League, T20 Blast, and Pakistan Super League matches.


Football (Soccer)

Data Hub Resources – If you are looking for free football data, this resource page is a great place to start.

Football Manager. Football manager is an excellent football simulation game. In order to make this game, vast amounts of data are required on players, clubs and leagues. You can download a free demo of the game and explore history on clubs and players.The game also has a separate app for their database as well. Finally, there are other sites which post football manager data like SortItOutSI, Fmdataba, and Fmscout.

Football Reference – Football Stats and Histories. The complete source for current and historical players, teams, scores and leaders in all major soccer leagues. Data can also by copied into excel files.

Football Data UK – Great place to download rich match level data for all major soccer leagues. Just click on your league of interest to be redirected to data files for that league, organized by season.

Footystat – FootyStats is the premier football stats and analysis site, with data coverage in 500+ football leagues worldwide including UK, Europe, and South America. Team stats, League stats, and Player stats are covered with details on form, goals scored, conceded, shots, xG, corner stats, and more. You can also download the data using our CSV or API service.

Transfer Markt – This site has excellent information on transfers in world football. Along with transfers, it also has a database on current player market values. The site has player, coach, and club level statistics as well.

Sofifa – A FIFA (The video game) player database which allows you to search players based on their attributes and ratings.

Who Scored – Great stats and insights as well as live scores.\


Golf

Data Golf – A website focused on predictive modelling as well as historical data. The site provides advanced insight into how players on the PGA are doing, and why they may not be performing as expected. Visualization tools are also available.

Golf Stats – The site contains a very complete database if you are wanting to look up facts about players or tournaments. A data export tool is also available on the site.


Hockey

CapFriendly – CapFriendly is an independent compiler of contract information for the National Hockey League. Any information on the business side of the NHL can be found here.

Corsica Hockey – Provider of statistics, advanced stats, predictions and betting resources.

Elite Prospects – Focused on Hockey prospects from around the world.

Evolving Hockey Contains some of the most advanced stats you can find, including “above replacement” stats. There are also visualization tools and by becoming a patron you can download their data.

hockeydb – Site is full of player statistics and histories of leagues.

Hockey Abstract – Full of analytics information and very rich datasets, available for download here

Hockey Reference – NHL Stats and History. The complete source for current and historical NHL players, teams, scores and leaders. Data can also by copied into excel files. For explanations on the stats on the site, use the glossary.

MoneyPuck – Site full of hockey analytics, predictions and visualization tools. The site also posts all of its analytics data – free to download. For a deeper explanation about how the stats and game model work, see the about page.

Natural Stat Trick – Good place for hockey analytics. Also a good place to download analytics csv files for recent seasons.

NHL – The official site of the National Hockey League.

NHL Trade Tracker – Every trade that has ever happened in the NHL. Trying to figure out who went where, and when? This is the place to go.

PuckIQ – uses analytics to rank NHL players and shares their analytics data.

Puckpedia – Similar to CapFriendly, this is another site focused on the business side of hockey – finances and transactions.

The Unofficial Uniform Database – Curious what a teams uniform looked like in a given season? This is the place to go.


Tennis

Tennis Abstract – The Tennis abstract is a great database if you are looking for specific tennis facts. Forecasts are also available for tournaments.

Tennis Data UK – If you are looking for tennis results data, this is a great place to go. The site has data easy to download in excel files, Under the odds and results tab on the right just select a tournament.


The Business Side of Sports

Sportrac – If you are looking for contract information, and club payroll information across several sports leagues, Sportrac has a very complete database which can be freely accessed. To use the sites premium tools like historical contracts and complete league breakdowns, you will need to create an account.