Data Sources
Roadway Data
-
OpenStreetMap (OSM)
- OpenStreetMap (OSM) is a great resource for roadways when local data is not available or useful. Anyone can submit edits and additions to the dataset which makes it more accurate in many cases. However, it's accuracy and completeness can vary from location to location. OSM also contains data other than streets, such as buildings, land use, natural features, etc.
- Ways to access
- Geofabrik Extracts contain OSM data in the shapefile format which can be used in GIS software like QGIS. The shapefile format available from Geofabrik is a good starting point for most users, but may not be suitable for all needs and applications (e.g routing). They also have extracts for region in .pbf format which can be used with other tools like Osmium, Osmosis, osm2pgsql, etc.
- OSMNX is a Python library that simplifies the process of working with OSM data, allowing users to easily download, model, and analyze street networks. The library can be used to download OSM data for specific areas, and it provides tools for network analysis, visualization, and more. It requires python knowledge and is best suited for users comfortable with programming.
- Other tools and libraries also exist to download OSM data most of which are listed in the OSM Wiki.
-
Highway Performance Monitoring System (HPMS)
- FHWA's HPMS is a nationwide database that provides detailed information on the extent, condition, performance, and use of U.S. public roads. It supports federal and state transportation planning, performance measurement, and funding allocation. However, its limitations include inconsistencies in data collection across states and limited granularity for local or non-NHS (National Highway System) roads.
- Spatial data can be accessed from https://geo.dot.gov/server/rest/services/Hosted. This link has data available for 2023 for most states. It is not sure if this continues to be location to access HPMS GIS data in the future.
-
State DOT and Local Roadway Inventories
- Varies by location
Safety Data
-
Fatality Analysis Reporting System (FARS)
- The Fatality Analysis Reporting System (FARS) is a nationwide database maintained by the National Highway Traffic Safety Administration (NHTSA) that provides detailed information on all fatal motor vehicle crashes in the United States. A crash is included in FARS if it results in the death of a motorist, pedestrian, bicyclist, or other road user within 30 days of the incident.
- FARS crash data can be analyzed using their online Query Tool. Additionally, you can download historical crash data directly using their Crash API (documentation) or the FARS file transfer protocol (FTP) site.
-
State DOT Crash Databases
- Varies by state. SSPF webpage has a list of crash datasources by state here: SSPF Public Datasets Page. The list was compiled in 2021 and some of the links may not work or be out of date.
-
NHTSA Fatality and Injury Reporting System Tool (FIRST)
- The Fatality and Injury Reporting System Tool (FIRST) is an interactive database maintained by NHTSA that enables users to query and analyze fatal and injury crash data. It combines information from both the Fatality Analysis Reporting System (FARS) and the Crash Report Sampling System (CRSS), making it easier for practitioners and researchers to access and explore national crash statistics.
-
NHTSA State Traffic Safety Information (STSI)
- The State Traffic Safety Information (STSI) tool by NHTSA provides traffic fatality data at the state and county level, categorized by behavioral concerns such as speeding, impaired driving, and more. It supports safety planning and policy-making by offering clear, comparative visuals and statistics tailored to geographic areas and specific risk factors.
Demographic Data
-
U.S. Census Bureau - American Community Survey (ACS)
- ACS provides demographic, social, economic, and other data about the U.S. population at different geographic scales down to the block group level in many cases.
- IPUMS NHGIS provides a Census data in a more accessible format.
- Census API provides access to Census data via API. Useful for more dynamically query data and embed in other processes but may require programming knowledge.
-
Safer Streets and Roads for All (SS4A) Underserved Communities Tool
- The SS4A Underserved Communities Tool is an interactive web dashboard that displays U.S. Census tracts and highlights those designated as underserved—including tribal lands, territories, and disadvantaged communities per SS4A criteria—helping users visually assess whether jurisdictions or project areas fall within these areas. Applicants for SS4A grants use this tool to calculate and report the percentage and population residing in underserved communities as required by the grant application process.
-
Census Origin-Destination Employment Statistics (LODES)
- The LODES dataset, part of the U.S. Census Bureau’s Longitudinal Employer-Household Dynamics (LEHD) program, provides detailed spatial information on where workers live and where they are employed, enabling robust analysis of commuting patterns and employment distribution. These data support transportation planning, land use analysis, and economic development strategies by offering high-resolution origin-destination employment insights across the country.
-
National Household Travel Survey (NHTS)
- The NHTS dataset collects detailed information on the travel behavior of the American public, including data on daily trips by mode, purpose, and time of day, as well as household demographics and vehicle availability. Transportation agencies use NHTS data to inform planning decisions, forecast travel demand, and evaluate infrastructure needs across regions.
Public Health
-
National Emergency Medical Services Information System (NEMSIS)
- The National Emergency Medical Services Information System (NEMSIS) is a national database that collects standardized patient care data from EMS (Emergency Medical Services) agencies across the U.S. It enables analysis of EMS responses to support public health, safety planning, and research by providing insight into trends in pre-hospital care and emergency response.
-
Health Resources and Services Administration (HRSA) Medically Underserved Areas Tool
- The MUA Find tool, hosted by HRSA, enables users to determine if a specific location falls within a Medically Underserved Area or Population, based on federal designations using metrics like provider availability, poverty, elderly population, and infant mortality. It provides searchable filters such as geographic area, population type, and rural status, with results exportable by location.
-
Trauma Center Association of America (TCAA) "Find Your Local Trauma Center" tool
- The TCAA “Find Your Local Trauma Center” tool is an interactive map that displays U.S. trauma centers along with verified burn centers and transplant or donor centers, allowing users to locate facilities by address, ZIP code, or proximity.
-
CDC PLACES: Local Data for Better Health
- The CDC PLACES initiative provides model-based health data at multiple geographic levels including counties, places, census tracts, and ZIP Code Tabulation Areas, covering measures like chronic disease prevalence, risk behaviors, and social determinants of health. Leveraging small-area estimation models grounded in CDC and ACS data, PLACES empowers public health practitioners, planners, and community leaders to tailor interventions and prioritize resources with equitable and data-driven insights.
-
CDC Behavioral Risk Factor Surveillance System (BRFSS)
- The BRFSS is the largest continuously conducted health survey system in the world, collecting data via telephone from U.S. adults on health-related risk behaviors, chronic health conditions, and use of preventive services. Public agencies use BRFSS data to guide policy and program development, particularly in areas of chronic disease prevention and health equity.
-
CDC Youth Risk Behavior Surveillance System (YRBSS)
- The YRBSS monitors health-risk behaviors among middle and high school students across the U.S., including substance use, sexual behaviors, diet, physical activity, and behaviors that contribute to injuries and violence. The dataset helps schools, agencies, and community partners address trends and develop interventions to improve youth health outcomes.
-
Homelessness Management Information System (HMIS)
- The Homelessness Management Information System (HMIS) is a web-based data system that Continuums of Care (CoC) use to collect and manage client-level information on individuals and families experiencing homelessness, including demographics, service use, and outcomes, according to HUD-mandated standards. This system enables providers to coordinate care, while agencies and policymakers use aggregated HMIS data to understand homelessness at local, state, and national levels and inform policies and resource distribution.
Miscellaneous / Supplemental
-
Highway Environmental Planning Geographic Information System (HEPGIS)
- The Highway Environmental Planning Geographic Information System (HEPGIS) is the FHWA Office of Planning, Environment, and Realty's central hub for web-based GIS applications. It enables users to visualize, interact with, and download a variety of transportation-related maps and datasets, such as demographics, air quality, safety metrics, and equity indicators. Users can also find links to their state DOT's GIS website where they can search their state's spatial data resources.
-
Bureau of Transportation Statistics (BTS) National Transportation Atlas Database (NTAD)
- The National Transportation Atlas Database (NTAD), managed by the Bureau of Transportation Statistics (BTS), is a comprehensive collection of over 90 nationwide geographic datasets covering transportation networks, facilities, and infrastructure across all modes. These GIS-ready datasets include spatial and attribute data, such as modal networks and intermodal terminals, and are accompanied by metadata that aligns with federal standards, making them valuable for research, analysis, and planning at national, regional, and local levels.
-
Federal Railway Administration (FRA) Grade Crossing Inventory
- The FRA Grade Crossing Inventory is a national database maintained by the Federal Railroad Administration (FRA) that catalogs every public and private highway-rail crossing in the United States. Each crossing is reported by railroads and state DOTs using Form FRA F 6180.71, and is assigned a unique U.S. DOT Grade Crossing ID. The inventory includes detailed information such as crossing location and type, railroad characteristics, traffic control devices, and physical infrastructure features. It supports safety analysis, planning, and regulatory compliance related to rail-grade crossings.
-
Federal Transit Administration (FTA) National Transit Database (NTD)
- The National Transit Database (NTD) is a comprehensive resource maintained by the Federal Transit Administration (FTA) that provides detailed data on transit agencies across the U.S. It includes information on funding sources, vehicle and facility inventories, safety incidents, service metrics, and transit workforce data, supporting long-term planning, trend analysis, and informed decision-making by local, state, and regional agencies.
-
EPA National Walkability Index
- The National Walkability Index provides a standardized measure of walkability for every Census block group in the U.S., using factors such as intersection density, land use mix, and transit accessibility. It helps communities, planners, and researchers identify areas with strong or weak walkability to support health, equity, and sustainability goals.
-
Commodity Flow Survey Data (CFS)
- The Commodity Flow Survey (CFS) tracks the movement of goods in the U.S. by measuring tonnage, value, and mode of transport across regions and industries. Policymakers and planners use this data to assess freight transportation demands, infrastructure investment needs, environmental impacts, and safety risks related to goods movement.
-
PeopleForBikes Bicycle Network Analysis (BNA)
- The Bicycle Network Analysis (BNA) provides a comprehensive assessment of bicycle network connectivity and accessibility in urban areas. It uses a combination of data sources and data analysis to evaluate the quality and completeness of bicycle infrastructure. The BNA helps cities identify gaps and stressful areas in their bicycle networks and prioritize safety investments in infrastructure improvements.
- The BNA logic can also be implemented for custom local data using the pyBNA python library.
The resources and links provided on this webpage are offered solely for general informational purposes. They direct users to publicly available materials developed and maintained by third-party organizations, including governmental agencies and private entities. Toole Design does not create, control, maintain, or guarantee the accuracy, relevance, timeliness, or completeness of any content linked from this page.
By using this site and any of the external resources linked herein, you acknowledge and agree that Toole Design assumes no responsibility or liability for any loss, harm, injury, claim, or damage arising from your access to or use of these resources. All use is entirely at your own risk.
This page does not constitute an endorsement of any third-party organization or its content, nor does it represent a comprehensive or authoritative listing of all available safety analysis resources. The inclusion of links does not imply any partnership, recommendation, or approval.
Toole Design expressly disclaims any and all liability for damages of any kind resulting from the use, misuse, or reliance on the information presented on this site or any linked external sites.