Open Road Risk
  • Home
  • Project
    • Project overview
    • Current model status
    • AI-assisted development
  • Background
    • Metrics and methodology
    • Literature evidence register
  • Literature
    • Crash frequency models
    • Exposure and traffic volume
    • Spatial methods and network risk
    • Junctions and conflict structure
    • Severity modelling
    • Validation and metrics
    • Transferability and open data limits
  • Data Sources
    • Overview
    • STATS19 Collisions
    • OS Open Roads
    • AADF Traffic Counts
    • WebTRIS Sensors
    • Network Model GDB
  • Methodology
    • Methodology Overview
    • Joining the Datasets
    • Feature Engineering
    • Empirical Bayes Shrinkage
  • Exploratory Data Analysis
    • Collision EDA
    • Collision-Exposure Behaviour
    • Vehicle Mix Analysis
    • Road Curvature
    • Months and Days of Week
    • Traffic Volume EDA
    • OSM Coverage
  • Models
    • Modelling Approach
    • Stage 1a: Traffic Volume
    • Stage 1b: Time-Zone Profiles
    • Stage 2: Collision Risk Model
    • Facility Family Split
    • Model Inventory
  • Outputs
    • Top-risk map
  • Future Work

On this page

  • Traffic
  • Collision risk
  • What this project is
  • Why this matters
  • Site guide

Open Road Risk

Exposure-adjusted road risk modelling from open transport data

Traffic

Collision risk

What this project is

Open Road Risk is an open-source pipeline for estimating exposure-adjusted collision risk across the road network.

The core idea is simple:

raw collision counts are misleading without traffic context.

A road with ten reported collisions may be unusually dangerous if it carries very little traffic, or relatively safe if it carries very high traffic volumes. To compare roads fairly, collisions need to be interpreted relative to exposure.

This project combines open road network, traffic, and collision datasets to estimate risk at road-link level and support network screening, corridor comparison, and safety analysis.

Why this matters

Most roads do not have direct traffic measurements.

Official traffic counts are concentrated on major roads and sampled count points, while collision records exist across the network. That creates a common problem in road safety analysis:

  • collisions are observed widely
  • exposure is only partially observed
  • direct comparison is therefore difficult

This pipeline addresses that by estimating traffic where it is not measured and then modelling collision risk in relation to that estimated exposure.

Site guide

The site is organised around the logic of the pipeline:

  • Project Overview — overview of repository layout, modelling stages, and current status
  • Background — metrics, benchmarks, and methodology for road crash modelling
  • Data Sources — what each dataset contains and what it can and cannot tell us
  • Methodology — how sources are joined and transformed into modelling inputs
  • Analysis — model behaviour, outputs, and exploratory evaluation
  • Future Work — research questions and extensions the pipeline can support

A good place to start is with the source pages for STATS19, AADF, and WebTRIS, then move to the methodology pages on joining and feature engineering.

For possible extensions, Future Work collects research questions that are not in the active backlog but are natural next uses of the same road-link, exposure, and collision-risk infrastructure.

Open Road Risk

 

Built with Quarto