Integrating Large Language Models for Traffic Crash Classification in Data-Constrained Regions

dc.contributor.authorAshqar, Huthaifa
dc.contributor.authorJaber, Ahmed
dc.contributor.authorAl-Sahili, Khaled
dc.contributor.authorAshqer, Mujahid
dc.contributor.authorHassouna, Fady
dc.date.accessioned2026-01-09T05:43:30Z
dc.date.available2026-01-09T05:43:30Z
dc.date.issued2025-09-29
dc.descriptionNumber of pages: 8, 2025 Engineering for Palestine Conference (ENG4PAL) PPU, Hebron, Palestine, September 29-30, 2025en_US
dc.description.abstractTraffic crashes remain a critical public safety issue in developing regions, where poor data quality, inconsistent reporting, and inadequate classification frameworks hinder effective analysis and policy action. This study presents a novel methodology for enhancing traffic crash classification using advanced large language models in a data-constrained environment. We employ OpenAI’s GPT-4o to reclassify 48,815 crash records from Palestine (2019–2022), converting unstructured narratives into a structured taxonomy aligned with international standards. Using few-shot learning, GPT-4o achieved over 92% classification accuracy, revealing dominant causes such as failure to yield (42.64%) and driver negligence (34.76%), while identifying 11.55% of cases as pedestrian related. To explore predictive potential, we integrate this refined dataset into a ConvLSTM (Convolutional Long Short-Term Memory) model, capturing spatio-temporal dependencies to forecast crash frequencies across regions and time intervals. This integration of large language model-driven reclassification with deep spatio-temporal learning not only improves the reliability of crash datasets but also uncovers actionable insights for targeted road safety interventions. Comparative analysis with existing literature highlights the novelty of applying LLMs in tandem with spatio-temporal models for traffic analysis in low resource settings. The findings carry significant implications for data-driven policymaking, infrastructure design, and urban traffic safety strategies.en_US
dc.identifier.urischolar.ppu.edu/handle/123456789/9397
dc.language.isoenen_US
dc.publisherPalestine Polytechnic Universityen_US
dc.subjectIntegrating, Large, Language, Models, Traffic, Crash, Classification, Data-Constrained, Regionsen_US
dc.titleIntegrating Large Language Models for Traffic Crash Classification in Data-Constrained Regionsen_US
dc.typeWorking Paperen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Book_submission_163_pdf (p_564-571)_77.pdf
Size:
553.99 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: