Bogota City Traffic Accidents - Social Media Datasets
This dataset falls under the category Planning & Policy Safety of the transport space.
It contains the following data: The main dataset comprises 4,538,305 Spanish-language Twitter posts related to road accidents, gathered between October 2018 and July 2019 in Bogota, Colombia. The data were collected through the Twitter Search and Stream APIs, using a combination of traffic accident keywords and information published by accounts such as the traffic police, transportation companies, and local government agencies in Bogota. The raw data were processed to remove duplicate entries and to produce a dataset free of non-printable characters, stop words, and non-relevant information, using techniques such as lemmatization and stemming. Finally, a sample of 9,640 records was labelled to indicate whether each record is relevant to road accident information. The proposed use of the dataset is academic research on traffic accident detection and real-time traffic modelling.
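The cleaning steps described above (removing duplicates, non-printable characters, and stop words) can be illustrated with a minimal Python sketch. This is not the dataset authors' pipeline: the function names, the URL-stripping step, and the tiny stop-word list are assumptions made for illustration, and lemmatization and stemming are left out because they need an external NLP library (for example, NLTK's Spanish Snowball stemmer).

```python
import re
import unicodedata

# Hypothetical, deliberately tiny Spanish stop-word list
# (an assumption for illustration, not the list used by the dataset authors).
STOP_WORDS = {"a", "de", "el", "en", "la", "las", "los", "por", "un", "una", "y"}

URL_RE = re.compile(r"https?://\S+")
# Runs of letters only (accented letters included; digits and underscores excluded).
TOKEN_RE = re.compile(r"[^\W\d_]+")


def clean_tweet(text: str) -> str:
    """Normalize one tweet: drop URLs, non-printable characters, and stop words."""
    text = unicodedata.normalize("NFC", text)
    text = URL_RE.sub(" ", text)
    # Replace non-printable characters (control codes, zero-width marks) with spaces.
    text = "".join(ch if ch.isprintable() else " " for ch in text)
    tokens = TOKEN_RE.findall(text.lower())
    return " ".join(t for t in tokens if t not in STOP_WORDS)


def deduplicate(tweets):
    """Return cleaned tweets in first-seen order, skipping empty and repeated ones."""
    seen = set()
    unique = []
    for raw in tweets:
        cleaned = clean_tweet(raw)
        if cleaned and cleaned not in seen:
            seen.add(cleaned)
            unique.append(cleaned)
    return unique
```

Deduplicating on the cleaned text rather than the raw text means that posts differing only in punctuation, case, or links collapse into one entry, which is one plausible reading of "remove duplicate entries".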
This dataset was scouted on 2022-02-22 as part of a data sourcing project conducted by TUMI. License information might be outdated: Check original source for current licensing. The data can be accessed using the following URL / API Endpoint: https://data.mendeley.com/datasets/c2r6tk9hbg/1

Access and Use

Public
Non-federal

Additional Information
Field Value
Source https://data.mendeley.com/datasets/c2r6tk9hbg/1
Version 1.0.0
Author see dataset URL
Author email
Maintainer Scouted by TUMI project; resp. Frederic Tesfay
Maintainer email feedback@tumidata.org