Sources and Mitigation of Bias in Big Data for Transportation Safety [Supporting Dataset]
-
2018-12-14
Details:
-
Creators:
-
Corporate Creators:
-
Corporate Contributors:
-
Subject/TRT Terms:
-
Publication/ Report Number:
-
DOI:
-
Resource Type:
-
Geographical Coverage:
-
Corporate Publisher:
-
Abstract:Emerging big data resources and practices provide opportunities to improve transportation safety planning and outcomes. However, researchers and practitioners recognize that big data includes biases both in who the data represents and in accuracy related to transportation safety statistics. This study systematically reviews both the sources of bias and approaches to mitigate bias through a review of published studies and interviews with experts. The study includes quantified analysis of topic frequency and an evaluation of the reliability of concepts using two independent trained coders. Results show a need to keep transportation experts and the public central in determining the right goals and metrics to evaluate transportation safety, in the development of new methods to relate big data to the total population’s transportation safety needs, in the use of big data to solve difficult problems, and to work ahead of emerging trends and technologies. The total size of the described zip file is 1.42 GB. Files with the .xlsx extension are Microsoft Excel spreadsheet files. These can be opened in Excel or open-source spreadsheet programs. Text files can be viewed in Notepad or any document reading software. JPG files can be opened using the system's photo viewer. Python files hold Python project code. They can be opened using open-source software such as PyCharm. AVI files are video files that can be opened using the system's video player. IPYNB files can be opened using software such as Jupyter Notebook, which allows programmers to make and share documents with live code. DOCX files are document files created in Microsoft Word. These files can be opened using Microsoft Word or with an open-source word processor such as Apache OpenOffice. PNG files can be opened using the system's photo viewer. Files that end in .INO are created using Arduino, an open-source electronics prototyping platform, and can be opened using the Arduino software.
DB files can be opened using dBase but can also be opened through Microsoft Excel or an open-source spreadsheet program. Swiftdoc and swiftmodule files can be opened using Swift software, which is free. JSON files are used to store and share data objects and can be opened using any text reader software such as Notepad. Files with the .md extension are Markdown text files, commonly used for documentation on GitHub, and can be opened in a basic text editor. The following file types are standard for GIS mapping software: AUX, CSF, DBF, PRJ, SBN, SBX, SGML, SHP, LOCK, SHX, CPG, LYR, MXD, FDBINDEXES, GDBTABLE, GDBTABLX, ATX, SPX. Because the files pertain to map layers and images, they are best viewed using the Kingdom: Seismic and Geological Interpretation software that the team used or with any open-source 2D and 3D mapping software.
-
Content Notes:National Transportation Library (NTL) Curation Note: As this dataset is preserved in a repository outside U.S. DOT control, as allowed by the U.S. DOT's Public Access Plan (https://doi.org/10.21949/1503647) Section 7.4.2 Data, the NTL staff has performed NO additional curation actions on this dataset. The current level of dataset documentation is the responsibility of the dataset creator. NTL staff last accessed this dataset at its repository URL on 2022-11-11. If, in the future, you have trouble accessing this dataset at the host repository, please email NTLDataCurator@dot.gov describing your problem. NTL staff will do its best to assist you at that time.
-
Format:
-
Funding:
-
Collection(s):
-
Main Document Checksum: