A Systematic Evaluation of Generative Models on Tabular Transportation Data
-
2025-01-01
-
Details
-
Creators:
-
Corporate Creators:
-
Subject/TRT Terms:
-
Resource Type:
-
Geographical Coverage:
-
Corporate Publisher:
-
Abstract:The sharing of large-scale transportation data is beneficial for transportation planning and policymaking; however, there are privacy concerns with data sharing, as it can include identifiable personal information, such as individuals’ home locations. To address these concerns, synthetic data generation based on real transportation data offers a promising solution that allows privacy protection while potentially preserving data utility. Although there are various synthetic data generation techniques, they are often not tailored to the unique characteristics of transportation networks. In this paper, we use New York City taxi data as a case study to conduct a systematic evaluation of the performance of widely used tabular data generative models. In addition to traditional metrics such as distribution similarity, coverage, and privacy preservation, we propose a novel graph-based metric tailored specifically for transportation data.
-
Content Notes:Copyright The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025
X. Wu et al. (Eds.): PAKDD 2025, LNAI 15876, pp. 106–118, 2025. https://doi.org/10.1007/978-981-96-8298-0_9
-
Format:
-
Collection(s):
-
Main Document Checksum:urn:sha-512:387290da193b3396ef4ba5fb418d5c8177ebd5b8a7331781e487aa59bbd71e11cb8b29d4874cce1cd956f15104d283e07c3c9aa98b5029aebb98ca9da923635a
-
Download URL:
-
File Type: