TY - JOUR
T1 - Hybrid data-driven approach for truck travel time imputation
AU - Karimpour, Abolfazl
AU - Ariannezhad, Amin
AU - Wu, Yao Jan
N1 - Publisher Copyright:
© The Institution of Engineering and Technology 2019.
PY - 2019/10/1
Y1 - 2019/10/1
N2 - Truck travel time data plays a critical role in freight performance measurement and is usually collected with probevehicle technologies. However, due to low sampling rates, truck data usually suffers from missing values. The primary purpose of this study is to develop a hybrid model to accurately impute missing truck travel time data by leveraging multiple data sources. The proposed model imputes missing values by considering the interaction, similarity, and differences of the data as well as incorporating available historical information. The hybrid model achieves robust results by combining both probe vehicle and loop detector data to impute continuous missing truck travel time data in sparse datasets. The proposed model was used to impute missing truck travel time data in the National Performance Measures Research Dataset (NPMRDS). The imputation performance of the proposed model was compared with several popular imputation models including historical, spline interpolation, random forest, and bootstrapping EM. The results indicated that the proposed model was capable of imputing missing data in sparse datasets, notably when the data was missing continuously. With ∼13% mean-absolute percentage error, the hybrid model outperformed other models in imputing an entire day of missing data.
AB - Truck travel time data plays a critical role in freight performance measurement and is usually collected with probevehicle technologies. However, due to low sampling rates, truck data usually suffers from missing values. The primary purpose of this study is to develop a hybrid model to accurately impute missing truck travel time data by leveraging multiple data sources. The proposed model imputes missing values by considering the interaction, similarity, and differences of the data as well as incorporating available historical information. The hybrid model achieves robust results by combining both probe vehicle and loop detector data to impute continuous missing truck travel time data in sparse datasets. The proposed model was used to impute missing truck travel time data in the National Performance Measures Research Dataset (NPMRDS). The imputation performance of the proposed model was compared with several popular imputation models including historical, spline interpolation, random forest, and bootstrapping EM. The results indicated that the proposed model was capable of imputing missing data in sparse datasets, notably when the data was missing continuously. With ∼13% mean-absolute percentage error, the hybrid model outperformed other models in imputing an entire day of missing data.
UR - http://www.scopus.com/inward/record.url?scp=85072726463&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072726463&partnerID=8YFLogxK
U2 - 10.1049/iet-its.2018.5469
DO - 10.1049/iet-its.2018.5469
M3 - Article
AN - SCOPUS:85072726463
SN - 1751-956X
VL - 13
SP - 1518
EP - 1524
JO - IET Intelligent Transport Systems
JF - IET Intelligent Transport Systems
IS - 10
ER -