Skip to main content

Advertisement

Log in

Imputing trip purposes for long-distance travel

  • Published:
Transportation Aims and scope Submit manuscript

Abstract

Planning and policy analysis at the national, state and inter-regional corridor levels depends on reliable information and forecasts about long-distance travel. Emerging passive data collection technologies such as GPS, smartphones, and social media provide the opportunity for researchers and practitioners to potentially supplement or replace traditional long-distance travel surveys. However, certain important trip information, such as trip purpose, travel mode, and travelers’ socio-demographic characteristics, is missing from passively collected travel data. One promising solution to this data issue is to impute the missing information based on supplementary data (e.g., land use) and advanced statistical or data mining algorithms. This paper develops machine learning methods, including decision tree and meta-learning, to estimate trip purposes for long-distance passenger travel. A passively collected long-distance trip dataset is simulated from the 1995 American Travel Survey for the development and validation of the machine learning methods. The predictive accuracy of the proposed methods is evaluated for several scenarios varying with trip purposes and the extent of data availability as inputs. This research design will provide not only a practically useful approach for long-distance trip purpose imputation, but also generate valuable insights for future long-distance travel surveys. Results show that the accuracy of the trip purpose imputation methods based on all available data decreases from 95 % with two purposes (business and non-business) to 77 % with four purposes (business, personal business, social visit, and leisure). Based on a two-purpose scheme, the predictive accuracy of the imputation algorithms decreases from 95 % when all input data is used (a full-information model), to 72 % with a minimum information model that only utilizes the passively collected data. If traveler’s socio-demographic characteristics are available (possibly through other imputation models), the predictive accuracy only decreases from 95 to 91 %.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  • Bohte, W., Maat, K.: Deriving and validating trip destinations and modes for multi-day GPS-based travel surveys: a large-scale application in the Netherlands. Transp. Res. Part C 17, 285–297 (2009)

    Article  Google Scholar 

  • Chen, C., Gong, H.B., Lawson, C., Bialostozky, E.: Evaluating the feasibility of a passive travel survey collection in a complex urban environment: lessons learned from the New York City case study. Transp. Res. Part A 44, 830–840 (2010)

    Google Scholar 

  • Chen, C., Bian, L., Ma, J.: From traces to trajectories: how well can we guess activity locations from mobile phone traces? Transp. Res. Part C 46, 326–337 (2014)

    Article  Google Scholar 

  • Cohen, H., Horowitz, A., Pendyala, R.: Forecasting Statewide Freight Toolkit. Transportation Research Board, Washington, DC (2008)

    Google Scholar 

  • Deng, Z., Ji, M.: Deriving rules for trip purpose identification from GPS travel survey data and land use data: a machine learning approach. Traffic Transp. Stud., 768–777 (2010)

  • Doherty, S., Noël, N., Lee-Gosselin, M., Sirois, C., Ueno, M., Theberge, F.: Moving beyond observed outcomes: integrating Global Positioning Systems and interactive computer-based travel behavior surveys. Transp. Res. E-Circ. C 26, 449–466 (2001)

    Google Scholar 

  • Du, J., Aultman-Hall, L.: Increasing the accuracy of trip rate information from passive multi-day GPS travel datasets: automatic trip end identification issues. Transp. Res. Part A 41(3), 220–232 (2007)

    Google Scholar 

  • Federal Highway Administration: Design of a completely new approach for a national household-based long distance travel survey instrument. FHWA exploratory advanced research program final report, Prepared by Battelle, The Urban Institute, and the University of Maryland (2013)

  • Giaimo, G.T., Schiffer, R. (eds.) Statewide travel demand modeling: a peer exchange. Transp. Res. Circ. #E-C075 (2005)

  • Gong, H., Chen, C.: Automating web collection and validation of GPS data for longitudinal urban travel studies. Project report. Contract Number:49111-19-21. http://ntl.bts.gov/lib/46…340/FinalReportGPSData.pdf (2012)

  • Griffin, T., Huang, Y.: A decision tree based classification model to automate trip purpose derivation. In: The Proceedings of the 18th International Conference on Computer Applications in Industry and Engineering, Honolulu, Hawaii (2005)

  • Horowitz, A.J.: Statewide travel forecasting models (NCHRP Synthesis #358). Transp. Res. Board (2006)

  • Horowitz, A.J.: White paper: statewide travel demand forecasting. Requested by AASHTO and presented at the conference on meeting federal surface transportation requirements in statewide and metropolitan transportation planning (2008)

  • Kotsiantis, S.B., Kanellopoulos, D., Pintelas, P.E.: Data preprocessing for supervised leaning. Int J Comput Sci 1(2), 111–117 (2006)

    Google Scholar 

  • Liu, F., Janssens, D., Wets, G., Cools, M.: Annotating mobile phone location data with activity purposes using machine learning algorithms. Expert Syst. Appl. 40(8), 3299–3311 (2013)

    Article  Google Scholar 

  • Lu, Y., Zhu, S., Zhang, L.: Imputing trip purpose based on GPS travel survey data and machine learning methods. Paper presented at the 92nd transportation research board annual meeting, Washington DC, 2013

  • Lu, Y., Chen, X., Zhang, L.: A national travel demand model for the U.S.: A Person-based microsimulation approach. Paper presented at the 94th transportation research board annual meeting, Washington DC, 2015

  • Rohne, A.: OKI Experience with AirSage Data. Presented at the 2014 Innovation in Travel Modeling Conference, April 27–30, Baltimore, MD, 2014

  • Schönfelder, S., Axhausen, K., Antille, N., Bierlaire, M., Lausanne, E.: Exploring the potentials of automatically collected GPS data for travel behavior analysis: a Swedish data source. GI-Technologien für Verkehr und Logistik 13, 155–179 (2002)

    Google Scholar 

  • Schiffer, R.G.: Long-distance and rural travel transferable parameters for statewide travel forecasting models (NCHRP 735). Transp. Res. Board (2012)

  • Souleyrette, R.R., Hans, Z.N., Pathak, S.: Statewide Transportation Planning Model and Methodology Development Program. Iowa State University, Ames (1996)

    Google Scholar 

  • Shen, L., Stopher, P.R.: A process for trip purpose imputation from Global Positioning System data. Transp. Res. Part C 36, 261–267 (2013)

    Article  Google Scholar 

  • Shen, L., Stopher, P.R.: Review of GPS travel survey and GPS data-processing methods. Transp. Rev. 34(3), 316–334 (2014a)

    Article  Google Scholar 

  • Shen, L., Stopher, P.R.: Using SenseCam to pursue “ground truth” for global positioning system travel surveys. Transp. Res. Part C 42, 76–81 (2014b)

    Article  Google Scholar 

  • Stopher, P.R., Greaves, S., FitzGerald, C.: Developing and deploying a new wearable GPS device for transport applications. Paper presented to the 28th Australasian transport research forum, Sydney, 28–30 Sept 2005

  • Stopher, P., Clifford, E., Zhang, J., FitzGerald, C.: Deducing mode and purpose from GPS data. Working paper of the austrian key centre in transport and logistics, University of Sydney, Sydney, Australia (2008a)

  • Stopher, P., FitzGerald, C., Zhang, J.: Search for a Global Positioning System device to measure personal travel. Transp. Res. Part C 16(3), 350–369 (2008)

    Article  Google Scholar 

  • Wang, D., Sun, B.: Detecting activity types and trip purposes from passive GPS data: a data mining approach. In: Space-time integration in geography and GI science, pp. 211–234. Springer, Netherlands (2015)

  • Wolf, J., Guensler, R., Bachman, W.: Elimination of the travel diary: an experiment to derive trip purpose from GPS travel data. In: 80th Annual Meeting of the Transportation Research Board, p. 24. Washington DC. (2001)

  • Wolf, J., Guensler, R., Bachman, W.: Elimination of the travel diary: experiment to derive trip purpose from global positioning system travel data. Transp. Res. Rec. J. Transp. Res. Board 1768(1), 125–134 (2001)

    Article  Google Scholar 

  • Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, Burlington (2005)

    Google Scholar 

  • Zhang, L., Southworth, F., Xiong, C., Sonnenberg, A.: Methodological options and data sources for the development of long-distance passenger travel demand models: a comprehensive review. Transp. Rev. 32(4), 399–433 (2012)

    Article  Google Scholar 

  • Zhang, L., Krause, C.: An activity space based approach for capturing long distance travel using longitudinal GPS survey data, Transportation Research Board 92nd annual meeting. Washington D.C, Paper number: 13-3079 (2013)

  • Zhu, S., Levinson, D., Liu, H.: Measuring winners and losers from the new I-35W Mississippi River Bridge. In: The 89th Annual Conference of Transportation Research Board. Washington D.C (2010)

Download references

Acknowledgments

This research is financially supported by the Federal Highway Administration. The authors would also like to thank Ben Pierce and Robert Lordo from Battelle, and Robert Santos from the Urban Institute for their assistance and comments. Statements in this paper do not necessarily reflect the official views of the Federal Highway Administration. The Federal Highway Administration assumes no liability for the content or use of this paper. The authors would also like to thank the three anonymous reviewers and the guest editors for their valuable comments and suggestions that have helped improve the paper. The authors are solely responsible for all statements in this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lei Zhang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lu, Y., Zhang, L. Imputing trip purposes for long-distance travel. Transportation 42, 581–595 (2015). https://doi.org/10.1007/s11116-015-9595-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11116-015-9595-0

Keywords

Navigation