Journal Article
Inferring Patterns in the Multi-week Activity Sequences of Public Transport Users

The public transport networks of dense cities such as London serve passengers with widely different travel patterns. In line with the diverse lives of urban dwellers, activities and journeys are combined within days and across days in diverse sequences. From personal- ized customer information, to improved travel demand models, understanding this type of heterogeneity among transit users is relevant to a number of applications core to public transport agencies’ function. In this study, passenger heterogeneity is investigated based on a longitudinal representation of each user’s multi-week activity sequence derived from smart card data. We propose a methodology leveraging this representation to identify clusters of users with similar activity sequence structure. The methodology is applied to a large sample (n=33,026) from London’s public transport network, in which each passenger is represented by a continuous 4-week activity sequence. The application re- veals 11 clusters, each characterized by a distinct sequence structure. Socio-demographic information available for a small sample of users (n=1,973) is combined to smart card transactions to analyze associations between the identified patterns and demographic attributes including passenger age, occupation, household composition and income, and vehicle ownership. The analysis reveals that significant connections exist between the demographic attributes of users and activity patterns identified exclusively from fare transactions.

Title
Publication TypeJournal Article
Year of Publication2016
AuthorsGoulet-Langlois G, Koutsopoulos HN, Zhao J
JournalTransportation Research Part C
Pagination1-16
Date Published02/2016
KeywordsActivity Sequence, Public Transportation, Smart Card Data, Travel Behavior, User Clustering
Abstract

The public transport networks of dense cities such as London serve passengers with widely different travel patterns. In line with the diverse lives of urban dwellers, activities and journeys are combined within days and across days in diverse sequences. From personal- ized customer information, to improved travel demand models, understanding this type of heterogeneity among transit users is relevant to a number of applications core to public transport agencies’ function. In this study, passenger heterogeneity is investigated based on a longitudinal representation of each user’s multi-week activity sequence derived from smart card data. We propose a methodology leveraging this representation to identify clusters of users with similar activity sequence structure. The methodology is applied to a large sample (n=33,026) from London’s public transport network, in which each passenger is represented by a continuous 4-week activity sequence. The application re- veals 11 clusters, each characterized by a distinct sequence structure. Socio-demographic information available for a small sample of users (n=1,973) is combined to smart card transactions to analyze associations between the identified patterns and demographic attributes including passenger age, occupation, household composition and income, and vehicle ownership. The analysis reveals that significant connections exist between the demographic attributes of users and activity patterns identified exclusively from fare transactions.