skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Data-driven Bus Crowding Prediction Models Using Context-specific Features
Public transit is one of the first things that come to mind when someone talks about “smart cities.” As a result, many technologies, applications, and infrastructure have already been deployed to bring the promise of the smart city to public transportation. Most of these have focused on answering the question, “When will my bus arrive?”; little has been done to answer the question, “How full will my next bus be?” which also dramatically affects commuters’ quality of life. In this article, we consider the bus fullness problem. In particular, we propose two different formulations of the problem, develop multiple predictive models, and evaluate their accuracy using data from the Pittsburgh region. Our predictive models consistently outperform the baselines (by up to 8 times).  more » « less
Award ID(s):
1739413
PAR ID:
10229761
Author(s) / Creator(s):
;
Date Published:
Journal Name:
ACM/IMS Transactions on Data Science
Volume:
1
Issue:
3
ISSN:
2691-1922
Page Range / eLocation ID:
1 to 33
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Urban public transit planning is crucial in reducing traffic congestion and enabling green transportation. However, there is no systematic way to integrate passengers' personal preferences in planning public transit routes and schedules so as to achieve high occupancy rates and efficiency gain of ride-sharing. In this paper, we take the first step tp exact passengers' preferences in planning from history public transit data. We propose a data-driven method to construct a Markov decision process model that characterizes the process of passengers making sequential public transit choices, in bus routes, subway lines, and transfer stops/stations. Using the model, we integrate softmax policy iteration into maximum entropy inverse reinforcement learning to infer the passenger's reward function from observed trajectory data. The inferred reward function will enable an urban planner to predict passengers' route planning decisions given some proposed transit plans, for example, opening a new bus route or subway line. Finally, we demonstrate the correctness and accuracy of our modeling and inference methods in a large-scale (three months) passenger-level public transit trajectory data from Shenzhen, China. Our method contributes to smart transportation design and human-centric urban planning. 
    more » « less
  2. null (Ed.)
    Smart city projects aim to enhance the management of city infrastructure by enabling government entities to monitor, control and maintain infrastructure efficiently through the deployment of Internet-of-things (IoT) devices. However, the financial burden associated with smart city projects is a detriment to prospective smart cities. A noteworthy factor that impacts the cost and sustainability of smart city projects is providing cellular Internet connectivity to IoT devices. In response to this problem, this paper explores the use of public transportation network nodes and mules, such as bus-stops as buses, to facilitate connectivity via device-to-device communication in order to reduce cellular connectivity costs within a smart city. The data mules convey non-urgent data from IoT devices to edge computing hardware, where data can be processed or sent to the cloud. Consequently, this paper focuses on edge node placement in smart cities that opportunistically leverage public transit networks for reducing reliance on and thus costs of cellular connectivity. We introduce an algorithm that selects a set of edge nodes that provides maximal sensor coverage and explore another that selects a set of edge nodes that provide minimal delivery delay within a budget. The algorithms are evaluated for two public transit network data-sets: Chapel Hill, North Carolina and Louisville, Kentucky. Results show that our algorithms consistently outperform edge node placement strategies that rely on traditional centrality metrics (betweenness and in-degree centrality) by over 77% reduction in coverage budget and over 20 minutes reduction in latency. 
    more » « less
  3. This paper proposes a framework to optimally employ static VAR compensators (SVCs) within a customized reconfiguration of system topology, leading to remediation of voltage violations caused by false data injection (FDI) cyberattacks targeting smart distribution grids. The designed framework contains formulations associated with planning and operation phases. In the planning phase, the scrutinized system, modified by photovoltaic (PV) units, is enhanced by optimally allocating static VAR compensators (SVCs) to keep the unity power factor throughout the system. Then, distribution system operator (DSO), being in attacker’s shoe, examines relevant cyberattack scenarios leading to voltage violations within the distribution system. Finally, in the operation phase, DSO takes advantage of the optimally planned SVCs to identify proper vectors (i.e., remedial actions) to cope with such potential scenarios of cyberattacks. These (to be recognized) vectors are associated with the variable shunt susceptance of the mentioned SVCs, which will be identified by solving a customized distribution feeder reconfiguration (DFR) problem in the operation phase. The main objective of the customized DFR is to maximize the contributions of SVCs through enhancing the voltage profile of the targeted system. This will enable DSO to mitigate the negative impacts of the FDI attacks and recover the voltage profile of the smart distribution network. The effectiveness of the proposed RAS is validated on three different smart test systems (i.e., 33-bus, 95-bus, and 136-bus systems), which are modified to contain SVC components and renewable-based distributed generation (DG) units. 
    more » « less
  4. null (Ed.)
    Warmest thanks to Rafael Pass and Muthu Venkitasubramaniam for this issue's guest column, "Average-Case Complexity Through the Lens of Interactive Puzzles." When I mentioned to them that my introduction would have a section on Alan Selman's passing, they immediately wrote back that they were very sorry to hear of Alan's passing, and mentioned (as you will see discussed in the second page of their article), "The main problem that we are addressing actually goes back to a paper of Even, Selman, and Yacobi from 1984: "The Complexity of Promise Problems with Applications to Public-Key Cryptography'." It is beautiful, and a tribute to the lasting influence of Alan's research, that in the 2020s his work from many decades earlier is helping shape the field's dialogue. 
    more » « less
  5. An effective real-time estimation of the travel time for vehicles, using AVL (Automatic Vehicle Locators) has added a new dimension to the smart city planning. In this paper, the authors used data collected over several months from a transit agency and show how this data can be potentially used to learn patterns of travel time during specially planned events like NFL (National Football League) games and music award ceremonies. The impact of NFL games along with consideration of other factors like weather, traffic condition, distance is discussed with their relative importance to the prediction of travel time. Statistical learning models are used to predict travel time and subsequently assess the cascading effects of delay. The model performance is determined based on its predictive accuracy according to the out-of-sample error. In addition, the models help identify the most significant variables that influence the delay in the transit system. In order to compare the actual and predicted travel time for days having special events, heat maps are generated showing the delay impacts in different time windows between two timepoint-segments in comparison to a non-game day. This work focuses on the prediction and visualization of the delay in the public transit system and the analysis of its cascading effects on the entire transportation network. According to the study results, the authors are able to explain more than 80% of the variance in the bus travel time at each segment and can make future travel predictions during planned events with an out-of-sample error of 2.0 minutes using information on the bus schedule, traffic, weather, and scheduled events. According to the variable importance analysis, traffic information is most significant in predicting the delay in the transit system. 
    more » « less