We're sorry but this page doesn't work properly without JavaScript enabled. Please enable it to continue.
Feedback

Multi-branch Deep learning Based Transport Mode Detection using Weakly Supervised Labels

00:00

Formal Metadata

Title
Multi-branch Deep learning Based Transport Mode Detection using Weakly Supervised Labels
Title of Series
Number of Parts
351
Author
License
CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language
Production Year2022

Content Metadata

Subject Area
Genre
Abstract
Multi-branch Deep learning Based Transport Mode Detection using Weakly Supervised Labels Mobility data, based on global positioning system (GPS) tracking, have been widely used in many areas. These include analyzing travel patterns, investigating transport safety and efficiency, and evaluating travel impacts. Transport Mode Detection (TMD) is an essential factor in understanding mobility within the transport system. A TMD model assigns a GPS point or a GPS trajectory to a particular transport mode based on the user's activity and medium of travel [1]. However, the complexity of the prediction procedure increases with the number of modes that need to be predicted. For example, it is comparatively easy to predict whether a user is 'static' or 'slow moving' or 'fast moving' but it's hard to predict detailed transport modes such as walk, bike, car, bus, train, boat, etc. Therefore, this study proposes a multi-branch deep learning-based TMD model which can predict multi-class transport modes. Two major challenges need to be addressed in order to generate a state-of-the-art deep learning model. The first is to prepare ground-truth data. There are insufficient open-sourced ground-truth data available for transport modes in Japan. Hence, we proposed a transport mode label generation approach using snorkel [2]. Snorkel is a weakly supervised labeling function, a first-of-its-kind system that enables users to train state-of-the-art models without hand labeling any training data. Instead, experts write labeling functions that express arbitrary heuristics based on the logic that can be drawn from understanding the data and the physical actions they represent. In this study, we used snorkel for generating the ground truth data for transport mode. Initially, we considered publicly available road networks, railway networks, bus routes, etc., for creating road, bus, train labels by overlaying GPS points on these transportation networks. However, there are multiple occasions where the road, bus, and train classes overlap each other, especially in a city region. Hence, we introduced a boolean (True/False) based soft-labeling function, where the same GPS point might have multiple True values for road or railway. Second, we derived mobility-related features from the raw GPS data. Raw GPS raw data is typically composed of latitude, longitude, and timestamps. The raw GPS data were used to generate point-level features such as speed, speed difference, acceleration, acceleration difference, initial bearing, and bearing difference. Apart from that, we also generated trajectory level features such as average speed and average acceleration. Transportation network-based soft-labeling and other mobility features are used to define labeling functions in the snorkel. These label functions are used to create true ground truths using a generative machine learning model with a portion of the GPS data. The generated labels (walk, cycle, bus, car, train, boat/ship) were then used to train the proposed deep learning model. To construct the model we opted to use two branches where raw GPS latitude and longitude values were used in one and the derived mobility features are used in the other. We used 3 fully-connected hidden layers for raw GPS data (lat/lon) and 4 fully connected hidden layers for mobility features. Features derived from the two branches are concatenated. Further, 3 fully connected hidden layers and softmax cross-entropy were used as a loss function. The proposed deep learning model has 108,614 trainable parameters and Adam is used as an optimizer. This particular two-branch model structure achieves better accuracy as it combines raw data as well as the derived mobility features in the network. An example of the benefit from this approach benefit can be the network's ability to relate GPS coordinates with road driving classes, thus inherently inferring that location as on a road. Note, many of these inferences that improve classification accuracy are possible via dramatically more advanced pre-processing to build out additional features. However, that approach is more time-consuming and could never catch all the potential inferences that an unbiased set of deep learning layers can inherently extract. We evaluated the trained model's effectiveness in two ways. We compared the results against the popular XGBoost classifier, with our model producing over 5% higher accuracy for the benchmark Geolife dataset [3]. Moreover, we collected smartphone-based GPS trajectories for multiple modes of transportation collected by testers in Bengaluru, India, and Tokyo, Japan. With this new absolute ground truth data, we compared the resulting predicted classes between operating system-provided activity classifications, the above XGBoost model, and our own. Our experiments show promising results with improved accuracy and increases in number of labeled data points. Of key note is that the iOS [4] and android [5]in-built activity recognition tools provide the 'automotive' class as a single class, while our proposed model efficiently distinguishes automotive classes as car, bus, and train with improved accuracy. This work completely depends upon Free and Open Source Solutions (FOSS) for data preparation, mobility feature generation, deep learning model training, and big data computing. That includes various geospatial libraries such as geopandas, shapely, rtree, weakly label generation platform snorkel, deep learning platform tensorflow, keras, big-data computing platforms such as pyspark, hadoop, hive, etc.
Keywords
202
Thumbnail
1:16:05
226
242
Group actionMultiplicationEndliche ModelltheorieTrailTrajectoryPoint (geometry)AreaComputer animation
Endliche ModelltheorieObservational studyTrajectoryTask (computing)Set (mathematics)Computer animation
Data modelAndroid (robot)ArchitecturePredictionDivisorPhysical systemKolmogorov complexityFunction (mathematics)Mathematical modelComputer networkCAE Inc.Transportation theory (mathematics)Hausdorff dimensionPopulation densityDirection (geometry)MathematicsMultiplicationData structurePoint cloudProcess (computing)Set (mathematics)Observational studyEndliche ModelltheoriePoint (geometry)Process (computing)Wave packetComputer architectureMobile appMaxima and minimaSupervised learningBus (computing)Row (database)Branch (computer science)ImplementationSoftwareGene clusterFunctional (mathematics)Utility softwareSmartphoneStapeldateiFunctional (mathematics)MultiplicationTable (information)outputResultantAndroid (robot)Public domainMathematical modelSoftware testingService (economics)Basis <Mathematik>TrajectoryInsertion lossOpen sourceComputer animation
Data modelComputing platformPlastikkartePredictionNumberAndroid (robot)MehrplatzsystemAndroid (robot)NumberoutputRight angleEndliche ModelltheorieComputer animation
Transcript: English(auto-generated)
Mobility data based on global positioning system tracking have been widely used in several areas. A transport mode model assigns a GPS point or a GPS trajectory to a particular transport mode based on user's activity. Unavailability of transport mode training datasets make it a challenging task to build a TMD model.
Hence, this study proposed a weakly supervised model using snorkel to generate synthetic training data. Further study proposed a two-branch deep learning architecture that well suited for transport mode detection. Results are evaluated generously with previous machine learning model as well as with the iOS and Android inbuilt tool.
In any supervised learning approach, label data availability is critical to maximize model performance. Hence, this study proposes a transport mode label generation approach using snorkel with programmatic labeling as there is no ground truth data available for this study.
We considered publicly available datasets related to road network, railway network, bus routes, etc. These datasets were used to create road bus train labels by overlapping GPS points on associated transportation network.
These labels are called soft labels. Transportation network based soft labeling and other mobility features are used to define multiple labeling functions in snorkel. Study proposed a two-branch deep learning architecture where road GPS latitude and longitude values
are used in one branch and derived mobility features are used in the other branch. We used three fully connected hidden layers for road GPS data with latitude and longitude values. For mobility features, we used four fully connected hidden layers.
Features derived from the two branches are concatenated in feature domains. Furthermore, three fully connected hidden layers and softmax cross-entropy were used as a loss function. This study utilized various free and open source solutions to implement the deep learning model using a scheduled pipeline for estimating TMD on a daily basis.
Multiple dedicated mobile applications collect big GPS data and the pipeline will gather daily data and process using function as a service. The jobs are scheduled and triggered as daily batch using Apache Airflow.
The implementation utilized Apache Hadoop clusters to process big GPS data and Apache Hive tables are used to store the daily batch TMD results. We compared the proposed model with our own previous model AXG Boost Classifier and improved accuracy by nearly 2x.
Moreover, we collected smartphone-based GPS trajectories for multiple modes of transportation collected by testers in Tokyo, Japan. And resulted in higher average accuracy compared to the Android and iOS inbuilt.
Overall, 17.86% better than iOS Android inbuilt tool. Proposed model improved number of modes and user coverage. Top right animation demonstrates the transportation detection for a single user in a day.
Thank you for listening.