SSeg-LSTM: Semantic Scene Segmentation for Trajectory Prediction

@article{Syed2019SSegLSTMSS,
  title={SSeg-LSTM: Semantic Scene Segmentation for Trajectory Prediction},
  author={Arsal Syed and Brendan Tran Morris},
  journal={2019 IEEE Intelligent Vehicles Symposium (IV)},
  year={2019},
  pages={2504-2509}
}
  • Arsal Syed, B. Morris
  • Published 9 June 2019
  • Computer Science
  • 2019 IEEE Intelligent Vehicles Symposium (IV)
In this paper, we propose the use of semantic segmentation to incorporate scene information for better understanding of human motion in crowded environments. Our proposed SSeg-LSTM method leverages SegNet, which is a semantic segmentation encoder-decoder architecture, to extract semantically meaningful scene features. We then train the Social Scene LSTM (SS-LSTM) model with the contextual information regarding dynamics, social neighborhood, and scene semantics to predict future trajectory… 

Figures, Tables, and Topics from this paper

SRA-LSTM: Social Relationship Attention LSTM for Human Trajectory Prediction
TLDR
A Social Relationship Attention LSTM (SRA-LSTM) model to predict future trajectories of pedestrian trajectory prediction for surveillance video achieves superior performance compared with state-of-theart methods.
Social Pooling with Edge Convolutions on Local Connectivity Graphs for Human Trajectory Prediction in Crowded Scenes
TLDR
A novel multi-layer network architecture based on a new Edge Convolutional operator acting on irregular data which is able to generalize local human-human interactions on a semantic social context is developed and integrated into a state-of-the-art trajectory prediction framework based on Generative Adversarial Networks.
Scene Gated Social Graph: Pedestrian Trajectory Prediction Based on Dynamic Social Graphs and Scene Constraints
TLDR
In this work, a novel trajectory prediction method named Scene Gated Social Graph (SGSG) is proposed, which uses dynamic graphs to describe the social relationship among pedestrians and achieves superior performance on two widely used trajectory prediction benchmarks.
Review of Pedestrian Trajectory Prediction Methods: Comparing Deep Learning and Knowledge-based Approaches
TLDR
A comparison of relatively new deep learning algorithms with classical knowledge-based models that are widely used to simulate pedestrian dynamics shows that the combination of both approaches seems to be promising to overcome disadvantages like the missing explainability of the deep learning approach.
Time Series Segmentation of Flood Flow Based on Bi-LG-LSTM Neural Network
  • Jun Feng, Haohang Wang, Yirui Wu
  • Computer Science
    2020 IEEE 5th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA)
  • 2020
TLDR
This paper proposes a time series segmentation method based on Bi-LG-LSTM neural network, which is modified on the base of the LSTM/Bi-L STM neuralnetwork, which can extract a large number of effective segments of the time series through the supervised learning method.
Review on Vehicle Detection Technology for Unmanned Ground Vehicles
Unmanned ground vehicles (UGVs) have great potential in the application of both civilian and military fields, and have become the focus of research in many countries. Environmental perception
A Survey on Sensor Technologies for Unmanned Ground Vehicles
  • Qi Liu, S. Yuan, Zirui Li
  • Computer Science
    2020 3rd International Conference on Unmanned Systems (ICUS)
  • 2020
TLDR
A brief review on sensor technologies for UGVs, highlighting the strengths and weaknesses of different sensors as well as their application scenarios are compared and the hotspots of sensor technologies are forecasted to point the development direction.
DATA-DRIVEN APPROACH TO HOLISTIC SITUATIONAL AWARENESS IN CONSTRUCTION SITE SAFETY MANAGEMENT
TLDR
The overarching goal of this research is to minimize the risk of struck-by accidents on construction jobsite by enhancing the holistic situational awareness of the unstructured and dynamic construction environment through a novel data-driven approach.
A Novel Graph based Trajectory Predictor with Pseudo Oracle
TLDR
The Graph-based Trajectory Predictor with Pseudo-Oracle (GTPPO), an encoder-decoder-based method conditioned on pedestrians' future behaviors, which is evaluated on ETH, UCY, and Stanford Drone datasets, and the results demonstrate state-of-the-art performance.
...
1
2
...

References

SHOWING 1-10 OF 16 REFERENCES
SS-LSTM: A Hierarchical LSTM Model for Pedestrian Trajectory Prediction
TLDR
A novel hierarchical LSTM-based network is proposed to consider both the influence of social neighbourhood and scene layouts in pedestrian trajectory prediction, showing that the method outperforms other methods and that using circular shape neighbourhood improves the prediction accuracy.
Social LSTM: Human Trajectory Prediction in Crowded Spaces
TLDR
This work proposes an LSTM model which can learn general human movement and predict their future trajectories and outperforms state-of-the-art methods on some of these datasets.
Convolutional Social Pooling for Vehicle Trajectory Prediction
  • Nachiket Deo, M. Trivedi
  • Computer Science
    2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
  • 2018
TLDR
This paper proposes an LSTM encoder-decoder model that uses convolutional social pooling as an improvement to social Pooling layers for robustly learning interdependencies in vehicle motion and outputs a multi-modal predictive distribution over future trajectories based on maneuver classes.
Context-Aware Trajectory Prediction
TLDR
This work proposes a “context-aware” recurrent neural network LSTM model, which can learn and predict human motion in crowded spaces such as a sidewalk, a museum or a shopping mall, and evaluates the model on a public pedestrian datasets.
An Evaluation of Trajectory Prediction Approaches and Notes on the TrajNet Benchmark
TLDR
It is shown that a Recurrent-Encoder with a Dense layer stacked on top, referred to as RED-predictor, is able to achieve sophisticated results compared to elaborated models in such scenarios and some recommendations for overcoming demonstrated shortcomings are given.
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling
TLDR
The results show that SegNet achieves state-of-the-art performance even without use of additional cues such as depth, video frames or post-processing with CRF models.
Semantic object classes in video: A high-definition ground truth database
TLDR
The Cambridge-driving Labeled Video Database (CamVid) is presented as the first collection of videos with object class semantic labels, complete with metadata, and the relevance of the database is evaluated by measuring the performance of an algorithm from each of three distinct domains: multi-class object recognition, pedestrian detection, and label propagation.
Probabilistic vehicle trajectory prediction over occupancy grid map via recurrent neural network
TLDR
The proposed trajectory prediction method employs the recurrent neural network called long short-term memory (LSTM) to analyze the temporal behavior and predict the future coordinate of the surrounding vehicles.
SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints
TLDR
SoPhie is presented; an interpretable framework based on Generative Adversarial Network (GAN), which leverages two sources of information, the path history of all the agents in a scene, and the scene context information, using images of the scene.
You'll never walk alone: Modeling social behavior for multi-target tracking
TLDR
A model of dynamic social behavior, inspired by models developed for crowd simulation, is introduced, trained with videos recorded from birds-eye view at busy locations, and applied as a motion model for multi-people tracking from a vehicle-mounted camera.
...
1
2
...