Human pose forecasting, which aims to predict future body poses, is an important task in computer vision. Long-term pose forecasting is particularly challenging because modeling long-range spatial-temporal dependencies is difficult for joint-based poses. Another challenge is uncertainty prediction, since future prediction is not a deterministic process. In this work, we present a novel Bayesian Spatial-Temporal Graph Transformer (BSTG-Trans) for predicting accurate, diverse, and uncertainty-aware future poses. First, we apply a spatial-temporal graph transformer as the encoder and a temporal-spatial graph transformer as the decoder, modeling long-range spatial-temporal dependencies across pose joints to generate long-term future body poses. Furthermore, we propose a Bayesian sampling module for uncertainty quantification of the diverse future poses. Finally, a novel uncertainty estimation metric, Uncertainty Absolute Error (UAE), is introduced to measure both the accuracy and the uncertainty of each predicted future pose. BSTG-Trans achieves state-of-the-art performance against other baselines on Human3.6M and HumanEva-I in terms of accuracy, diversity, and uncertainty for long-term pose forecasting. Moreover, comprehensive ablation studies demonstrate the effectiveness and generalization of each module proposed in BSTG-Trans.
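As a rough illustration of the idea behind sampling-based uncertainty for pose forecasting (not the exact BSTG-Trans Bayesian sampling module, whose details are in the paper and config), several stochastic forecasts can be summarized into a point prediction and a per-joint spread. The names `model`, `past_poses`, and `num_samples` below are hypothetical:

```python
import torch

def mc_sample_future_poses(model, past_poses, num_samples=50):
    """Generic Monte Carlo sketch: draw several stochastic forecasts and
    summarize them. Illustrative only; the actual Bayesian sampling module
    in BSTG-Trans may differ."""
    model.train()  # keep stochastic layers (e.g. dropout) active while sampling
    with torch.no_grad():
        samples = torch.stack([model(past_poses) for _ in range(num_samples)])
    mean_pose = samples.mean(dim=0)   # point forecast
    joint_std = samples.std(dim=0)    # per-joint uncertainty estimate
    return mean_pose, joint_std, samples
```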
Please install the requirements:

```bash
pip install -r requirements.txt
```
Please follow the data preprocessing steps (`DATASETS.md`) inside the VideoPose3D repo. Place the preprocessed data `data_3d_h36m.npz` and `data_2d_h36m_gt.npz` under the `data` folder.
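After preprocessing, a quick sanity check can confirm the files are readable. This is a minimal sketch assuming the standard VideoPose3D output format (key names `positions_3d` and `positions_2d`); adjust if your preprocessing differs:

```python
import numpy as np

# Verify the preprocessed Human3.6M files are in place and readable.
poses_3d = np.load('data/data_3d_h36m.npz', allow_pickle=True)['positions_3d'].item()
poses_2d = np.load('data/data_2d_h36m_gt.npz', allow_pickle=True)['positions_2d'].item()

print('3D subjects:', sorted(poses_3d.keys()))  # e.g. S1, S5, S6, ...
print('2D subjects:', sorted(poses_2d.keys()))
```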
A trained model `checkpoint.pkl` is provided in `CHECKPOINT`.
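If you want to inspect the checkpoint before running, a minimal sketch is below. It assumes the checkpoint was serialized with `torch.save` (the `.pkl` extension alone does not guarantee this) and that the path matches your layout:

```python
import torch

# Load the provided checkpoint on CPU and peek at its contents.
ckpt = torch.load('CHECKPOINT/checkpoint.pkl', map_location='cpu')
print(type(ckpt))
if isinstance(ckpt, dict):
    print(list(ckpt.keys()))  # e.g. model state dict, optimizer state, epoch
```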
To perform experiments with BSTG-Trans, run:

```bash
python main_body_pose.py --dataset Human36M \
    --batch_size 128 \
    --init_lr 0.001 \
    --gpu_id 0,1,2,3,4,5,6,7 \
    --seed 2022 \
    --config 'configs/bayesian_stgt_human36m.json'
```
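For a quick run on a single GPU, the same entry point can be invoked with one device id. This variant only reuses the flags shown above; the batch size here is a guess and may need adjusting to fit memory:

```bash
python main_body_pose.py --dataset Human36M \
    --batch_size 16 \
    --init_lr 0.001 \
    --gpu_id 0 \
    --seed 2022 \
    --config 'configs/bayesian_stgt_human36m.json'
```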