A collection of recent (2016-now) deep learning studies related to human pose. If you find any errors or problems, please feel free to comment or open a PR. An early version is shared on [Zhihu] and [Mind-map].
- Human-Pose Estimation
- Human-Pose Recognition
- Human-Pose Generation
- Human-Pose Reconstruction and Rendering (Mesh or Appearance)
- Open-source Toolbox
[Top-down]
Popular papers
- Simple Pose
- Mask R-CNN
- CPM
- CPN
- RSN
- AlphaPose
- MSPN
- HRNet
Others
- Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
- Efficient Human Pose Estimation by Learning Deeply Aggregated Representations
- Fast Human Pose Estimation
- Learning to Refine Human Pose Estimation
[Bottom-up]
PAF Family
- PifPaf
- OpenPose
- Improved PifPaf
Associate Family
- Associative Embedding
- HigherHRNet
- Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation
Graph Family
- Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
Offset Family
- PersonLab
[One-stage]
- Deeply Learned Compositional Models for Human Pose Estimation
- Single-Stage Multi-Person Pose Machines
[Interaction-aware]
- I^2RNet
- Integral Human Pose Regression
- LCR-Net: Localization-Classification-Regression for Human Pose
[Non-rigid Structure from Motion]
- C3DPO
- Deep Interpretable Non-Rigid Structure from Motion
[2D lift to 3D]
- A Simple Yet Effective Baseline for 3D Human Pose Estimation
[Depth-aware]
- HMOR
[Others]
- Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation
- PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation
- Semantic Graph Convolutional Networks for 3D Human Pose Regression
- UDP
- DarkPose
- Rethink
- Adversarial PoseNet
- Adversarial Semantic Data Augmentation for Human Pose Estimation
- CrowdPose
- Peeking into Occluded Joints
- Adversarial Semantic Data Augmentation for Human Pose Estimation
- 3D Human Pose Estimation using Spatio-Temporal Networks with Explicit Occlusion Training
- Occlusion-Aware Networks for 3D Human Pose Estimation in Video
- Occlusion-Aware Siamese Network for Human Pose Estimation
- A Semantic Occlusion Model for Human Pose Estimation from a Single Depth
- Occluded Joints Recovery in 3D Human Pose Estimation based on Distance Matrix
- Object-Occluded Human Shape and Pose Estimation from a Single Color Image
- LCR-Net: Localization-Classification-Regression for Human Pose
- Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild
- Cross View Fusion for 3D Human Pose Estimation
- Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS
- Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views
- Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach
- View-Invariant Probabilistic Embedding for Human Pose
- Combining Detection and Tracking for Human Pose Estimation in Videos
- Key Frame Proposal Network for Efficient Pose Estimation in Videos
- Learning Temporal Pose Estimation from Sparsely-Labeled Videos
- Personalizing Human Video Pose Estimation
- Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos
- VideoPose3D
- PoseNet3D
- Motion Guided 3D Pose Estimation from Videos
- 2D or 3D Pose Estimation and Action Recognition using Multitask Deep Learning
- 3D Human Pose Estimation from Monocular Video
- Attention Mechanism Exploits Temporal Contexts: Real-time 3D Human Pose Reconstruction
- VNect
- 3D Human Pose Estimation in RGBD Images for Robotic Task Learning
- Human Pose Estimation for RGBD Imagery with Multi-Channel Mixture of Parts and Kinematic Constraints
- 2D or 3D Pose Estimation and Action Recognition using Multitask Deep Learning
- SlowFast
- Long-term Feature Bank
- AlphAction
- 2D or 3D Pose Estimation and Action Recognition using Multitask Deep Learning
- Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis
- FOMM
- MRAA
- Thin
- Speech2Gesture: Learning Individual Styles of Conversational Gesture
- Speech2Video: Synthesis with 3D Skeleton Regularization and Expressive Body Poses
- Dancing to Music
- ChoreoNet: Towards Music to Dance Synthesis with
- Listen to Dance
- Music2Dance
- Audio to Body Dynamics
- Multi-Instrumentalist Net Unsupervised Generation of Music from Body Movements
- Dance with Melody: An LSTM-autoencoder Approach to
- Text2Action
- Language2Pose: Natural Language Grounded Pose Forecasting
- TriModel
- Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation
- VIBE: Video Inference for Human Body Pose and Shape Estimation
- METRO
- Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans (CVPR 2021) [Project] [Code] [Paper]
- Neural Actor: Neural Free-view Synthesis of Human Actors with Pose Control (SIGGRAPH Asia 2021) [Project] [Code] [Paper]
- Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies (ICCV 2021) [Project] [Code] [Paper]
- A-NeRF: Articulated Neural Radiance Fields for Learning Human Shape, Appearance, and Pose (NeurIPS 2021) [Project] [Code] [Paper]
- Neural Articulated Radiance Field (ICCV 2021) [Project] [Code] [Paper]
- HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video (CVPR 2022) [Project] [Code] [Paper]
- HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs (CVPR 2022) [Project] [Code] [Paper]
From https://github.com/jintaiWang/NeRF-About-Human-Pose-Reconstruction-and-Rendering
Coming soon...