Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation


Corpus ID: 269929954

@inproceedings{Gu2024FocusOL, title={Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation}, author={Zejun Gu and Zhongqiu Zhao and Hao Shen and Zhao Zhang}, year={2024}, url={https://api.semanticscholar.org/CorpusID:269929954}}

Zejun Gu, Zhongqiu Zhao, Hao Shen, Zhao Zhang
Published 19 May 2024
Computer Science, Engineering

A Multi-Granular Information-Lossless (MGIL) model is proposed to replace the downsampling layers in low-resolution human pose estimation. It outperforms the SOTA methods by 7.7 mAP on COCO and performs well across different input resolutions, backbones, and vision tasks.


Figures and Tables from this paper

Figures 1-5; Tables 1-8


75 References

Deep High-Resolution Representation Learning for Human Pose Estimation

Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang

Computer Science

2019 IEEE/CVF Conference on Computer Vision and…

2019

This paper proposes a network that maintains high-resolution representations through the whole process of human pose estimation and empirically demonstrates the effectiveness of the network through the superior pose estimation results over two benchmark datasets: the COCO keypoint detection dataset and the MPII Human Pose dataset.

Citations: 3,153 (Highly Influential)


Simple Baselines for Human Pose Estimation and Tracking

Bin Xiao, Haiping Wu, Yichen Wei

Computer Science

ECCV

2018

This work provides simple and effective baseline methods for pose estimation that are helpful for inspiring and evaluating new ideas for the field; state-of-the-art results are achieved on challenging benchmarks.

Citations: 1,510 (Highly Influential)


Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, M. Maire, …, C. L. Zitnick

Computer Science

ECCV

2014

We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene…

Citations: 35,128 (Highly Influential)


Effective Whole-body Pose Estimation with Two-stages Distillation

Zhendong Yang, Ailing Zeng, Chun Yuan, Yu Li

Computer Science

2023 IEEE/CVF International Conference on…

2023

This work presents a two-stage pose Distillation for Whole-body Pose estimators, named DWPose, to improve their effectiveness and efficiency and releases a series of models with different sizes, from tiny to large, for satisfying various downstream tasks.


Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes

Xu Ju, Ailing Zeng, Jianan Wang, Qian Xu, Lei Zhang

Art, Computer Science

2023 IEEE/CVF Conference on Computer Vision and…

2023

The Human-Art dataset is introduced and contains 50k high-quality images with over 123k person instances from 5 natural and 15 artificial scenarios, which are annotated with bounding boxes, keypoints, self-contact points, and text information for humans represented in both 2D and 3D.


DistilPose: Tokenized Pose Regression with Heatmap Distillation

Suhang Ye, Yingyi Zhang, …, Rongrong Ji

Computer Science

2023 IEEE/CVF Conference on Computer Vision and…

2023

A novel human pose estimation framework termed DistilPose is proposed. It bridges the gap between heatmap-based and regression-based methods and maximizes the transfer of knowledge from the teacher model to the student model through a Token-distilling Encoder (TDE) and Simulated Heatmaps.


Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation

Jie Yang, Ailing Zeng, Siyi Liu, Feng Li, Ruimao Zhang, Lei Zhang

Computer Science

ICLR

2023

This paper presents a novel end-to-end framework with Explicit box Detection for multi-person Pose estimation, called ED-Pose, which unifies the contextual learning between human-level and keypoint-level information and surpasses heatmap-based top-down methods under the same backbone.


Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition

Sungho Shin, Joosoon Lee, Junseok Lee, Yeonguk Yu, Kyoobin Lee

Computer Science

ECCV

2022

An attention similarity knowledge distillation approach is proposed, which transfers attention maps obtained from a high-resolution (HR) network as a teacher into a low-resolution (LR) network as a student to boost LR recognition performance. It outperforms state-of-the-art results by simply transferring well-constructed attention maps.


No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects

Raja Sunkara, Tie Luo

Computer Science

ECML/PKDD

2022

A new CNN building block called SPD-Conv is proposed in place of each strided convolution layer and each pooling layer, and it is shown that this approach significantly outperforms state-of-the-art deep learning models, especially on tougher tasks with low-resolution images and small objects.


End-to-End Multi-Person Pose Estimation with Transformers

Dahu Shi, Xing Wei, Liangqi Li, Ye Ren, Wenming Tan

Computer Science

2022 IEEE/CVF Conference on Computer Vision and…

2022

The proposed PETR method views pose estimation as a hierarchical set prediction problem and effectively removes the need for many hand-crafted modules such as RoI cropping, NMS, and grouping post-processing. It largely overcomes the feature misalignment difficulty in pose estimation and improves performance considerably.

...
