[PDF] Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation | Semantic Scholar (2024)

Skip to search formSkip to main contentSkip to account menu

Semantic ScholarSemantic Scholar's Logo
  • Corpus ID: 269929954
@inproceedings{Gu2024FocusOL, title={Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation}, author={Zejun Gu and Zhongqiu Zhao and Hao Shen and Zhao Zhang}, year={2024}, url={https://api.semanticscholar.org/CorpusID:269929954}}
  • Zejun Gu, Zhongqiu Zhao, Zhao Zhang
  • Published 19 May 2024
  • Computer Science, Engineering

A Multi-Granular Information-Lossless (MGIL) model is proposed to replace the downsampling layers to address the above issues and outperforms the SOTA methods by 7.7 mAP on COCO and performs well with different input resolutions, different backbones, and different vision tasks.

Figures and Tables from this paper

  • figure 1
  • table 1
  • figure 2
  • table 2
  • figure 3
  • table 3
  • figure 4
  • table 4
  • table 5
  • figure 5
  • table 6
  • table 7
  • table 8

Ask This Paper

BETA

AI-Powered

Our system tries to constrain to information found in this paper. Results quality may vary. Learn more about how we generate these answers.

Feedback?

75 References

Deep High-Resolution Representation Learning for Human Pose Estimation
    Ke SunBin XiaoDong LiuJingdong Wang

    Computer Science

    2019 IEEE/CVF Conference on Computer Vision and…

  • 2019

This paper proposes a network that maintains high-resolution representations through the whole process of human pose estimation and empirically demonstrates the effectiveness of the network through the superior pose estimation results over two benchmark datasets: the COCO keypoint detection dataset and the MPII Human Pose dataset.

  • 3,153
  • Highly Influential
  • [PDF]
Simple Baselines for Human Pose Estimation and Tracking
    Bin XiaoHaiping WuYichen Wei

    Computer Science

    ECCV

  • 2018

This work provides simple and effective baseline methods for pose estimation that are helpful for inspiring and evaluating new ideas for the field and achieved on challenging benchmarks.

  • 1,510
  • Highly Influential
  • [PDF]
Microsoft COCO: Common Objects in Context
    Tsung-Yi LinM. Maire C. L. Zitnick

    Computer Science

    ECCV

  • 2014

We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene

  • 35,128
  • Highly Influential
  • [PDF]
Effective Whole-body Pose Estimation with Two-stages Distillation
    Zhendong YangAiling ZengChun YuanYu Li

    Computer Science

    2023 IEEE/CVF International Conference on…

  • 2023

This work presents a two-stage pose Distillation for Whole-body Pose estimators, named DWPose, to improve their effectiveness and efficiency and releases a series of models with different sizes, from tiny to large, for satisfying various downstream tasks.

Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
    Xu JuAiling ZengJianan WangQian XuLei Zhang

    Art, Computer Science

    2023 IEEE/CVF Conference on Computer Vision and…

  • 2023

The Human-Art dataset is introduced and contains 50k high-quality images with over 123k person instances from 5 natural and 15 artificial scenarios, which are annotated with bounding boxes, keypoints, self-contact points, and text information for humans represented in both 2D and 3D.

DistilPose: Tokenized Pose Regression with Heatmap Distillation
    Suhang YeYingyi Zhang Rongrong Ji

    Computer Science

    2023 IEEE/CVF Conference on Computer Vision and…

  • 2023

A novel human pose estimation framework termed DistilPose, which bridges the gaps between heatmap-based and regression-based methods and maximizes the transfer of knowledge from the teacher model to the student model through Token-distilling Encoder (TDE) and Simulated Heatmaps.

Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
    Jie YangAiling ZengSiyi LiuFeng LiRuimao ZhangLei Zhang

    Computer Science

    ICLR

  • 2023

This paper presents a novel end-to-end framework with Explicit box Detection for multi-person Pose estimation, called ED-Pose, where it unifies the contextual learning between human-level and keypoint-level information and surpasses heatmap-based Top-down methods under the same backbone.

Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition
    Sungho ShinJoosoon LeeJunseok LeeYeonguk YuKyoobin Lee

    Computer Science

    ECCV

  • 2022

An attention similarity knowledge distillation approach, which transfers attention maps obtained from a high resolution (HR) network as a teacher into an LRnetwork as a student to boost LR recognition performance, outperforming state-of-the-art results by simply transferring well-constructed attention maps.

No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects
    Raja SunkaraTie Luo

    Computer Science

    ECML/PKDD

  • 2022

A new CNN building block called SPD-Conv is proposed in place of each strided convolution layer and each pooling layer, and it is shown that this approach significantly outperforms state-of-the-art deep learning models, especially on tougher tasks with low-resolution images and small objects.

End-to-End Multi-Person Pose Estimation with Transformers
    Dahu ShiXing WeiLiangqi LiYe RenWenming Tan

    Computer Science

    2022 IEEE/CVF Conference on Computer Vision and…

  • 2022

The proposed PETR method views pose estimation as a hierarchical set prediction problem and effectively removes the need for many hand-crafted modules like RoI cropping, NMS and grouping post-processing, and largely overcomes the feature misalignment difficulty in pose estimation and improves the performance considerably.

  • 61
  • PDF

...

...

Related Papers

Showing 1 through 3 of 0 Related Papers

    [PDF] Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation | Semantic Scholar (2024)

    References

    Top Articles
    Latest Posts
    Article information

    Author: Tyson Zemlak

    Last Updated:

    Views: 5565

    Rating: 4.2 / 5 (43 voted)

    Reviews: 90% of readers found this page helpful

    Author information

    Name: Tyson Zemlak

    Birthday: 1992-03-17

    Address: Apt. 662 96191 Quigley Dam, Kubview, MA 42013

    Phone: +441678032891

    Job: Community-Services Orchestrator

    Hobby: Coffee roasting, Calligraphy, Metalworking, Fashion, Vehicle restoration, Shopping, Photography

    Introduction: My name is Tyson Zemlak, I am a excited, light, sparkling, super, open, fair, magnificent person who loves writing and wants to share my knowledge and understanding with you.