搜索结果：Vision research

共找到 20 条结果

高级筛选 ▾

Towards a Better Understanding of the Computer Vision Research Community in Africa

arXiv2023-05-11作者：Abdul-Hakeem Omotayo, Mai Gamal, Eman Ehab

Computer vision is a broad field of study that encompasses different tasks (e.g., object detection). Although computer vision is relevant to the African communities in various applications, yet computer vision research is under-explored in the continent and constructs only 0.06% of top-tier publications in the last ten years. In this paper, our goal is to have a better understanding of the computer vision research conducted in Africa and provide pointers on whether there is equity in research or not. We do this through an empirical analysis of the African computer vision publications that are Scopus indexed, where we collect around 63,000 publications over the period 2012-2022. We first study the opportunities available for African institutions to publish in top-tier computer vision venues. We show that African publishing trends in top-tier venues over the years do not exhibit consistent growth, unlike other continents such as North America or Asia. Moreover, we study all computer vision publications beyond top-tier venues in different African regions to find that mainly Northern and Southern Africa are publishing in computer vision with 68.5% and 15.9% of publications, resp. Nonet

搜索结果：Vision research

Towards a Better Understanding of the Computer Vision Research Community in Africa

What can robotics research learn from computer vision research?

Image Generators are Generalist Vision Learners

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Real-Time Digital Twins: Vision and Research Directions for 6G and Beyond

Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video

Steering CLIP's vision transformer with sparse autoencoders

Two Decades of Research at the University of Lagos (2004-2023): A Scientometric Analysis of Productivity, Collaboration, and Impact

Vision Generalist Model: A Survey

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Power, Prescription, and Postpositivism: Considerations for collecting and representing neurodiversity demographic information in physics education research

Mapping a Decade of Avian Influenza Research (2014-2023): A Scientometric Analysis from Web of Science

Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

Is Tracking really more challenging in First Person Egocentric Vision?

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Bootstrapping Vision-language Models for Self-supervised Remote Physiological Measurement

Vision-LSTM: xLSTM as Generic Vision Backbone

NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction

Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

搜索结果：Vision research

Towards a Better Understanding of the Computer Vision Research Community in Africa

What can robotics research learn from computer vision research?

Image Generators are Generalist Vision Learners

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Real-Time Digital Twins: Vision and Research Directions for 6G and Beyond

Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video

Steering CLIP's vision transformer with sparse autoencoders

Two Decades of Research at the University of Lagos (2004-2023): A Scientometric Analysis of Productivity, Collaboration, and Impact

Vision Generalist Model: A Survey

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Power, Prescription, and Postpositivism: Considerations for collecting and representing neurodiversity demographic information in physics education research

Mapping a Decade of Avian Influenza Research (2014-2023): A Scientometric Analysis from Web of Science

Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

Is Tracking really more challenging in First Person Egocentric Vision?

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Bootstrapping Vision-language Models for Self-supervised Remote Physiological Measurement

Vision-LSTM: xLSTM as Generic Vision Backbone

NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction

Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation