Large Banner Image

Special Session: Data-efficient Vision Transformers: Challenges & Applications

International Joint Conference on Neural Networks (IJCNN) https://2025.ijcnn.org/
Rome, Italy
30 June – 5 July, 2025

Overview

Vision Transformers (ViTs) have emerged as a breakthrough architecture in the field of computer vision, achieving state-of-the-art results across a variety of tasks. However, ViTs are data-hungry, often requiring large datasets to achieve optimal performance. This has led to challenges in data-scarce scenarios, where collecting and labelling large amounts of training data is impractical. The "Data-efficient Vision Transformers" special session aims to explore new techniques, challenges, and applications that address the issue of data efficiency in Vision Transformers.

This special session will gather experts from academia, industry, and research institutions to discuss cutting-edge solutions for improving the data efficiency of ViTs, such as novel training setups and architecture, self-supervised learning, transfer learning, and augmentation strategies. The goal is to bridge the gap between the high-performance potential of ViTs and their real-world applicability in scenarios with limited training data.


Objectives

The special session will aim to:


Scope and Topics

As the field of artificial intelligence continues to advance, the deployment of deep learning models, especially Vision Transformers (ViTs), is expanding into sectors such as healthcare, autonomous vehicles, satellite imaging, and more. However, many of these applications need more labelled data, posing a major barrier to achieving high accuracy.

This special session will provide a platform to discuss innovative strategies that mitigate data scarcity challenges in Vision Transformers, including but not limited to:


Target Audience

This special session will attract:


Submission Guidelines

Please follow the instructions carefully for submitting your paper to IJCNN 2025:

For more details, visit https://2025.ijcnn.org/authors/initial-author-instructions.


Important Dates


Organizers

Dr. Haider Raza

Dr. Haider Raza

University of Essex, UK

Dr. Raza is a Senior Lecturer with a strong background in AI, deep learning, and computer vision. His research focuses on developing AI solutions for healthcare, autonomous systems, and digital technology. He has published extensively on efficient AI models and has a track record of organising successful special sessions and conferences.

Dr. Muhammad Haris Khan

Dr. Muhammad Haris Khan

MBZUAI, UAE

Dr. Khan is an Assistant Professor at MBZUAI with expertise in computer vision, Vision Transformers, and data-efficient learning methods. His research addresses the challenges of model generalizability to new domains, data and label scarcity, and efficiency in AI models.

Prof. John Q Gan

Prof. John Q Gan

University of Essex, UK

Prof. Gan is a professor of artificial intelligence with extensive experience in deep learning and its applications in image and video classification, medical image analysis, and understanding.

Mohsin Ali

Mohsin Ali

University of Essex, UK

Mohsin Ali is a PhD scholar at the University of Essex, specializing in computer vision with a focus on Vision Transformers (ViTs) and explainable AI. His research aims to enhance the interpretability and efficiency of deep learning models in real-world applications.


Contact Information

For inquiries regarding the special session, feel free to contact:

This special session is co-located at the IJCNN 2025. More details can be found on the official conference website: https://2025.ijcnn.org/


Institutes

University of Essex

University of Essex

Colchester, United Kingdom

MBZUAI

MBZUAI

Abu Dhabi, United Arab Emirates

We believe this special session will offer significant insights and foster discussions on the future of Vision Transformers in data-efficient settings, aligning with the goals of IJCNN 2025. We look forward to contributing to the conference with this engaging and timely topic.