Research Postgraduate, Imperial College London, South Kensington, United Kingdom
Nikolaos Giakoumoglou is currently a postgraduate researcher at Imperial College London in the Department of Electrical and Electronic Engineering (EEE), within the Communications and Signal Processing (CSP) group, where he is pursuing his PhD under the guidance of Professor Tania Stathaki. Before his current role, he worked as a Research Assistant at the Information Technologies Institute (ITI) of the Centre for Research and Technology Hellas (CERTH). He obtained his Diploma in Electrical and Computer Engineering from Aristotle University of Thessaloniki in 2021. Nikolaos' research is primarily focused on Artificial Intelligence, Machine Learning, and Deep Learning, with a special interest in applications within the field of Computer Vision.
SynCo: Synthetic Hard Negatives for Contrastive Visual Representation Learning
N. Giakoumoglou and T. Stathaki
Under review...
Contrastive learning has become a dominant approach in self-supervised visual representation learning, but efficiently leveraging hard negatives, which are samples closely resembling the anchor, remains challenging. We introduce SynCo (Synthetic negatives in Contrastive learning), a novel approach that improves model performance by generating synthetic hard negatives in the representation space. Building on the MoCo framework, SynCo introduces six strategies for creating diverse synthetic hard negatives "on-the-fly" with minimal computational overhead. SynCo achieves faster training and strong representation learning, surpassing MoCo-v2 by +0.4% and MoCHI by +1.0% on ImageNet ILSVRC-2012 linear evaluation. It also transfers more effectively to detection tasks, achieving strong results on PASCAL VOC detection (57.2% AP) and significantly improving over MoCo-v2 on COCO detection (+1.0% box AP) and instance segmentation (+0.8% mask AP). Our synthetic hard negative generation approach substantially enhances visual representations learned through self-supervised contrastive learning.
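As a hedged illustration of the general idea, the sketch below synthesizes a hard negative by interpolating the two queue negatives most similar to the anchor and re-normalizing. This is an assumption in the spirit of hard-negative mixing; SynCo's six actual strategies are not reproduced here.

```python
import math

def normalize(v):
    """L2-normalize a vector represented as a list of floats."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def synthesize_hard_negative(anchor, negatives, alpha=0.5):
    """Create one synthetic hard negative by interpolating the two
    negatives most similar to the anchor, then re-normalizing.
    Illustrative sketch only, not SynCo's actual strategies."""
    # rank negatives by cosine similarity to the (unit-norm) anchor
    sims = [sum(a * n for a, n in zip(anchor, neg)) for neg in negatives]
    order = sorted(range(len(negatives)), key=lambda i: -sims[i])
    n1, n2 = negatives[order[0]], negatives[order[1]]
    mixed = [alpha * x + (1 - alpha) * y for x, y in zip(n1, n2)]
    return normalize(mixed)

anchor = normalize([1.0, 0.2, 0.0])
negatives = [normalize(v) for v in ([0.9, 0.1, 0.1],
                                    [0.0, 1.0, 0.0],
                                    [0.8, 0.3, 0.0])]
hard_neg = synthesize_hard_negative(anchor, negatives)
```

Because the mixture stays on the unit sphere and is built from the hardest existing negatives, it lies close to the anchor, which is what makes it a useful "hard" contrast.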
SynCo-v2: An Empirical Study of Training Self-Supervised Vision Transformers with Synthetic Hard Negatives
N. Giakoumoglou, A. Floros, K. M. Papadopoulos, T. Stathaki
Under review...
We introduce SynCo-v2, a method that integrates synthetic hard negatives into unsupervised vision transformer pretraining to improve representation quality. Our approach is thoroughly benchmarked on ImageNet and on transfer learning, image retrieval, copy detection, and image and video segmentation tasks. Notably, our proposed negatives give rise to emergent properties, where learned representations contain explicit information about the semantic content of an image and serve as excellent classifiers (up to +11.3% over baselines). SynCo-v2 achieves these benefits through simple modifications to existing contrastive frameworks and outperforms competing methods while being more resource efficient, e.g., our ViT-B surpasses V-JEPA with ViT-L. Our findings motivate reconsidering contrastive learning as a simpler yet powerful alternative to dominant generative and self-distillation approaches.
Relational Representation Distillation
N. Giakoumoglou and T. Stathaki
Under review...
Knowledge distillation transfers knowledge from large, high-capacity teacher models to more compact student networks. The standard approach minimizes the Kullback-Leibler (KL) divergence between the probabilistic outputs of the teacher and student, effectively aligning predictions but neglecting the structural relationships encoded within the teacher's internal representations. Recent advances have adopted contrastive learning objectives to address this limitation; however, such instance-discrimination-based methods inevitably induce a "class collision problem", in which semantically related samples are inappropriately pushed apart despite belonging to similar classes. To overcome this, we propose Relational Representation Distillation (RRD), which preserves the relative relationships among instances rather than enforcing absolute separation. Our method introduces separate temperature parameters for teacher and student distributions, with a sharper teacher (low τ_t) emphasizing primary relationships and a softer student (high τ_s) maintaining secondary similarities. This dual-temperature formulation creates an implicit information bottleneck that preserves fine-grained relational structure while avoiding the over-separation characteristic of contrastive losses. We establish theoretical connections showing that InfoNCE emerges as a limiting case of our objective when τ_t approaches 0, and empirically demonstrate that this relaxed formulation yields superior relational alignment and generalization across classification and detection tasks.
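The dual-temperature idea can be sketched as a KL divergence between a sharp teacher similarity distribution and a softer student one. This is a minimal illustration over raw similarity scores; the exact RRD objective and hyperparameter values are assumptions here.

```python
import math

def softmax(scores, tau):
    """Temperature-scaled softmax over a list of similarity scores."""
    exps = [math.exp(s / tau) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def relational_kl(teacher_sims, student_sims, tau_t=0.05, tau_s=0.2):
    """KL divergence between a sharp teacher distribution (low tau_t)
    and a softer student distribution (high tau_s).
    Illustrative sketch of a dual-temperature relational objective."""
    p = softmax(teacher_sims, tau_t)  # sharp: emphasizes primary relations
    q = softmax(student_sims, tau_s)  # soft: keeps secondary similarities
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# identical underlying similarities, different temperatures
sims = [0.9, 0.5, 0.1]
loss = relational_kl(sims, sims)
```

With equal temperatures the two distributions coincide and the loss vanishes; the temperature gap is what forces the student to keep secondary similarities the sharp teacher suppresses.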
SNAP: Synthetically Negative Augmented Pretraining for Vision-Language Models
N. Giakoumoglou, P. Giakoumoglou, K. M. Papadopoulos, A. Floros, T. Stathaki
Under review...
Vision-language contrastive pretraining relies on large batches of randomly sampled image-text pairs to provide negative examples. Prior approaches address this by generating hard negatives in the input space—rewriting captions with LLMs or synthesizing images with diffusion models—but incur substantial computational overhead and typically augment only one modality. Synthetic hard negatives generated in the representation space have proven effective for unimodal self-supervised learning, but extending them to vision-language models that align two distinct modalities via an InfoNCE objective is not straightforward. We identify two failure modes: cross-modal synthetic negatives fall into the modality gap and are trivially rejected, while intra-modal negatives involving the positive pair suffer from positive leakage that sends contradictory gradients. Both failure modes additionally cause the learnable temperature to diverge. We propose SNAP, which generates intra-modal hard negatives that never involve the positive from either modality, avoiding both failure modes entirely. SNAP is model-agnostic, requires no external generative models, and adds less than 9% training time overhead. Evaluated on top of CLIP and FLIP across multiple architectures and datasets, SNAP delivers consistent improvements on zero-shot retrieval, zero-shot classification, and linear probe evaluation.
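The constraint that synthetic negatives never involve the positive pair can be sketched as an index-selection rule before mixing. The helper names and the convex-combination mixer below are illustrative assumptions, not SNAP's actual formulation.

```python
def intra_modal_candidates(batch_size, positive_index):
    """Indices eligible for synthetic-negative mixing within one modality,
    excluding the positive pair's index to avoid positive leakage.
    Illustrative sketch; SNAP's actual mixing rule is not reproduced."""
    return [i for i in range(batch_size) if i != positive_index]

def mix_negatives(embeddings, i, j, alpha=0.5):
    """Convex combination of two same-modality embeddings."""
    return [alpha * a + (1 - alpha) * b
            for a, b in zip(embeddings[i], embeddings[j])]

# for anchor 1 in a batch of 4, only the other three items may be mixed
cands = intra_modal_candidates(4, positive_index=1)
mixed = mix_negatives([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]], 0, 1)
```

Staying intra-modal sidesteps the modality gap, and excluding the positive index keeps gradients from the synthetic negatives consistent with the InfoNCE attraction term.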
Expert Clustering and Knowledge Transfer for Whole Slide Image Classification
K. M. Papadopoulos, N. Giakoumoglou, A. Floros, P. L. Dragotti, T. Stathaki
Accepted for presentation at ISBI 2026 Main Conference (Oral)
Multiple Instance Learning (MIL) is widely adopted for Whole Slide Image (WSI) classification. Existing MIL methods suffer from representation bottlenecks where slide-level aggregation compresses diverse patch information, limiting performance. Our proposed Divide-and-Distill (D&D) framework addresses this by partitioning the feature space into representation-coherent clusters, training specialized expert models on each cluster, and distilling their collective knowledge into a unified model. This reduces information compression loss while maintaining inference efficiency. Experiments across three datasets and six MIL methods demonstrate consistent performance gains without added inference cost.
A Multimodal Approach for Cross-Domain Image Retrieval
L. Iijima, N. Giakoumoglou and T. Stathaki
Accepted for presentation at VISAPP 2026 Main Conference (Poster)
Cross-Domain Image Retrieval (CDIR) is a challenging task in computer vision, aiming to match images across different visual domains such as sketches, paintings, and photographs. Traditional approaches focus on visual image features and rely heavily on supervised learning with labeled data and cross-domain correspondences, and consequently often struggle with the significant domain gap. This paper introduces a novel unsupervised approach to CDIR that incorporates textual context by leveraging pre-trained vision-language models. Our method, dubbed Caption-Matching (CM), uses generated image captions as a domain-agnostic intermediate representation, enabling effective cross-domain similarity computation without the need for labeled data or fine-tuning. We evaluate our method on standard CDIR benchmark datasets, demonstrating state-of-the-art performance in unsupervised settings with improvements of 24.0% on Office-Home and 132.2% on DomainNet over previous methods. We also demonstrate our method's effectiveness on a dataset of AI-generated images from Midjourney, showcasing its ability to handle complex, multi-domain queries.
Mitigating Representation Bottlenecks in Multiple Instance Learning
K. M. Papadopoulos, N. Giakoumoglou, A. Floros, T. Stathaki
Accepted for presentation at NeurIPS 2025 Workshop "Medical Imaging meets EurIPS (MedEurIPS)"
Multiple Instance Learning (MIL) is widely used for Whole Slide Image classification in computational pathology, yet existing approaches suffer from a representation bottleneck where diverse patch-level features are compressed into a single slide-level embedding. We propose Divide-and-Distill (D&D), which clusters the feature space into coherent regions, trains expert models on each cluster, and distills their knowledge into a unified model. Experiments demonstrate that D&D consistently improves six state-of-the-art MIL methods in both accuracy and AUC while maintaining single-model inference efficiency.
Cluster Contrast for Unsupervised Visual Representation Learning
N. Giakoumoglou, T. Stathaki
Accepted for presentation at ICIP 2025 Main Conference
[ieeexplore] [arXiv] [pdf] [bibtex]
We introduce Cluster Contrast (CueCo), a novel approach to unsupervised visual representation learning that effectively combines the strengths of contrastive learning and clustering methods. Inspired by recent advancements, CueCo is designed to simultaneously scatter and align feature representations within the feature space. This method utilizes two neural networks, a query and a key, where the key network is updated through a slow-moving average of the query outputs. CueCo employs a contrastive loss to push dissimilar features apart, enhancing inter-class separation, and a clustering objective to pull together features of the same cluster, promoting intra-class compactness. Our method achieves 91.40% top-1 classification accuracy on CIFAR-10, 68.56% on CIFAR-100, and 78.65% on ImageNet-100 using linear evaluation with a ResNet-18 backbone. By integrating contrastive learning with clustering, CueCo sets a new direction for advancing unsupervised visual representation learning.
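The slow-moving average of the query outputs mentioned above is the standard momentum (EMA) update of the key network. A minimal sketch follows; the momentum value is an assumption.

```python
def momentum_update(key_params, query_params, m=0.999):
    """EMA update of the key network's parameters toward the query's:
    key <- m * key + (1 - m) * query, applied element-wise.
    Illustrative sketch of a momentum key encoder."""
    return [m * k + (1 - m) * q for k, q in zip(key_params, query_params)]

# the key network slowly drifts toward the (fixed) query parameters
key = [0.0, 0.0]
query = [1.0, 2.0]
for _ in range(1000):
    key = momentum_update(key, query)
```

With m = 0.999 the key network changes very slowly, which keeps the contrastive targets consistent across training steps.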
Training Self-Supervised Vision Transformers with Synthetic Data and Synthetic Hard Negatives
N. Giakoumoglou, A. Floros, K. M. Papadopoulos, T. Stathaki
Accepted for presentation at ICCV 2025 Workshop "Representation Learning with Very Limited Resources: When Data, Modalities, Labels, and Computing Resources are Scarce"
[openreview] [arXiv] [pdf] [bibtex]
This paper does not introduce a new method per se. Instead, we build on existing self-supervised learning approaches for vision, drawing inspiration from the adage "fake it till you make it". While contrastive self-supervised learning has achieved remarkable success, it typically relies on vast amounts of real-world data and carefully curated hard negatives. To explore alternatives to these requirements, we investigate two forms of "faking it" in vision transformers. First, we study the potential of generative models for unsupervised representation learning, leveraging synthetic data to augment sample diversity. Second, we examine the feasibility of generating synthetic hard negatives in the representation space, creating diverse and challenging contrasts. Our framework—dubbed Syn2Co—combines both approaches and evaluates whether synthetically enhanced training can lead to more robust and transferable visual representations on DeiT-S and Swin-T architectures. Our findings highlight the promise and limitations of synthetic data in self-supervised learning, offering insights for future work in this direction.
Unsupervised Training of Vision Transformers with Synthetic Negatives
N. Giakoumoglou, A. Floros, K. M. Papadopoulos, T. Stathaki
Accepted for presentation at CVPR 2025 Workshop "Second Workshop on Visual Concepts"
[openreview] [arXiv] [pdf] [suppl] [bibtex] [code]
This paper does not introduce a novel method per se. Instead, we address the neglected potential of hard negative samples in self-supervised learning. Previous works explored synthetic hard negatives but rarely in the context of vision transformers. We build on this observation and integrate synthetic hard negatives to improve vision transformer representation learning. This simple yet effective technique notably improves the discriminative power of learned representations. Our experiments show performance improvements for both DeiT-S and Swin-T architectures.
Discriminative and Consistent Representation Distillation
N. Giakoumoglou and T. Stathaki
Under review...
What Makes Pretraining Data Good for Self-Supervised Learning?
N. Giakoumoglou, A. Floros, K. M. Papadopoulos, T. Stathaki
Under review...
Open-World Semantic Segmentation with Sensitivity Modeling
A. R. Varvarigos, N. Giakoumoglou, T. Stathaki
Under review...
A Review on Discriminative Self-supervised Learning Methods
N. Giakoumoglou, T. Stathaki, A. Gkelias
Under review...
A Review on Artificial Intelligence Methods for Plant Disease and Pest Detection
N. Giakoumoglou, D. Kapetas, K. M. Papadopoulos, P. Christakakis, T. Stathaki, E. M. Pechlivani
Under review...