Research
My research focuses on developing efficient training techniques for computer vision models, with a particular emphasis on understanding the adverse effect of noise and enhancing the robustness of deep learning architectures in real-world scenarios.
As a Ph.D. student, I investigate how various types of noise, such as low resolution and environmental distortions, impact the performance of deep learning models, including transformers and vision-language models (VLMs).
Previously, my Master's thesis centered on Person Re-Identification (ReID) in surveillance settings, a domain inherently affected by low-resolution and noisy visual data.
|
|
Coarse Attribute Prediction with Task Agnostic Distillation for Real World Clothes Changing ReID
Priyank Pathak,
Yogesh S. Rawat,
BMVC, 2025
arXiv
We introduce Robustness against Low-Quality (RLQ) for real-world clothes-changing ReID, making the model robust to low-quality artifacts such as pixelation, out-of-focus blur, and motion blur.
|
|
Colors See Colors Ignore: Clothes Changing ReID with Color Disentanglement
Priyank Pathak,
Yogesh S. Rawat,
ICCV, 2025
Paper /
Project Page /
Code
A lightweight, annotation-free proxy for mitigating appearance bias in ReID models when "expensive" clothing annotations are unavailable.
Colors See, Colors Ignore (CSCI) is an efficient RGB-only method that leverages color information directly from raw images or video frames.
|
|
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models
Priyank Pathak,
Shyam Marjit,
Shruti Vyas,
Yogesh S. Rawat,
ICLR, 2025
Paper /
Project Page /
Code
The LR0.FM benchmark evaluates the impact of low resolution on the zero-shot performance of VLMs via a novel Weighted Aggregated Robustness metric.
LR-TK0 enhances the robustness of VLMs against low resolution across several datasets without compromising their pre-trained weights.
|
|
Video person re-id: Fantastic techniques and where to find them (student abstract)
Priyank Pathak,
Amir Erfan Eshratifar,
Michael Gormish,
AAAI, 2020
Paper /
Code
One of the first works to tackle person ReID by exploring multiple techniques to improve the accuracy of a video ReID model.
|
|
Pixel Onion: Peeling Layers of Zero-Shot Object Detection in Pixelation
Priyank Pathak,
Mukilan Karuppasamy*,
Aaditya Baranwal,
3 more authors
Under Review
* equal contribution
Strips away the bells and whistles of SOTA zero-shot object detectors under resolution degradation ("pixelation"), exposing the true vulnerability of "backbones" and "datasets" (certain types of images are not affected by pixelation), with minimal impact from "language".
|
|
CosPlan: Corrective Sequential Planning with Visual-Language Models
Shresth Grover,
Priyank Pathak,
Akash Kumar,
2 more authors
Under Review
The CosPlan benchmark evaluates VLMs on "error-prone" sequence completion tasks, testing abilities such as Error Detection and Sequence Completion.
The novel SGI incrementally updates Scene Graphs to better handle decision making.
|
|
Research Assistant, National University of Singapore
Internship
Aug'22 - Dec'22
|
|
Research Engineer, Amobee
Full-time
June'20 - Aug'21
|
|
Deep Learning Research Intern, Clarifai
Internship
May'19 - Sept'19
|
|
Deep Learning Research Intern, Rice University
Internship
May'17 - Aug'17
|
"Aloo has no place in Biryani" - Every Desi Foodie
Built upon Jon Barron's template, with modifications based on Rohit Gupta's.