Datasets for Supervised Adversarial Attacks on Neural Rankers

Abstract

We introduce a novel dataset for adversarial rank attacks against neural rankers, enabling systematic research on robustness. Unlike prior unsupervised or surrogate-based methods, our approach uses Retrieval-Augmented Generation (RAG) with a Large Language Model (LLM) to create high-quality adversarial examples that subtly alter rankings while maintaining coherence and relevance. Built via a self-refining LLM-Ranker feedback loop, the dataset includes two tiers: Gold and Diamond, based on attack strength, along with rich metadata, ranking labels, and quality metrics. Released with code and prompts, it supports training, evaluation, and benchmarking of robust ranking systems.

Publication
Proceedings of the 34th ACM International Conference on Information and Knowledge Management

This work provides datasets and frameworks for studying adversarial attacks on neural ranking models.

Morteza Zihayat
Morteza Zihayat
Principal Investigator

Dr. Morteza Zihayat is a Canada Research Chair (CRC) in Human-Centered AI and Associate Professor at Toronto Metropolitan University, Faculty of Engineering and Architectural Science. He also holds appointments as Adjunct Associate Professor at the University of Waterloo (Management Sciences) and IBM Faculty Fellow at IBM Centre for Advanced Studies. He is the Director of the Human-Centered Machine Intelligence Lab.