ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories

Hai Ling1,2, Jia Guo3, Zhulin Tao1*, Yunkang Cao4, Donglin Di3, Hongyan Xu5, Xiu Su5, Yang Song6, Lei Fan6,2*
Arxiv 2025
1Communication University of China,
2DZ Matrix, 3Tsinghua University, 4Hunan University, 5Central South University, 6UNSW Sydney

*Corresponding Author
Image description

Overview of ADNet, covering 380 categories across five application domains.

Abstract

Anomaly detection (AD) aims to identify defects using normal-only training data. Existing anomaly detection benchmarks (e.g., MVTec-AD with 15 categories) cover only a narrow range of categories, limiting the evaluation of cross-context generalization and scalability. We introduce ADNet, a large-scale, multi-domain benchmark comprising 380 categories aggregated from 49 publicly available datasets across Electronics, Industry, Agrifood, Infrastructure, and Medical domains. The benchmark includes a total of 196,294 RGB images, consisting of 116,192 normal samples for training and 80,102 test images, of which 60,311 are anomalous. All images are standardized with MVTec-style pixel-level annotations and structured text descriptions spanning both spatial and visual attributes, enabling multimodal anomaly detection tasks. Extensive experiments reveal a clear scalability challenge: existing state-of-the-art methods achieve 90.6% I-AUROC in one-for-one settings but drop to 78.5% when scaling to all 380 categories in a multi-class setting. To this end, we propose Dinomalym, a context-guided Mixture-of-Experts extension of Dinomaly that expands decoder capacity without added inference cost. It achieves 83.2% I-AUROC and 93.1% P-AUROC, demonstrating superior performance over existing approaches. We aim to make ADNet a standardized and extensible benchmark, supporting the community in expanding anomaly detection datasets across diverse domains and providing a scalable foundation for AD foundation models.

Dataset Information

The ADNet dataset provides a comprehensive collection of high-quality images for anomaly detection. It includes over 196,294 RGB images, consisting of 116,192 normal samples for training and 80,102 test images, of which 60,311 are anomalous. The dataset supports tasks such as anomaly detection.

The ADNet dataset is licensed under the Creative Commons BY-NC-SA 4.0 license. Note that All data must not be used for commercial purposes.

Open-source Plan

Our open-source roadmap is as follows. We will update the status here as we make progress.

The open-source release is tentatively planned to start after the submitted paper receives a preliminary acceptance decision (e.g., major or minor revision), depending on publisher and institutional requirements.

Phase 1 Planned

Open-source ADNet image data for one representative domain.

Phase 2 Planned

Open-source text labels and annotations (e.g., descriptions and attributes) for released domains.

Phase 3 Planned

Open-source ADNet data for all remaining domains.

Dataset Access & Registration

Download ADNet (Coming Soon). Registration is required and you will be notified when the dataset becomes available.