Home
Scholarly Works
VarAD: Lightweight High-Resolution Image Anomaly...
Journal article

VarAD: Lightweight High-Resolution Image Anomaly Detection via Visual Autoregressive Modeling

Abstract

This article addresses a practical task: high-resolution image anomaly detection (HRIAD). In comparison to conventional image anomaly detection for low-resolution images, HRIAD imposes a heavier computational burden and necessitates superior global information capture capacity. To tackle HRIAD, this article translates image anomaly detection into visual token prediction and proposes visual autoregressive modeling-based anomaly detection (VarAD) based on visual autoregressive modeling for token prediction. Specifically, VarAD first extracts multihierarchy and multidirectional visual token sequences, and then employs an advanced model, Mamba, for visual autoregressive modeling and token prediction. During the prediction process, VarAD effectively exploits information from all preceding tokens to predict the target token. Finally, the discrepancies between predicted tokens and original tokens are utilized to score anomalies. Comprehensive experiments on four publicly available datasets and a real-world button inspection dataset demonstrate that the proposed VarAD achieves superior HRIAD performance while maintaining lightweight, rendering VarAD a viable solution for HRIAD.

Authors

Cao Y; Yao H; Luo W; Shen W

Journal

IEEE Transactions on Industrial Informatics, Vol. 21, No. 4, pp. 3246–3255

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publication Date

January 1, 2025

DOI

10.1109/tii.2024.3523574

ISSN

1551-3203

Contact the Experts team