Skip to main navigation Skip to search Skip to main content

CatAID: Category-Guided AI-Generated Image Detection via Vision-Language Model Adaptation

  • Yu Cai
  • , Shan Jia
  • , Jiahe Tian
  • , Jiao Dai
  • , Jizhong Han
  • , Siwei Lyu
  • SUNY Buffalo
  • CAS - Institute of Information Engineering

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The growing use of AI-generated content (AIGC) has spurred research on detecting AI-generated images. While current methods prioritize generalization capability across unseen generators, they may overlook the limitation posed by unseen semantic content. This work seeks a semanticagnostic detection method for better generalization and robustness. Leveraging the powerful semantic representation of Vision-Language Models (VLMs), we propose a Category-guided AI-generated Image Detection, termed CatAID. Our approach integrates Category-Contextualized Prompts to adapt VLMs for the detection task, informing the detector with explicit semantic concepts. This reduces reliance on semantic learning and thereby encourages the VLM's responses to align with more semantic-invariant forgery patterns. Extensive evaluation of AI-generated images from 33 generators in 4 datasets and various unknown semantic contents demonstrates the improved performance of CatAID over state-of-the-art methods.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1564-1574
Number of pages11
ISBN (Electronic)9798331589882
DOIs
StatePublished - 2025
Event2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025 - Honolulu, United States
Duration: Oct 19 2025Oct 20 2025

Publication series

NameProceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025

Conference

Conference2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025
Country/TerritoryUnited States
CityHonolulu
Period10/19/2510/20/25

Keywords

  • aigc security
  • deepfake
  • media forensics

Fingerprint

Dive into the research topics of 'CatAID: Category-Guided AI-Generated Image Detection via Vision-Language Model Adaptation'. Together they form a unique fingerprint.

Cite this