Skip to main navigation Skip to search Skip to main content

Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

  • Shan Jia
  • , Reilin Lyu
  • , Kangran Zhao
  • , Yize Chen
  • , Zhiyuan Yan
  • , Yan Ju
  • , Chuanbo Hu
  • , Xin Li
  • , Baoyuan Wu
  • , Siwei Lyu
  • SUNY Buffalo
  • Williamsville East High School
  • The Chinese University of Hong Kong, Shenzhen
  • SUNY Albany

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

41 Scopus citations

Abstract

DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. We conducted qualitative and quantitative experiments to demonstrate multimodal LLMs and show that they can expose AI-generated images through careful experimental design and prompt engineering. This is interesting, considering that LLMs are not inherently tailored for media forensic tasks, and the process does not require programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
PublisherIEEE Computer Society
Pages4324-4333
Number of pages10
ISBN (Electronic)9798350365474
DOIs
StatePublished - 2024
Event2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024 - Seattle, United States
Duration: Jun 16 2024Jun 22 2024

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
ISSN (Print)2160-7508
ISSN (Electronic)2160-7516

Conference

Conference2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Country/TerritoryUnited States
CitySeattle
Period06/16/2406/22/24

Keywords

  • Deepfake Detection
  • GPT4V
  • Media Forensics
  • Multimodal Large Language Models

Fingerprint

Dive into the research topics of 'Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics'. Together they form a unique fingerprint.

Cite this