Mastering GenAI Attribution: A Comprehensive Benchmark for Image Generator Identification
Presented by Deep Media AI (See our other research in Deepfake Detection here) Published: 2024 - 08 - 07
Updated: 2024 - 08 - 14 Dataset ID: 240807
https://www.loom.com/share/3ad66853ca084babb72eafc929a271ce?sid=9367d4da-0d7a-429f-a376-a477911a9ac3
Fill out this form with your information and someone from the Deep Media research term will grant you access: https://docs.google.com/forms/d/e/1FAIpQLSeTupVdPJDJM6hAEYDCD5ndIu1lvqZhc_xR_woC5hike532hw/viewform?usp=sf_link
As AI-generated images flood our digital landscape, the ability to attribute these creations to specific generators has become crucial for maintaining digital integrity and trust. Deep Media AI, a trailblazer in Deepfake Detection, now turns its expertise to the challenge of GenAI Image Generator Attribution.
We are proud to introduce our GenAI Image Generator Attribution Lab Benchmark Dataset, a powerful new tool in the fight against digital misinformation and fraud.
Key Features:
8,000 high-fidelity images, representing the most comprehensive and current collection of AI-generated content for attribution research
Equal distribution across four leading generators: DALL-E, Midjourney, Stable Diffusion, and Adobe, providing a balanced representation of current market leaders
This dataset is more than a research tool; it's a catalyst for innovation in the rapidly evolving field of AI-generated content attribution. As the line between human and AI-created content blurs, the ability to accurately identify the source of an image becomes paramount for industries ranging from journalism and law enforcement to digital rights management and social media moderation.
We call upon researchers, academic institutions, and industry leaders to harness this dataset in developing state-of-the-art attribution models. By fostering collaboration between academia and industry, we aim to create a safer, more transparent digital ecosystem where the provenance of visual content can be reliably determined.
Join us at the forefront of AI security and digital forensics. The future of online trust and accountability starts here.
On the Horizon: Expanding our benchmark to include emerging generators and modalities. As the generative AI landscape evolves, so does our commitment to staying ahead of the curve.
Firefly
DallE
MidJourney
Stable Diffusion
Our image classification model demonstrates strong performance in distinguishing between images generated by different AI models (DALLE, MidJourney, StableDiffusion, and Adobe Firefly) and potentially real images. The overall accuracy of 90.54% indicates that the model correctly classifies more than 9 out of 10 images, which is a robust result in the complex domain of AI-generated image detection.
Classification Report:
precision recall f1-score support
DALLE 0.92 0.91 0.92 2000
MidJourney 0.96 0.95 0.95 2000
StableDiffusion 0.90 0.92 0.91 2000
Adobe_Firefly 0.84 0.84 0.84 2000
accuracy 0.91 8000
macro avg 0.91 0.91 0.91 8000
weighted avg 0.91 0.91 0.91 8000
Examining the confusion matrix and classification report reveals several key insights:
These results are particularly impressive given the challenging nature of distinguishing between different AI-generated images. The high accuracy and balanced performance across multiple generators demonstrate the model's robustness and potential for real-world applications in detecting and classifying AI-generated content.
Areas for Improvement:
REAL: Firefly - PREDICTED: DallE
REAL: Firefly - PREDICTED: Stable Diffusion
REAL: DallE - PREDICTED: MidJourney
REAL: DallE - PREDICTED: Stable Diffusion
REAL: MidJourney - PREDICTED: Firefly
REAL: MidJourney - PREDICTED: Stable Diffusion
REAL: Stable Diffusion - PREDICTED: DallE
REAL: Stable Diffusion - PREDICTED: MidJourney
REAL: Firefly - PREDICTED: Firefly
REAL: MidJourney - PREDICTED: MidJourney
REAL: Firefly - PREDICTED: MidJourney
REAL: DallE - PREDICTED: Firefly
REAL: MidJourney - PREDICTED: Firefly
REAL: Stable Diffusion - PREDICTED: Firefly
REAL: Stable Diffusion - PREDICTED: Stable Diffusion
REAL: DallE - PREDICTED: DallE