DaBr01/Adversarial-Attacks-on-Neural-Text-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Robustness of Generative AI Detection: Adversarial Attacks on Black-Box Neural Text Detectors

The increased quality and human-likeness of AI-generated texts have resulted in a rising demand for neural text detectors, i.e., software that can detect whether a text was written by a human or generated by an AI. In our article "Robustness of Generative AI Detection: Adversarial Attacks on Black-Box Neural Text Detectors", we investigate a broad range of adversarial attacks on English texts against six different neural text detectors, including commercial and research tools. While the results show that no detector is completely invulnerable to adversarial attacks, the latest generation of commercial detectors proved to be very robust and was not significantly influenced by most of the evaluated attack strategies.

This repository contains the texts generated for our evaluation of adversarial attacks on neural text detectors, as well as the code used to conduct the experiments.
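For a flavor of what such an attack can look like, below is a minimal, hypothetical Python sketch of one well-known attack family against black-box detectors: homoglyph substitution. The detector callable, the character map, and the substitution rate are illustrative assumptions, not the repository's actual code or the specific attack set evaluated in the article.

import random

# Map a few Latin characters to visually similar Cyrillic homoglyphs
# (illustrative subset; a real attack would cover more characters).
HOMOGLYPHS = {"a": "\u0430", "e": "\u0435", "o": "\u043e", "p": "\u0440"}

def homoglyph_attack(text, rate=0.1):
    """Replace a fraction of substitutable characters with homoglyphs."""
    return "".join(
        HOMOGLYPHS[ch] if ch in HOMOGLYPHS and random.random() < rate else ch
        for ch in text
    )

def attack_and_compare(detector, texts):
    """Query a black-box detector before and after the perturbation.

    `detector` is any callable returning an AI-likelihood score in [0, 1];
    only query access is assumed, no gradients or model internals.
    """
    for text in texts:
        before = detector(text)
        after = detector(homoglyph_attack(text))
        print(f"score {before:.2f} -> {after:.2f}")

Because the attack only needs query access to the detector, it applies equally to commercial APIs and research prototypes, which is what makes the black-box setting the relevant threat model here.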

How to cite (to appear)

@article{fishchuk-braun-2024-robustness,
    title = "Robustness of Generative AI Detection: Adversarial Attacks on Black-Box Neural Text Detectors", 
    author = "Fishchuk, Vitalii and Braun, Daniel",
    journal = "International Journal of Speech Technology",
    year = "2024",
}
