Publication detail

Diffuse or Confuse: A Diffusion Deepfake Speech Dataset

FIRC, A. MALINKA, K. HANÁČEK, P.

Original Title

Diffuse or Confuse: A Diffusion Deepfake Speech Dataset

Type

conference paper

Language

English

Original Abstract

Advancements in artificial intelligence and machine learning have significantly improved synthetic speech generation. This paper explores diffusion models, a novel method for creating realistic synthetic speech. We create a diffusion dataset using available tools and pretrained models. Additionally, this study assesses the quality of diffusion-generated deepfakes versus non-diffusion ones and their potential threat to current deepfake detection systems. Findings indicate that the detection of diffusion-based deepfakes is generally comparable to non-diffusion deepfakes, with some variability based on detector architecture. Re-vocoding with diffusion vocoders shows minimal impact, and the overall speech quality is comparable to non-diffusion methods.

Keywords

deepfakes, deepfake speech, dataset, diffusion, detection

Authors

FIRC, A.; MALINKA, K.; HANÁČEK, P.

Released

11. 12. 2024

Publisher

GI - Group for computer science

Location

Darmstadt

ISBN

978-3-88579-749-4

Book

2024 International Conference of the Biometrics Special Interest Group (BIOSIG)

Pages from

1

Pages to

7

Pages count

7

URL

BibTex

@inproceedings{BUT189345,
  author="Anton {Firc} and Kamil {Malinka} and Petr {Hanáček}",
  title="Diffuse or Confuse: A Diffusion Deepfake Speech Dataset",
  booktitle="2024 International Conference of the Biometrics Special Interest Group (BIOSIG)",
  year="2024",
  pages="1--7",
  publisher="GI - Group for computer science",
  address="Darmstadt",
  doi="10.1109/BIOSIG61931.2024.10786752",
  isbn="978-3-88579-749-4",
  url="https://ieeexplore.ieee.org/document/10786752"
}