Publication detail

Semi-supervised deep learning approach to break common CAPTCHAs

BOŠTÍK, O. HORÁK, K. KRATOCHVÍLA, L. ZEMČÍK, T. BILÍK, Š.

Original Title

Semi-supervised deep learning approach to break common CAPTCHAs

Type

journal article in Web of Science

Language

English

Original Abstract

Manual data annotation is a time consuming activity. A novel strategy for automatic training of the CAPTCHA breaking system with no manual dataset creation is presented in this paper. We demonstrate the feasibility of the attack against a text-based CAPTCHA scheme utilizing similar network infrastructure used for Denial of Service attacks. The main goal of our research is to present a possible vulnerability in CAPTCHA systems when combining the brute-force attack with transfer learning. The classification step utilizes a simple convolutional neural network with 15 layers. Training stage uses automatically prepared dataset created without any human intervention and transfer learning for fine-tuning the deep neural network classifier. The designed system for breaking text-based CAPTCHAs achieved 80% classification accuracy after 6 fine-tuning steps for a 5 digit text-based CAPTCHA system. The results presented in this paper suggest, that even the simple attack with a large number of attacking computers can be an effective alternative to current CAPTCHA breaking systems.

Keywords

CAPTCHA;Semi-supervised learning;Convolutional Neural Networks

Authors

BOŠTÍK, O.; HORÁK, K.; KRATOCHVÍLA, L.; ZEMČÍK, T.; BILÍK, Š.

Released

12. 4. 2021

Publisher

Springer

Location

London

ISBN

0941-0643

Periodical

NEURAL COMPUTING & APPLICATIONS

Year of study

33

Number

20

State

United Kingdom of Great Britain and Northern Ireland

Pages from

13333

Pages to

13343

Pages count

11

URL

Full text in the Digital Library

BibTex

@article{BUT170906,
  author="Ondřej {Boštík} and Karel {Horák} and Lukáš {Kratochvíla} and Tomáš {Zemčík} and Šimon {Bilík}",
  title="Semi-supervised deep learning approach to break common CAPTCHAs",
  journal="NEURAL COMPUTING & APPLICATIONS",
  year="2021",
  volume="33",
  number="20",
  pages="13333--13343",
  doi="10.1007/s00521-021-05957-0",
  issn="0941-0643",
  url="https://link.springer.com/article/10.1007%2Fs00521-021-05957-0"
}