Reducing the Cost of Breaking Audio CAPTCHAs by Active and Semi-supervised Learning

Malte Darnstädt, Hendrik Meutzner, Dorothea Kolossa

Machine Learning and Applications (2014)

Abstract

CAPTCHAs are challenge-response tests that are widely used on the Internet to distinguish human users from machines. In addition to the well-known visual CAPTCHAs, most Internet services also provide an audio-based scheme, e.g., to enable access for visually impaired users. Recent research has shown that most CAPTCHAs are vulnerable, as they can be broken by machine learning techniques. However, such automated attacks come at a relatively high cost, as they require human experts to create labels for the unlabeled CAPTCHA samples collected from a website in order to train an attacking system. In this work, we use active and semi-supervised learning methods for breaking audio CAPTCHAs. We show that these methods can reduce the labeling costs considerably, which increases the vulnerability of audio CAPTCHAs because automated attacks become even more worthwhile. In addition, our findings give insight into improvements to the design of CAPTCHAs, helping to harden prospective audio CAPTCHA schemes against active learning attacks in the future.

Keywords

authorisation, cost reduction, learning (artificial intelligence), speech recognition, internet services, web sites, active learning attacks, attacking system, audio captcha breaking, audio-based scheme, automated attacks, challenge-response tests, label creation, labeling cost reduction, machine learning techniques, prospective audio captcha schemes, semisupervised learning, unlabeled captcha, visually impaired users, active learning, audio captcha, automatic speech recognition, semi-supervised learning
