Transformer Encoder for Efficient CAPTCHA Recognize

Yaoting Li,Haixia Pan, Huolong Ye, Jiayu Zheng

2023 2nd International Conference on Cloud Computing, Big Data Application and Software Engineering (CBASE)(2023)

引用 0|浏览0
暂无评分
摘要
CAPTCHAs have become a key factor affecting the efficiency of Robotic Process Automation. In this study, we constructed a large-scale CAPTCHA dataset encompassing up to 60 diverse stylistic variations, offering robust data support for model resilience and adaptability. We designed a CAPTCHA image prediction reconstruction task, allowing for unsupervised feature encoding that implicitly models the image spatial domain features. This enables the acquisition of feature representations for various CAPTCHA styles in a common subspace. Additionally, character length prediction and character set prediction tasks were established to explicitly model text label features, fine-tuning network model parameters in alignment with recognition tasks. Through lightweight sequence modeling and text prediction modules, our design achieves high predictive accuracy while maintaining a controlled model size. Our proposed model attained a full-word accuracy rate of 92.47% on the multi-style CAPTCHA dataset, surpassing existing mainstream approaches.
更多
查看译文
关键词
CAPTCHA,Transformer,Unsupervised
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要