Transformer Encoder for Efficient CAPTCHA Recognize
2023 2nd International Conference on Cloud Computing, Big Data Application and Software Engineering (CBASE)(2023)
摘要
CAPTCHAs have become a key factor affecting the efficiency of Robotic Process Automation. In this study, we constructed a large-scale CAPTCHA dataset encompassing up to 60 diverse stylistic variations, offering robust data support for model resilience and adaptability. We designed a CAPTCHA image prediction reconstruction task, allowing for unsupervised feature encoding that implicitly models the image spatial domain features. This enables the acquisition of feature representations for various CAPTCHA styles in a common subspace. Additionally, character length prediction and character set prediction tasks were established to explicitly model text label features, fine-tuning network model parameters in alignment with recognition tasks. Through lightweight sequence modeling and text prediction modules, our design achieves high predictive accuracy while maintaining a controlled model size. Our proposed model attained a full-word accuracy rate of 92.47% on the multi-style CAPTCHA dataset, surpassing existing mainstream approaches.
更多查看译文
关键词
CAPTCHA,Transformer,Unsupervised
AI 理解论文
溯源树
样例
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要