Comparison of CNN’s Architecture GoogleNet, AlexNet, VGG-16, Lenet -5, Resnet-50 in Arabic Handwriting Pattern Recognition

Kinetik : game technology, information system, computer network, computing, electronics, and control(2023)

引用 0|浏览0
暂无评分
摘要
The Arabic script is written from right to left and consists of 28 characters, with no capital or lowercase letters. The Arabic script has several orthographic and morphological properties that make handwriting recognition of the Arabic script challenging. In addition, one of the biggest challenges in recognizing Arabic script patterns is the different handwriting styles and characters of each person's writing. The authors propose a study to compare the accuracy of handwriting pattern recognition in Arabic script which has been done previously by comparing five CNN architectures, namely GoogleNet, AlexNet, VGG-16, LeNet-5, and ResNet-50. Considering that previous research has not obtained excellent accuracy. The number of datasets used is 8400 image data and the most optimal comparison of testing and training data is 80:20. Based on the research that has been done, there are several things that the author can conclude. The model is made using 64 filters for each convolution layer because the optimal size is used for 5 architectures, kernel size is 3x3, neurons is 128, dropout weight is 50% to reduce overfitting, learning rate is 0.001, image size is 64x64, the normalization method with the ReLU activation function, and 1-dimensional input image (grayscale), and with a comparison of testing and training data of 80:20. The VGG-16 architectural model is the architecture that gets the highest score, namely 83.99%. This can have good potential to be developed as a medium for learning Arabic script.
更多
查看译文
关键词
arabic,alexnet,cnns,architecture googlenet,recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要