Читать книгу Machine Vision Inspection Systems, Machine Learning-Based Approaches - Группа авторов - Страница 32

2.4.2 Within Language Classification

Оглавление

In n-way testing, we use characters from different languages, but the accuracy obtained for individual language is the main determinant for research. Language-wise classification accuracy was evaluated by preparing one-shot tasks with characters taken from a single alphabet, and the results were illustrated in Table 2.3. These results are based on the nearest neighbour, 1-shot capsule network classifications within individual alphabets. We have selected the Nearest neighbor method because it is a simpler classification method that uses raw pixel values. Thus, it is evident that language level classification accuracy is proportional to the number of characters in the language. Another critical factor that influences accuracy is the structural similarity between characters.


Figure 2.5 Omniglot n-shot n-way learning performance.

Table 2.3 Classification accuracies within individual alphabets.

Model Characters Nearest neighbor 1-shot capsule network
Aurek-Besk 25 6.40% 84.40%
Angelic 19 6.32% 76.84%
Keble 25 2.00% 71.20%
Atemayar Qelisayer 25 4.00% 62.80%
Tengwar 24 3.33% 62.08%
ULOG 25 3.60% 61.60%
Syriac (Serrto) 22 6.36% 58.64%
Atlantean 25 2.80% 58.00%
Avesta 25 5.20% 57.60%
Cyrillic 44 2.05% 57.05%
Sinhala 60 1.00% 56.22%
Ge`ez 25 1.60% 52.40%
Mongolian 29 4.83% 52.07%
Glagolitic 44 1.82% 50.68%
Manipuri 39 3.08% 50.51%
Malayalam 46 3.26% 45.87%
Tibetan 41 2.93% 45.61%
Sylheti 27 4.07% 40.37%
Gurmukhi 44 2.27% 38.41%
Oriya 45 1.56% 33.33%
Kannada 40 1.00% 29.25%

For further analysis, we consider the alphabet models with the same number of characters and those have shown the highest and lowest classification accuracies. Consider the characters of Gurmukhi (38.41% accuracy) and Cyrillic (57.05% accuracy), which has the same number of characters (44), but accuracy differs by 18.64%. The accuracy difference could be due to the structural similarity between characters in those alphabets. Figure 2.6 shows the two alphabets. Due to the same reason, we get lower accuracies for within language classification compared to mixed language n-way classification as described in Section 2.4.1.

Further, in an attempt to boost the accuracies in classification, we have used n-shot learning, while keeping 10 images for each character in the alphabet as the reference set and 10 images for averaging the results. By this experiment, we obtained 7 to 15% accuracy improvement resulting in 94% highest accuracy for Aurek-Besh language and 40% lowest accuracy for Oriya language, respectively.

Machine Vision Inspection Systems, Machine Learning-Based Approaches

Подняться наверх