New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classi¯cation

Publisher:
WORLD SCIENTIFIC PUBL CO PTE LTD
Publication Type:
Journal Article
Citation:
International Journal of Pattern Recognition and Artificial Intelligence, 2022, 36, (9)
Issue Date:
2022-07-01
Filename Description Size
20886493_10916294780005671.pdfPublished version3.37 MB
Adobe PDF
Full metadata record
Document age estimation using handwritten text line images is useful for several pattern recognition and arti¯cial intelligence applications such as forged signature veri¯cation, writer identi¯cation, gender identi¯cation, personality traits identi¯cation, and fraudulent document identi¯cation. This paper presents a novel method for document age classi¯cation at the text line level. For segmenting text lines from handwritten document images, the wavelet decomposition is used in a novel way. We explore multiple levels of wavelet decomposition, which introduce blur as the number of levels increases for detecting word components. The detected components are then used for a direction guided-driven growing approach with linearity, and nonlinearity criteria for segmenting text lines. For classi¯cation of text line images of di®erent ages, inspired by the observation that, as the age of a document increases, the quality of its image degrades, the proposed method extracts the structural, contrast, and spatial features to study degradations at di®erent wavelet decomposition levels. The speci¯c advantages of DenseNet, namely, strong feature propagation, mitigation of the vanishing gradient problem, reuse of features, and the reduction of the number of parameters motivated us to use DenseNet121 along with a Multi-layer Perceptron (MLP) for the classi¯cation of text lines of di®erent ages by feeding features and the original image as input. To demonstrate the e±cacy of the proposed model, experiments were conducted on our own as well as standard datasets for both text line segmentation and document age classi¯cation. The results show that the proposed method outperforms the existing methods for text line segmentation in terms of precision, recall, F-measure, and document age classi¯cation in terms of average classi¯cation rate.
Please use this identifier to cite or link to this item: