Deformity Removal from Handwritten Text Documents using Variable CycleGAN

Nigam, Shivangi; Behera, Adarsh Prasad; Sherma, Shekhar; Nagabhushan, P.

doi:10.1007/s10032-024-00466-x

dc.contributor.author	Nigam, Shivangi
dc.contributor.author	Behera, Adarsh Prasad
dc.contributor.author	Sherma, Shekhar
dc.contributor.author	Nagabhushan, P.
dc.date.accessioned	2024-05-22T11:16:21Z
dc.date.available	2024-05-22T11:16:21Z
dc.date.issued	2024-05-07
dc.identifier.issn	1433-2825	es
dc.identifier.uri	https://hdl.handle.net/20.500.12761/1820
dc.description.abstract	Text recognition systems typically work well for printed documents but struggle with handwritten documents due to different writing styles, background complexities, added noise of image acquisition methods, and deformed text images such as strikeoffs and underlines. These deformities change the structural information, making it difficult to restore the deformed images while maintaining the structural information and preserving the semantic dependencies of the local pixels. Current adversarial networks are unable to preserve the structural and semantic dependencies as they focus on individual pixel-to-pixel variation and encourage non-meaningful aspects of the images. To address this, we propose a Variable Cycle Generative Adversarial Network (VCGAN) that considers the perceptual quality of the images. By using a variable Content Loss (Top-k Variable Loss (TVk) ), VCGAN preserves the inter-dependence of spatially close pixels while removing the strike-off strokes. The similarity of the images is computed with TVk considering the intensity variations that do not interfere with the semantic structures of the image. Our results show that VCGAN can remove most deformities with an elevated F1 score of 97.40% and outperforms current state-of-the-art algorithms with a character error rate of 7.64% and word accuracy of 81.53% when tested on the handwritten text recognition system.	es
dc.language.iso	eng	es
dc.publisher	Springer Berlin Heidelberg	es
dc.title	Deformity Removal from Handwritten Text Documents using Variable CycleGAN	es
dc.type	journal article	es
dc.journal.title	International Journal on Document Analysis and Recognition (IJDAR)	es
dc.type.hasVersion	VoR	es
dc.rights.accessRights	open access	es
dc.identifier.doi	10.1007/s10032-024-00466-x	es
dc.page.final	13	es
dc.page.initial	1	es
dc.subject.keyword	Handwritten text	es
dc.subject.keyword	Strike-off	es
dc.subject.keyword	Semantics	es
dc.subject.keyword	Generative adversarial network	es
dc.subject.keyword	Image-to-image translation	es
dc.description.refereed	TRUE	es
dc.description.status	pub	es

Files in this item

Name:: s10032-024-00466-x.pdf
Size:: 1.397Mb
Format:: PDF
Description:: Main Article

This item appears in the following Collection(s)

IMDEA Networks

Show simple item record