Hermite and Gabor transforms for noise reduction and handwriting classification in ancient manuscripts
Springer Science and Business Media LLC - Volume 9 - Pages 101-122 - 2007
Véronique Eglin, Stéphane Bres, Carlos Rivero
In this paper, we propose a biologically inspired, global and segmentation-free methodology for manuscript noise reduction and classification. Our method consists of developing well-adapted tools for writing enhancement, background noise, text and drawing separation and handwritten patterns characterization with orientation features. We have used here analysis of handwritten images in the spectral...
Comic MTL: optimized multi-task learning for comic book image analysis
Springer Science and Business Media LLC - Volume 22 - Pages 265-284 - 2019
Nhu-Van Nguyen, Christophe Rigaud, Jean-Christophe Burie
Comic image analysis approaches typically propose multiple algorithms or models for different tasks such as panel detection, character (body and face) detection, speech balloon segmentation, text recognition, etc. In this work, we aim to reduce the processing time for comic image analysis by proposing a model capable of learning multiple tasks, called...
#comic image analysis #multi-task learning #character detection #speech balloon segmentation #relationship between characters and speech balloons
Large-scale genealogical information extraction from handwritten Quebec parish records
Springer Science and Business Media LLC - Volume 26 - Pages 255-272 - 2023
Solène Tarride, Martin Maarand, Mélodie Boillet, James McGrath, Eugénie Capel, Hélène Vézina, Christopher Kermorvant
This paper presents a complete workflow designed for extracting information from Quebec handwritten parish registers. The acts in these documents contain individual and family information highly valuable for genetic, demographic and social studies of the Quebec population. From an image of parish records, our workflow is able to identify the acts and extract personal information. The workflow is d...
On the improvement of handwritten text line recognition with octave convolutional recurrent neural networks
Springer Science and Business Media LLC - 2024
Dayvid Castro, Cleber Zanchettin, Luís A. Nunes Amaral
Off-line handwritten text recognition (HTR) poses a significant challenge due to the complexities of variable handwriting styles, background degradation, and unconstrained word sequences. This work tackles the handwritten text line recognition problem using octave convolutional recurrent neural networks (OctCRNN). Our approach requires no word segmentation, preprocessing, or explicit feature extra...
Devanagari OCR using a recognition driven segmentation framework and stochastic language models
Springer Science and Business Media LLC - Volume 12 - Pages 123-138 - 2009
Suryaprakash Kompalli, Srirangaraj Setlur, Venu Govindaraju
This paper describes a novel recognition driven segmentation methodology for Devanagari Optical Character Recognition. Prior approaches have used sequential rules to segment characters followed by template matching for classification. Our method uses a graph representation to segment characters. This method allows us to segment horizontally or vertically overlapping characters as well as those con...
Binarization of degraded document image based on feature space partitioning and classification
Springer Science and Business Media LLC - Volume 15 - Pages 57-69 - 2010
Morteza Valizadeh, Ehsanollah Kabir
In this paper, we propose a new algorithm for the binarization of degraded document images. We map the image into a 2D feature space in which the text and background pixels are separable, and then we partition this feature space into small regions. These regions are labeled as text or background using the result of a basic binarization algorithm applied on the original image. Finally, each pixel o...
Making scanned Arabic documents machine accessible using an ensemble of SVM classifiers
Springer Science and Business Media LLC - Volume 21 - Pages 59-75 - 2018
Randa Elanwar, Wenda Qin, Margrit Betke
Raster-image PDF files originating from scanning or photographing paper documents are inaccessible to both text search engines and screen readers that people with visual impairments use. We here focus on the relatively less-researched problem of converting raster-image files with Arabic script into machine-accessible documents. Our method, called ECDP for “Ensemble-based classification of document...