Turkish OCR on mobile and scanned document images

Karasu, Kurtuluş; Baştan, Muhammet

Turkish OCR on mobile and scanned document images

Tarih

2015

Yazarlar

Karasu, Kurtuluş

Baştan, Muhammet

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Optical character recognition (OCR) systems have been widely used to convert documents into digital form. There are lots of both commercial and open source OCR systems available, but a benchmark on Turkish OCR is nonexistent. In this work, we first prepared two publicly available datasets for Turkish OCR, consisting of scanned document images and mobile camera captured document images. Then, we evaluated the Turkish OCR performance of three popular open source OCR systems (Tesseract, CuneiForm, GOCR) on the datasets. Tesseract outperformed the other two on both datasets. © 2021 Elsevier B.V., All rights reserved.

Açıklama

2015 23rd Signal Processing and Communications Applications Conference, SIU 2015 -- -- Malatya; Inonu Universitesi -- 113052

Anahtar Kelimeler

benchmark, dataset, mobile device, scanner, Tesseract, Turkish OCR

Scopus Q Değeri

N/A

Bağlantı

https://doi.rog/10.1109/SIU.2015.7130278
https://hdl.handle.net/20.500.12899/3144

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Turkish OCR on mobile and scanned document images

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon