Turkish OCR on mobile and scanned document images

dc.contributor.authorKarasu, Kurtuluş
dc.contributor.authorBaştan, Muhammet
dc.date.accessioned2025-10-24T18:06:41Z
dc.date.available2025-10-24T18:06:41Z
dc.date.issued2015
dc.departmentMalatya Turgut Özal Üniversitesi
dc.description2015 23rd Signal Processing and Communications Applications Conference, SIU 2015 -- -- Malatya; Inonu Universitesi -- 113052
dc.description.abstractOptical character recognition (OCR) systems have been widely used to convert documents into digital form. There are lots of both commercial and open source OCR systems available, but a benchmark on Turkish OCR is nonexistent. In this work, we first prepared two publicly available datasets for Turkish OCR, consisting of scanned document images and mobile camera captured document images. Then, we evaluated the Turkish OCR performance of three popular open source OCR systems (Tesseract, CuneiForm, GOCR) on the datasets. Tesseract outperformed the other two on both datasets. © 2021 Elsevier B.V., All rights reserved.
dc.identifier.doi10.1109/SIU.2015.7130278
dc.identifier.endpage2077
dc.identifier.isbn9781467373869
dc.identifier.scopus2-s2.0-84939160527
dc.identifier.scopusqualityN/A
dc.identifier.startpage2074
dc.identifier.urihttps://doi.rog/10.1109/SIU.2015.7130278
dc.identifier.urihttps://hdl.handle.net/20.500.12899/3144
dc.indekslendigikaynakScopus
dc.language.isotr
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzScopus_20251023
dc.subjectbenchmark
dc.subjectdataset
dc.subjectmobile device
dc.subjectscanner
dc.subjectTesseract
dc.subjectTurkish OCR
dc.titleTurkish OCR on mobile and scanned document images
dc.title.alternativeMobil Cihaz ve Tarayici Görüntülerinde Türkçe Karakter Tanima
dc.typeConference Object

Dosyalar