Skip to main content

Showing 1–2 of 2 results for author: Keskustalo, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.00369  [pdf

    cs.CL

    Optical character recognition quality affects perceived usefulness of historical newspaper clippings

    Authors: Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen

    Abstract: Introduction. We study effect of different quality optical character recognition in interactive information retrieval with a collection of one digitized historical Finnish newspaper. Method. This study is based on the simulated interactive information retrieval work task model. Thirty-two users made searches to an article collection of Finnish newspaper Uusi Suometar 1869-1918 with ca. 1.45 millio… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 21 pages, 6 figures, 2 tables, 1 appendix. arXiv admin note: substantial text overlap with arXiv:2203.03557

  2. arXiv:2203.03557  [pdf

    cs.IR cs.CL cs.DL

    OCR quality affects perceived usefulness of historical newspaper clippings -- a user study

    Authors: Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen

    Abstract: Effects of Optical Character Recognition (OCR) quality on historical information retrieval have so far been studied in data-oriented scenarios regarding the effectiveness of retrieval results. Such studies have either focused on the effects of artificially degraded OCR quality (see, e.g., [1-2]) or utilized test collections containing texts based on authentic low quality OCR data (see, e.g., [3]).… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: IRCDL2022

    Journal ref: IRCDL 2022 Italian Research Conference on Digital Libraries 2022, http://ceur-ws.org/Vol-3160/