Published version (PDF, 7 MB) | License: Creative Commons Attribution 4.0 International
Extracting Handwritten Annotations from Printed Documents Via Infrared Scanning
Schmid, Andreas; Heckelbacher, Lorenz and Wimmer, Raphael (2022)
Extracting Handwritten Annotations from Printed Documents Via Infrared Scanning.
In: CHI Conference on Human Factors in Computing Systems Extended Abstracts (CHI ’22 Extended Abstracts), April 29 - May 5, 2022, New Orleans, LA, USA.
Publication date of this full text: 28 Oct 2022 14:45
Conference or workshop contribution
DOI for citing this document: 10.5283/epub.53129
Abstract
Despite ever-improving digital ink and paper solutions, many people still prefer printing out documents for close reading, proofreading, or filling out forms. However, to incorporate paper-based annotations into digital workflows, handwritten text and markings need to be extracted. Common computer-vision and machine-learning approaches require extensive sets of training data or a clean digital version of the document. We propose a simple method for extracting handwritten annotations from laser-printed documents using multispectral imaging. While black toner absorbs infrared light, most inks are invisible in the infrared spectrum. We modified an off-the-shelf flatbed scanner by adding a switchable infrared LED to its light guide. By subtracting an infrared scan from a color scan, handwritten text and highlighting can be extracted and added to a PDF version. Initial experiments show accurate, high-quality results on a test data set of 93 annotated pages. Thus, infrared scanning seems like a promising building block for integrating paper-based and digital annotation practices.
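The subtraction step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `extract_annotations`, the `threshold` parameter, and its default value are hypothetical, and the sketch assumes that the color scan and the infrared scan are already pixel-aligned, have the same size, and are stored as 8-bit arrays.

```python
import numpy as np


def extract_annotations(color_scan, ir_scan, threshold=40):
    """Separate ink from toner by subtracting an infrared scan from a color scan.

    Hypothetical helper for illustration only.
    color_scan: uint8 array of shape (H, W, 3), the visible-light RGB scan.
    ir_scan:    uint8 array of shape (H, W), the infrared scan of the same page.
    threshold:  assumed cut-off on the darkness difference (0..255).
    Returns (rgba, mask): an RGBA image whose alpha channel is opaque only
    where an annotation was detected, and the boolean annotation mask.
    """
    color = color_scan.astype(np.int16)
    ir = ir_scan.astype(np.int16)

    # Darkness under visible light. Taking the darkest channel makes
    # saturated colors (pen ink, highlighter) count as dark too.
    visible_dark = 255 - color.min(axis=2)

    # Darkness under infrared light. Toner absorbs infrared and stays dark;
    # most inks do not and appear bright.
    ir_dark = 255 - ir

    # Toner is dark in both scans, so it cancels out. Ink remains.
    diff = np.clip(visible_dark - ir_dark, 0, 255)
    mask = diff > threshold

    # Keep the original colors, but make everything except ink transparent.
    rgba = np.zeros(color_scan.shape[:2] + (4,), dtype=np.uint8)
    rgba[..., :3] = color_scan
    rgba[..., 3] = np.where(mask, 255, 0)
    return rgba, mask
```

The resulting RGBA layer could then be overlaid on the original PDF page. In practice, a real pipeline would likely also need registration of the two scans and some noise filtering, which this sketch leaves out.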