
How Artificial Intelligence is Paving the Way for Text Recognition

In order to check and continuously improve the quality of our AI-supported document and text recognition, we have created large test vectors. These can also be used to precisely analyze the individual steps involved in preparing documents (learn more about this here).
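One common way to put a number on recognition quality over such a test set is the character error rate: the edit distance between the recognized text and a known reference text, divided by the length of the reference. The sketch below is only an illustration of that idea, not a description of our own tooling; the function names are made up for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn string a into string b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute (or keep on a match)
            ))
        previous = current
    return previous[-1]


def character_error_rate(reference: str, recognized: str) -> float:
    """Edit distance relative to the length of the reference text.
    0.0 means the recognized text matches the reference exactly."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return levenshtein(reference, recognized) / len(reference)
```

Running the same test documents through text recognition with and without pre-processing and comparing the two error rates would show how much each preparation step contributes.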

Submissions from the smartphone camera

When customers are asked to submit photos of documents themselves, there are often surprises: distortions, bent or crumpled paper, blur, rotations and more.

In the following, we would like to show you some particularly adventurous shots, which we of course took ourselves for testing. For this purpose, we printed out a publicly available contract template from BVAEB (Versicherungsanstalt öffentlich Bediensteter) and digitized it again with our smartphone in a particularly creative way.

Preparation with Artificial Intelligence

In each of the images, our cell phone photo is on the left and the result of our pipeline is on the right, in which several AI networks have corrected the image as well as possible. This version can then be run through the actual text recognition process, with much better results than from the original photo.


Our pre-processing manages to tighten the text and get it into straight lines, a very important preparation so that the text can subsequently be recognized.


Even very extreme folds in the paper can be mastered.


Two or more kinks, horizontal as well as vertical, are also “ironed out”.


Two pre-processing steps are relevant here: the crumpling of the paper must be corrected, and so must the bending of the entire sheet. As you can see, this works surprisingly well. The shadows on the paper do look more intense (as we increase the contrast); however, the text recognition can handle this well.
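Why do shadows look more intense once the contrast goes up? A small sketch with classic min-max contrast stretching shows the effect. This is a textbook technique used here purely for illustration; it is an assumption, not the method our AI networks use, and the function name and pixel values are made up.

```python
def stretch_contrast(image: list[list[int]]) -> list[list[int]]:
    """Linearly rescale 8-bit grayscale values so that the darkest pixel
    becomes 0 and the brightest pixel becomes 255."""
    low = min(min(row) for row in image)
    high = max(max(row) for row in image)
    if high == low:
        # A completely flat image has no contrast to stretch.
        return [row[:] for row in image]
    # Integer arithmetic keeps the result deterministic.
    return [[(pixel - low) * 255 // (high - low) for pixel in row]
            for row in image]


# Hypothetical row of pixels: dark text (100), a shadow (150), bright paper (200).
row = [[100, 150, 200]]
stretched = stretch_contrast(row)
```

In this made-up example the text and the paper are pushed to the two ends of the scale, and the gap between shadow and paper grows along with them. That is exactly why the shadow stands out more in the corrected image, while the text remains clearly separated from the background.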


This is the absolute extreme case: hopefully, customers sending in their papers like this is the exception. Nevertheless, the lines of the text can still be restored quite well. The font itself becomes italicized, but that is no problem for the subsequent text recognition.
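One way to picture the italicized look is as a horizontal shear: every point is shifted sideways in proportion to its height, so upright strokes lean over while each text line stays level. The following sketch rests on that simplifying assumption; the real distortion left over after flattening a crumpled page is certainly more complex, and the function name is hypothetical.

```python
def shear_x(points: list[tuple[float, float]],
            factor: float) -> list[tuple[float, float]]:
    """Shift each point horizontally by factor * y. The y coordinate is
    left unchanged, so points on the same text line stay on that line."""
    return [(x + factor * y, y) for x, y in points]


# An upright letter stem from the baseline (y = 0) up to height 10.
stem = [(0.0, 0.0), (0.0, 10.0)]

slanted = shear_x(stem, 0.2)       # the top of the stem leans to the right
restored = shear_x(slanted, -0.2)  # the opposite shear makes it upright again
```

Because the lines of text stay straight under such a transformation, a slanted but otherwise clean image remains easy material for text recognition.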


Again, the curved page results in an italicized, but very readable, new image.


And here’s a second extreme case: we clearly overdid it, because this attempt to photograph a sheet of paper is a joke, but we were still very pleasantly surprised by what our AI conjured out of it.


We uploaded this image in very low quality, which makes the first few sentences on the paper almost illegible. In the improved version, after straightening and de-blurring, it looks much better, and the result of our text recognition (shown below the images) surprised us very positively.

Try it yourself

If you have a lot of documents to deal with, or just want to see this exciting technology in action for yourself, just drop us a line. By the way, text recognition is part of link|that Prism.

Harald Kerschhofer

Harald was one of the first developers at link|that and has been producing creative content for and about our products since completing his media studies.


