AI offers an array of document processing opportunities

Rapid developments involving self-learning artificial intelligence (AI) platforms – which solve complex problems automatically – are now enabling information and business managers to quickly gain real insight from documents, irrespective of the language, the computer file format used, and whether documents contain machine print or cursive handwriting, or both. This is radically set to change how organisations cope with recognising and classifying millions of documents and then extracting and validating information without any manual intervention at all, thereby increasing productivity, accuracy and saving money.

Existing character recognition technologies have their limitations

To date, a vast amount has been invested deploying traditional recognition technologies such as optical character recognition (OCR), intelligent character recognition (ICR) and intelligent word recognition to analyse the content of documents and boost automation. It’s still very much a growth area. Research shows that the global OCR market is expected to reach $13.38 billion by 2025 – increasing at a CAGR of 13.7% from 2019.

Despite this, there are limitations. Many ICR/OCR engines struggle to process a mix of documents – encompassing structured, semi-structured and unstructured data – along with cursive handwriting, historical and old documents especially when the legibility of the paperwork is poor. The situation is exacerbated when volumes are high. And no one traditional ICR/OCR engine can seamlessly process a variety of languages – jumping from documents in English to Chinese, German and so on.

With such variability, correct read-rates drop markedly – it’s still tough to get more than 90-95% accuracy today – such that staff are required to then manually rekey information in. This is time consuming, costly and begs the question of whether enough trained employees are available to do it.

Of course, crowd-sourcing approaches are a good and cheaper work around than actually hiring people to enhance accuracy. Snippets of data are sent to online entry clerks logged into an Internet-based system who then check it prior to inputting it into line of business systems.

But the promise – and now reality – of AI is that these challenges are also resolved using powerful cognitive systems.

Venturers Club roundtable: why AI startups should focus on data privacy and trust

A recent Information Age and Venturers Club roundtable explored how AI startups can place data privacy and trust at the heart of their operations. Read here

AI-powered solutions are available today for document processing

Utilising neural networks, AI-driven document processing platforms offer a leapfrog advance over traditional recognition technologies. At the outset, a system is ‘trained’ so that a consolidated core knowledge base is created about a particular (spoken) language, form and/or document type. In AI jargon, this is known as the ‘inference’. This knowledge base then expands and grows over time as more and more information is fed into the system and it self-learns – able to recognise documents and their contents as they arrive.

This is achieved given a feedback ‘re-training loop’ is used – think of it as supervised learning overseen by a human – whereby errors in the system are corrected when they arise so that the inference (and the meta data underlying it) updates, learns and is able to then deal with similar situations on its own when they next appear.

It’s not dissimilar to how the human brain works, and how children learn a language. In other words, the more kids talk, make mistakes and are corrected, the better they get at speaking. The same is true with AI when applied to document analysis and processing. The inference becomes ever more knowledgeable and accurate.

AI-based systems can be trained to automatically recognise specific forms, review specific content and its layout on the page and then convert cursive handwriting into standard electronic formats such as PDF or JavaScript Object Notation (JSON) for analysis or workflow purposes with validation and verification also taking place. This can also be done at a field-based level so that key value extraction can be completed. Admittedly, this something that ICR/OCR systems can also do but they struggle to recognise cursive handwriting and require complex algorithms to find the fields.

Key value extraction on a form, for example, could be a generic box for ‘name’ or ‘age’ – the key – and then the specific values would be ‘Mr John Smith’ and ‘50’. Or on an invoice, the keys are items purchased and the values are the prices paid for each different one.

The benefits here are clear. Governments, healthcare providers, banks and insurance firms have to process a vast number of handwritten forms with identical formats for various purposes like questionnaires, applications, personal loans, mortgages or claims. Retrieving the handwritten information out of them and converting it into a digital format without human intervention reduces manual errors, lowers cost, allows big data analytics and makes turn around considerably faster.

And the speed of this AI-based processing is impressive. Anywhere up to 50,000 pages per hour can be completed using a single server – with bigger deployments and cloud delivery also possible when more compute power is added.

Loads of different common files can be ingested for analysis such as plain text, PDF, TIFF, JPEG, GIF, PPM, PNG and so on with several neural nets then reading the text and classifying the type – whether it be handwriting or machine print – with ‘fuzzy search’ aiding the text to digital conversion process. And class-leading AI systems – in addition to handling paper documents – are designed to cope with pictures, video and audio, too. Put another way, they are content agnostic and can handle any source content.

Why your big data dreams can’t come true without AI

Emilio Valdes, senior vice-president, EMEA & LATAM at Informatica, discusses the importance of AI in the big data journey. Read here

AI in practice

This is real stuff. One German insurance firm is working over the next six years to shift its entire claim process to use an AI-powered system such that claims under a certain value will be handled automatically based on information extracted, assessed and approved from a form with no human involvement required at all. This will be accomplished as the AI solution automatically checks the name, address, insurance number and other key details about a given incident – capturing all the data from the form correctly first time every time.

When it comes to document processing, seeing AI in action is impressive. It’s ‘wow’ magical stuff to watch a machine ‘read’ a scanned paper document and extract data from it.

One of the consequences of the Covid-19 pandemic and the economic fallout from it is that many companies will want to improve efficiency in a bid to save money. Those who have a significant cost and operational overhead processing forms and other documentation many feel a sense of corporate anxiety or even alarm about how to do this.

So don’t panic. AI has sufficiently matured such that it is now a real-world performant and reliable option for companies tasked with grappling and dealing with millions of paper documents.

Written by Ashley Keil, vice-president, sales EMEA and APAC at IBML

Editor's Choice

Editor's Choice consists of the best articles written by third parties and selected by our editors. You can contact us at timothy.adler at stubbenedge.com