When senior decision-makers are asked what they look for when evaluating document automation solutions, their answers tend to converge around three core questions:
- Can the offering solve my problem?
- How easy is it to use?
- Can we scale this solution to meet future needs?
These are subjective questions that are hard to answer. Understandably, they often get proxied by readily available quantitative data that vendors are only too happy to provide: model benchmarks such as accuracy or F1 scores, processing times, uptimes and so on. While these metrics are useful for understanding the immediate, short-term efficacy of a solution, they provide little guidance on long-term utility and scalability.
In evaluating the long-term efficacy of a process automation solution, alternative considerations may be more useful.
Quality Awareness
Does the solution understand its own limitations?
In a situation in which the slightest mistake can result in millions, if not billions, in financial losses, model accuracy by itself means little if it falls short of 100%. At the same time, having every machine action double-checked by a human subject matter expert (SME) destroys any hope of high rates of straight-through processing or meaningful gains from automation.
The ideal solution should have a strong sense of its own ability to deliver results in a variety of circumstances, and be able to judiciously escalate to human SMEs.
The cost of a mistake should be carefully balanced against the cost of human inspection, with a circumstance-based self-predicted quality score driving the decision to escalate to humans.
Deep Dive: Quality awareness is measured in errors. Simply put, Type 1 errors are mistakes that the system categorizes as correct, and therefore represent risk. Type 2 errors are correct results that the system flags as mistakes, and therefore result in a manual check. As the cost of a mistake goes up, the number of Type 1 errors must come down. In most financial services use cases, clients should drive Type 1 errors as close to zero as possible, even at the cost of a slight increase in Type 2 errors, to ensure that risk is minimized.
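To make the trade-off concrete, here is a minimal sketch of confidence-based escalation, assuming the system exposes a self-predicted quality score per extracted result; the threshold value and field names are hypothetical, not drawn from any specific product.

```python
# Minimal sketch: route results by self-predicted confidence (hypothetical threshold).
from dataclasses import dataclass

@dataclass
class Extraction:
    field: str
    value: str
    confidence: float  # self-predicted quality score in [0, 1]

def route(extraction: Extraction, threshold: float = 0.98) -> str:
    """Accept high-confidence results; escalate the rest to a human SME.

    Raising the threshold pushes Type 1 errors (mistakes accepted as correct)
    toward zero, at the cost of more Type 2 errors (correct results sent for
    manual review). The threshold should balance the cost of a mistake
    against the cost of human inspection.
    """
    return "straight-through" if extraction.confidence >= threshold else "escalate-to-SME"

# Example: a trade-notional field read with 93% confidence gets escalated.
print(route(Extraction("notional", "USD 25,000,000", 0.93)))  # escalate-to-SME
```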
Context Awareness
Does the solution deeply understand the underlying context?
As the saying goes, an infinite number of monkeys, given infinite time at typewriters, will eventually produce the complete works of Shakespeare. The artificial intelligence analogy is that, given enough training examples and computational power, one can produce a high-accuracy model that appears to solve the problem at hand.
However, such overengineered solutions fail to stand the test of time: real-life instances start to drift away from historical examples, and retraining efforts fail to restore the solution to its promised quality levels.
A well-designed product should attempt to understand things and not just strings.
Once a solution can represent and persist contextual metadata, its failures over time become far more explainable, allowing for nuanced and careful retraining or reconfiguration. It also lets the solution's capabilities expand well beyond the original use case as business needs evolve.
Deep Dive: For example, a system that reads the words “John Smith” should know that this is not just text but the name of a person, and should recognize whether this person has been encountered by the system before. Further, it should enrich its knowledge of John Smith each time new information is acquired and, if multiple John Smiths are known to the system, cross-check each reference to make sure it points to the right one.
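As an illustration of “things, not strings,” the following sketch resolves a mention of “John Smith” against previously seen entities; the class names and matching heuristics are hypothetical and stand in for whatever entity-resolution approach a given product uses.

```python
# Illustrative sketch: resolve a name mention to a known entity using context,
# enriching the entity when new attributes arrive (hypothetical heuristics).
from dataclasses import dataclass, field

@dataclass
class Person:
    name: str
    attributes: dict = field(default_factory=dict)  # e.g. employer, account id

class EntityStore:
    def __init__(self):
        self.people: list[Person] = []

    def resolve(self, name: str, context: dict) -> Person:
        """Return the existing person this mention refers to, or create one.

        When several people share a name, contextual attributes (employer,
        account id, document source) disambiguate rather than matching on
        the string alone.
        """
        candidates = [p for p in self.people if p.name == name]
        for person in candidates:
            if any(person.attributes.get(k) == v for k, v in context.items()):
                person.attributes.update(context)  # enrich with new information
                return person
        person = Person(name, dict(context))
        self.people.append(person)
        return person

store = EntityStore()
a = store.resolve("John Smith", {"employer": "Acme Bank"})
b = store.resolve("John Smith", {"employer": "Acme Bank", "account": "123"})
c = store.resolve("John Smith", {"employer": "Globex"})
print(a is b, a is c)  # True False: same person enriched, a different John Smith kept apart
```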
Queryability
Beyond simplicity of use, can the solution intelligently guide users when things go wrong? Can the solution help decision-makers gain insight from data created and collected during process automation?
Having a simple and intuitive product that business users can pick up with little or no training is valuable. However, a more important and often overlooked criterion is the solution’s response to failure. Operational errors can be expensive, high-stress affairs. It is imperative that failures be discovered quickly and that users are guided to the next course of action. Further, decision-makers need to be able to easily access, analyze and glean insights from automation logs in order to evolve operational processes with the ever-changing landscape of business.
Deep Dive: As an example, consider a situation in which regulatory changes require financial services firms to have insurance coverage for a new risk event. One might then need to understand whether vendor, supplier and counterparty contracts cover such risks or need modification. This often takes a team of legal SMEs days, if not weeks, of manual review. Having someone crawl through thousands of contracts isn’t the answer. What is needed is the ability to query documents with a system that leverages context and synonyms, and can combine qualitative and quantitative metadata with linked data from other documents.
Queryability calls for a risk management system that is able to understand and intelligently query contractual documents at scale to categorize and escalate noncompliant contracts.
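A minimal sketch of such a query, under the assumption that clauses and metadata have already been extracted from each contract, might look like the following; the synonym list, field names and compliance rule are illustrative only.

```python
# Hypothetical sketch: query contracts by meaning rather than exact words.
# A risk term also matches its listed synonyms, and metadata filters the scope.
from dataclasses import dataclass

@dataclass
class Contract:
    counterparty: str
    coverage_clauses: list[str]  # clause text extracted from the document
    metadata: dict               # e.g. {"jurisdiction": "UK"}

# Synonym sets would normally come from a domain ontology; these are examples.
SYNONYMS = {"cyber incident": {"cyber incident", "cyber attack", "data breach"}}

def find_noncompliant(contracts: list[Contract], risk_term: str, jurisdiction: str) -> list[Contract]:
    """Flag contracts in a jurisdiction whose clauses never mention the risk
    (or any synonym), so they can be escalated for legal review."""
    terms = SYNONYMS.get(risk_term, {risk_term})
    flagged = []
    for c in contracts:
        if c.metadata.get("jurisdiction") != jurisdiction:
            continue
        text = " ".join(c.coverage_clauses).lower()
        if not any(t in text for t in terms):
            flagged.append(c)
    return flagged

contracts = [
    Contract("Acme Ltd", ["Coverage includes losses arising from a data breach."], {"jurisdiction": "UK"}),
    Contract("Globex plc", ["Coverage limited to property damage."], {"jurisdiction": "UK"}),
]
print([c.counterparty for c in find_noncompliant(contracts, "cyber incident", "UK")])  # ['Globex plc']
```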
What technology can automate today is astounding, surpassed only by what it promises to deliver tomorrow. Asking the right questions and evaluating solutions for long-term value creation can help businesses gain a significant edge over their competitors in the coming decade.

A veteran of the financial services industry, Prashant Vijay is currently chief executive at Romulus, which specializes in building software products that automate document-heavy operations in the financial services industry. He has spent more than two decades working at the intersection of technology and data across multiple roles and geographies. His views are informed by his experience in tech and business roles at Goldman Sachs, and his sales, product and business management roles at IHS Markit.