VOOZH about

URL: https://thenewstack.io/indico-applies-ai-and-ml-to-challenge-of-unstructured-data/

⇱ Indico Applies AI and ML to Challenge of Unstructured Data - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2022-02-21 03:00:56
Indico Applies AI and ML to Challenge of Unstructured Data
in-depth-news,
AI / Data

Indico Applies AI and ML to Challenge of Unstructured Data

Indico Data uses AI and machine learning to automate, analyze, and apply previously unreachable unstructured data into enterprise workflows.
Feb 21st, 2022 3:00am by Jeffrey Burt
👁 Featued image for: Indico Applies AI and ML to Challenge of Unstructured Data
Featured image via Pixabay.

Data is often described as the new oil, the crucial resource that drives today’s companies and key to their ability to compete in the modern business world.

However, data also comes with its share of challenges, not the least of which is the massive amount that is being generated by enterprises. IDC analysts predict that in 2025, 175 zettabytes of data will be created, all of which needs to be collected, processed, stored and analyzed to quickly derive pertinent information that will drive business decisions.

This is further complicated by the fact that 80% to 90% of data being created is unstructured data, which includes everything from emails, texts and social media to images, videos and speech. Such data doesn’t fit into tables or other preset models and can’t be mapped into designated fields. It can’t be stored in traditional databases, making it more difficult than structured data to manage.

“One of the big challenges in unstructured is the heterogeneity of it,” Indico Data CEO Tom Wilde told The New Stack. “We think about it in terms of email, image [and] audio, and that stuff is in frame for unstructured that flows through any major enterprise today. It makes up a lot of critical front-, middle- and back-office workflows.”

Addressing Unstructured Data

👁 Image

Tom Wilde

IT in the past used tools like regular expressions and Boolean Logic — with relative success – to help machines better address unstructured data. The arrival of artificial intelligence (AI), machine learning (ML) and neural networks in recent years has changed the dynamics, according to Wilde.

“For the first time we can actually teach the machine by example instead of by instruction,” he said. “That’s a really profound shift. Now if I show the machine enough examples of what I’m trying to solve, it can learn through observation to solve it as a human would. That’s the good news. I don’t need to be technical to show examples. I just need to find a way to put those examples of products on the neural networks.”

That said, there are issues even with those emerging technologies: While machines armed with AI are smart learners, they’re also slow. Users may need to show it tens of thousands of examples before it can reach human proficiency and “now we’re back outside of something a business user can take on a job,” Wilde said.

Indico’s Unstructured Data Platform

👁 Image

Boston-based Indico Data, which was founded in 2014, is looking to address the issue with its Unstructured Data Platform, a collection of tools that leverage transfer learnings approaches, learning techniques and software agents to quickly create machine learning models that can be applied multiple times to pull value from unstructured data.

The company unveiled the platform in September 2021, delivering it as a service in a private cloud or hosted by Indico. It includes the company’s Indico Ignite services.

“What Indico has done here — and the real disruption is the founders who started the company … invented an approach using something called transfer learning, whereby a general approach to the problem could be repurposed over and over again as a bootstrap to solving the specific problem,” Wilde said. “Some people describe it as, we run the first 26 miles of the marathon and the customer runs the last .5 miles. We built this massive pre-trained understanding of language, of images, etc., as a giant generic bootstrap that allows the business user to create custom machine learning models sitting on top of that and train it how to do the specific task that they’re trying to solve. That’s the technical disruption that needs to be brought to the market.”

Custom Models Are Key

The CEO said the ability to create custom machine learning models with only a small amount of training is one of three key parts of the strategy. In addition, the application itself is essentially a point-click which makes it easy for customers to do such tasks as creating the models and presenting and labeling the training data. Keeping humans in the loop also is a key part.

“Oftentimes, customers will say to us, ‘How accurate is your solution?’” he said. “That’s the wrong question. The right question is, ‘How difficult is it for me to get to 100 percent accuracy?’ It’s a subtle difference but an important one. The key distinction there is human knowledge. Getting to 100 percent accuracy is both a machine learning problem and a user experience problem.”

The application allows the user to see what has been processed via custom models and understand where it’s lacking confidence in its prediction. The user then needs to understand the efficiency in finding the problem and remediating it.

“That all together is what solves this problem,” Wilde said. “That separates Indico: it’s all of those things combined. It’s not one of those things.”

A Growing Company

👁 Image

Brandi Corbello

Indico has raised $36 million, including $22 million in December 2020, which enabled it to grow its workforce and expand its channel reach. It now has 15 enterprise customers in such industries as financial services, insurance and commercial real estate, such as Cushman & Wakefield. The Chicago-based global real estate services firm — with about 50,000 employees in 400 offices across 60 countries — was investing in automation technologies as part of its larger digital transformation initiative.

The company had been successful leveraging robotic process automation (RPA) for structured data and wanted to address unstructured data that included millions of documents, emails and other text-based information.

Brandi Corbello was vice president of transformation at Cushman and Wakefield at the time.

“Typically what you’ve seen in the automation space is the automation leaders typically sitting in a function or in a business unit,” Corbello, who in January became Indico’s vice president of business development, told The New Stack. “They’re really only exposed to the problems sitting in that particular business unit or that function. I was in the position where I got to see all of the problems across the organization. What we were finding in a lot of our unstructured data problems was there were a lot of niche solutions or siloed solutions, with one solution that could only look at leases or one solution that could only look at invoices.”

Cushman and Wakefield’s Challenges

The company needed a strategy to solve the unstructured data problem across the organization, she said. The Indico solution enabled business process experts to build models and modify them when needed without having to involve IT or data scientists. The analytics in the offering identified relevant terms in a document in multiple documents and the platform could be used for myriad use cases and document types in disparate business units.

“Within each use case, we created custom models,” Corbello said. “My team knew one tool, so we had one tool in the platform vs. two to three, [and] our end users were actually quite comfortable with that tool, so the change management from a business perspective was low and we had good user experience. We were able to go after a multitude of use cases and not just focus in on one problem in the organization.”

The ability to create custom models also makes it easier for enterprises to adopt the technology, Wilde said. Addressing the first use case generally takes between 90 and 120 days, but subsequent use cases take about four to six weeks.

Once they’re up, once customers get the learning curve, they’re able to capture value at an accelerated rate,” he said.

Cushman and Wakefield used the Indico solution to address such use cases as lease analysis and procure-to-pay invoicing workflows and save 16,000 hours on its deal management initiative and accelerating the turnaround time on each deal by 70%.

“What was key for us at Cushman was we were able to operationalize the tool,” Corbello said. “We were able to embed it in those processes and within those businesses. It’s a true part of their operations, rather than it being a silo technology.”

TRENDING STORIES
Jeffrey Burt has been a journalist for more than three decades, the last 20-plus years covering technology. During more than 16 years with eWEEK and in the years since as a freelance tech journalist, he has covered everything from data...
Read more from Jeffrey Burt
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.