There is no doubt that automation is changing how we interact with the world around us. You only need to look around your home to see examples of automation affecting how you do things every day. Back in the early days, the act of preparing a meal was a long, laborious process – washing, chopping, blending, mixing, or cooking was done by hand and the process would require the full attention of the cook. Nowadays, what you see is a set of automation tools around the kitchen – such as a blender, drink mixer, oven, microwave, rice cooker, slow cooker, and pressure cooker – which quickly simplifies and reduces the effort and attention required so that the cook can now focus on something else – such as looking after a child, working on a hobby, or tending to the garden.
In the business world, this is no different. A recent survey done by McKinsey showed that the COVID-19 pandemic has sped up the adoption of digital technologies by several years. A core aspect of digital transformation is to introduce automation to streamline the workflow and decision-making process of the business’s employees.
Similarly, another survey was done by the IBM Institute of Business Value, where 1,500 executives from around the world were surveyed as part of the Future of Work Study in 2020. Here, the correspondents indicated that in 3 years, the two most important elements to gain competitive advantages would be customer experience and workforce skills. When asked about where they would be invested, more than half planned to increase AI investments, 44 percent to increase the usage of robotics, and over 73 percent expected that implementing Intelligent Automation (IA) would result in revenue growth.
In this chapter, we’ll be covering the following topics:
Broadly speaking, IA refers to the use of automation technologies such as Robotic Process Automation (RPA), Business Process Management (BPM), and Artificial Intelligence (AI) together to streamline and scale workflow and decision-making across organizations. You might also hear about the term hyperautomation, which was defined by Gartner [3], where they refer to it as a business-driven, disciplined approach that organizations use to automate as many business and IT processes as possible. In a way, IA is often used within the context of the overall hyperautomation initiative of an organization. [4]
We believe that there are three key motivations driving the need for IA:
To create a successful IA project, we believe an integrated automation platform that supports a low-code/no-code approach to automation is necessary to engage everyone in the enterprise to create a usable and scalable solution. This is where IBM Cloud Pak for Business Automation comes in.
IBM Cloud Pak for Business Automation [5] is a set of integrated software that will allow organizations to accelerate their IA and hyperautomation journey. There are multiple entry points for Cloud Pak for Business Automation – whether it is Workflow, Enterprise Content Management, Data Capture, Decisions, Business Apps, RPAs, or Process Mining.
Customers do not have to use all of them, but they can start with any one of them to complement what they already have in place today. With the focus on low-code/no-code, increasing use of built-in AI, and the integrated development experience, we should be able to identify the business impact alongside the time to value.
The following figure is a high-level component overview of Cloud Pak. Cloud Pak is built upon Red Hat OpenShift Kubernetes as the foundational technology that allows it to run in a multi-cloud environment. We then provide a set of foundational technology – RPA, Natural Language Processing, Process and Task Mining, Event Management, Machine Learning, third-party integrations, and a comprehensive set of operational models – to support a set of core automation and accelerators such as Workflow, Decision Management, Content Services and Document Processing, and Operational Intelligence. To complement all of these capabilities, we provide a low-code application design environment where customers can build automation applications for their business users:
Figure 1.1 – Component overview of Cloud Pak for Business Automation
The following table shows the key features that will be covered in this book:
Capabilities |
Product |
Description |
Digital labor |
RPA |
Desktop automation combined with human-in-the-loop |
Workflow |
Business automation workflow |
Low-code building of content-centric case, human-centric process, and high volume straight-through-process applications |
Workstream services |
No-code experience for configuring dynamic workstreams and workstream services | |
Workflow process service |
A lightweight full-featured embeddable low-code workflow microservice | |
Decision |
Operational decision management |
Discover, capture, analyze, automate, and govern rules-based business decisions |
Decision services |
Low-code decision with built-in AI integration | |
Content |
FileNet |
Enterprise content management, including transactional content processing, document management, content consolidation, compliance, and governance |
Capture |
Datacap |
Multi-channel capture, recognition, and classification of business documents and information extraction |
Document processing |
Extraction and classification of data in your documents with built-in AI | |
Apps |
Application designer |
Low-code application builder that you can use to create, modify, and refine business automation-based UI applications |
Operational intelligence |
Automation insights |
Provide visualization insights to business owners that feed the data lake to infuse artificial intelligence into IBM Digital Business Automation |
Task and process mining |
Process mining |
Discover hidden tasks and processes with simulation to jump-start your automation journey |
Table 1.1 – Cloud Pak for Business Automation capabilities
There are other capabilities, such as IBM Content Collector for SAP Applications or IBM Enterprise Records, that are designed for specific content management use cases. For more information about those capabilities, please refer to the IBM Cloud Pak for Business Automation Knowledge Center (https://www.ibm.com/docs/en/cloud-paks/cp-biz-automation/21.0.x?topic=automation-using-cloud-pak-business-software) for details.
A large part of any human-centric process automation is related to work that will need to be performed by humans. This would be knowledge-based work, such as providing a medical diagnosis for a symptom, or repetitive work, such as copying information from one website to another. The purpose of Robotic Process Automation (RPA) is to automate as much repetitive work as possible, oftentimes with the help of AI.
While we assume that in today’s digital world many repetitive tasks have already been automated, there is still a large portion of the daily work in an organization that will require manual labor. In many instances, these kinds of tasks are situational or occasional, while some of them are performed daily. Imagine a data clerk responsible for processing incoming invoices (by email or fax) into an order processing system (such as SAP) or a consultant that will need to enter their project activities on a time tracking system for billing purposes.
RPA is a program (in this case, it is a software robot) to mimic human users’ interaction with their desktops to perform tasks. The RPA bot performs mouse clicks and keyboard strokes on the computer screen to enter the information into the ordering application. One question that is sometimes asked is how RPA is different than other API-based integration techniques (for example, popular personal automation platforms such as IFTTT, integration-centric approaches such as SOA or microservices, or script-based automation such as Ansible runbook). One key difference between RPA- and API-based integration approaches is that RPA is not only limited to the command line or API, but also the user interfaces.
Despite the advances in the API economy in the last decades, there are still many legacy business applications (for example, CICS, IMS, and SAP) or native applications (for example, Windows-based) that do not provide modern APIs or command lines. In some cases, the user just doesn’t have access to the APIs (imagine you’re using a third-party web-based application, such as a banking website or an online bookstore) – since the chances of legacy business applications giving regular users access to their backend API are very small.
For many years, workflow, or Business Process Management (BPM), has been the starting point of many IA projects. Lately, the term workflow has become more popular as it implies a broader definition for the sequencing of activities – whether the activities will be performed by humans (Task Workers or Knowledge Workers), robots (RPA bots), or system services.
There are also three major workflow processes:
Depending on the use cases or maturity of the business process, we could start with a non-deterministic knowledge-driven case-centric solution, and as the process matured, it might evolve into a more deterministic process-centric straight-through process.
There are thousands of decisions made by all of us every day – even simple things such as what is the next word to use in an email or the route to take when driving to work. In the business world, a decision could be used to determine the risk score of a borrower or the premium of a life insurance quote. The ability to automate decision-making is a key approach to speeding up and also adding intelligence to your automation.
Automation Decision Services is a low-code experience that allows business users to build and automate their decisions. Traditionally, decisions were created through a combination of business rules and decision tables. More recently, we also combine the use of AI/ML to combine deterministic rules together with predictive rules.
Documents are still the primary means where information is exchanged. In 2020, Statista estimated there were 306.4 billion emails sent and received per day. At some point in your IA journey, we must tackle the issue of automating how these documents/contents are stored, updated, accessed, processed, archived, and deleted.
Closely related to content management is how to extract the information from the documents that are managed by the content management system. Typically, the optical character recognition (OCR) technique can help with identifying the characters that are buried within a document but understanding that the sequence of numbers is an invoice number is a different matter.
In a well-structured document such as a tax form or a questionnaire with predefined boxes, it is relatively easier to identify the meaning of certain fields. However, many of the documents that require human processing would fall into the class of semi-structured documents (for example, invoices or bills of lading) or unstructured documents (for example, contracts, emails, and memos). The ability to capture information from all document types (structured, semi-structured, and unstructured) will be a key attribute in automating the entire workflow. In Automation Document Processing, we make use of various AI techniques to allow knowledge workers to teach the system visually to recognize information from documents without having to write a single line of code.
Before you can create an IA solution, first, you must understand the business problem and how it comes to be. There are two approaches to process discovery:
It is also often useful to apply both techniques, first by using a process documentation tool to describe the perceived business process and then by using a process mining tool to discover the actual process. By comparing the two, you can more easily identify existing process discrepancies and potential business-compliant issues.
In any IA solution, time to value is paramount. Additionally, this means we must have a way to bring all the automation to the hands of the business users, independent of whether they are working from the office or remotely using their mobile devices. The ability to create business applications and leverage low-code techniques will allow your automation developers to create both strategic and situational applications and make them available to your business populations in days versus months.
In any IA solution, intelligence can come in one of two ways:
Your business processes or bots can create a large amount of business data, and many of them capture the decisions made by the task or knowledge workers as they perform their work. The operational intelligence component in IBM Cloud Pak for Business Automation (CP4BA) provides a data lake where all the data can be stored and then subsequently analyzed and used to create predictions or recommendations.
CP4BA differentiates itself in the market through four key different aspects:
The following section will explain, at a high level, how to design an IA solution.
When starting an IA project, it is important that we guide the stakeholders (often, these are the line of business owners or IT executives) to focus more on the intended business outcome rather than a specific set of technology. The intended business outcome can be to improve the customer engagement experience, speed up the time to value of existing projects, or increase the agility of the business in response to changing business conditions. By focusing on the business outcome, we can help LOBs to better define the business problems rather than just a particular solution upfront.
In this section, we will discuss how to go about the methodology and considerations to create a solution architecture for your IA solution.
An IA solution cannot be truly successful until we get buy-ins from both the business and IT sides of the organization. To get those buy-ins, enterprise architects can start with a business analysis and utilize an approach such as IBM Design Thinking to bring both business and IT together.
There are several important principles that enterprise architects and automation developers can use to create an initial concept of a solution:
This book will not go into detail about how you can apply Design Thinking to your IA solutions. For those who are interested, there are reference materials available on the IBM Enterprise Design Thinking website (https://www.ibm.com/design/thinking/).
To begin the solution design, a high-level understanding of the CP4BA architecture and its use cases would be beneficial. The CP4BA architecture is designed with modularity in mind – you can start with just one capability, for example, workflow or RPA, and add more as the use cases expand. As you add each capability, new integrated use cases will become available:
Figure 1.2 – The CP4BA logical architecture
The following table explains each use case:
Use case | |
1 |
Workflow application stores and updates documents in content management |
2 |
Workflow application makes use of Decisions to automate decision making |
3 |
Robot claims and works on tasks using desktop automation |
4 |
Robot retrieves and updates documents in content management |
5 |
Robots make use of Decisions to automate decision making |
6 |
Robots make use of document processing to extract information from documents |
7 |
Process mining creates Workflow processes through discovery |
8 |
Process mining creates robots through task mining |
9 |
Business application engages business users to perform work on workflow, content management, or decision making |
10 |
Workflow and Decisions send business events to Event Bus to be used by automation insights or other customers’ systems |
11 |
Automation Insights subscribes to business events and performs analysis and predictions |
Table 1.2 – Examples of CP4BA use cases
To start putting together the solution architecture, let’s go through an end-to-end scenario, where multiple pieces of these solutions are used together.
In this scenario, we are going to implement an IA solution to onboard clients for a fictitious national financial consulting company. The solution consists of a business application, where the client representative collects all the required information and the requested services about the client before the representative will submit the request to the back office for further processing:
Figure 1.3 – Client onboarding scenario with corresponding key personas
Before we can design the architecture of the solution, first, we will need to break down the business requirements into a set of hills. In this example, we are going to start with three hills. Each hill should speak to the desired business outcome that we want to achieve as a part of the automation solution:
You might notice that the preceding hills do not mention any technology at all, but instead, we are trying to focus on the business values – in this case, we want to make sure any new client can be onboarded within 2 business days.
In this case, we have identified four personas:
As a solution designer, you will have to go through one or more playback sessions with the participants – in this case, with the client representatives, account managers, and account executives to roughly identify the outline of your solution.
We will be creating a business application to help the client representative collect all the information required, such as the contact details and the services required, and even offer a potential quote. Once all of the information has been collected, the client representative will then submit the request for further validation and approval, and to schedule the actual service date with the client.
Through the design session, it was also decided that the business application will have three steps:
Figure 1.4 – Hill 1 high-level solution architecture
Within each step, we will also be using other automation services to gather the necessary information:
In this hill, we will focus on the work required by the account manager to review the submitted information from the front office, approve the request, and schedule the requested services with the client. If the request cannot be approved, both the client and client representative will be informed:
Figure 1.5 – Hill 2 high-level solution architecture
The entire process will be orchestrated by a workflow, where multiple automation services will be used together to complete the client onboarding process.
In this final hill, we will be focused on the experience of the account executives. On a regular basis, the account executive should be able to inspect the business performance of the organization, and make adjustments (for example, re-balance the workload of different account managers) or suggest changes in the scoring model:
Figure 1.6 – Hill 3 high-level solution architecture
There are three data sources that we can use to draw information from:
Since we usually focused more on the business outcomes during the initial hill discussion, one aspect that we sometimes miss during the initial hill discussion is the Technical Foundation hill. Technical Foundation is where we lay down the non-functional requirements of the solutions. In this particular use case, the following aspects are important:
Many of these questions will have a direct impact on the deployment topology and the overall cost of the solution. While everyone might want to be able to support thousands of concurrent users and want the system to be up 100 percent of the time, it does not come for free. We will discuss this further in Part 3, Deployment Considerations, of the book, where we will go deeper into the deployment topology and architecture of an IA solution.
In this chapter, we discussed the promise of IA and how an automation solution can be realized with IBM CP4BA. We provided a brief overview of the key capabilities in CP4BA. Also, we briefly discussed how to leverage the Design Thinking approach to break down the initial automation problem into a set of hills, and how we can come up with different solution architecture for each hill.
In the next chapter, we will go into future details on the key capabilities of RPA, Workflow, Decisions, and Business Applications.