1.2

How the Layers Work Together

10 min

The four layers of the stack rarely operate in isolation. Real professional AI systems typically combine layers, with each layer contributing what it is best suited to produce. Understanding how the layers combine is useful for practitioners because most of the tools they will encounter are combinations rather than pure examples of a single layer, and the combination shapes what the tool can and cannot do.

A useful way to think about the combination is that each layer plays a different role. Automation executes actions and produces the activity records that later layers depend on. Rule-based AI encodes explicit policy and makes structured decisions. Machine learning identifies patterns in historical data and produces predictions that inform decisions. Deep learning interprets unstructured material (text, images, audio, documents) and produces structured outputs that the other layers can act on. When the four layers work together, they produce systems that can execute reliably, decide within explicit policies, learn from experience, and handle the messy unstructured inputs of real professional work.

Consider how a modern loan application process might be handled by a system that uses all four layers. Automation receives the application through a web form, creates a record in the underwriting system, sends confirmation to the applicant, and triggers the downstream workflow. Deep learning reads the applicant's uploaded bank statements and payslips, extracting the specific fields that matter for credit assessment (income, employer, account balances, payment history) from the unstructured documents. Machine learning takes the extracted fields, combines them with other available data (credit bureau information, the applicant's history with the firm), and produces a calibrated probability that the applicant will repay on time. Rule-based AI applies the lender's policies to the risk score and the extracted details, enforcing hard stops (regulatory requirements, fraud indicators, prohibited jurisdictions) and routing the decision (automatic approval if the score is high and nothing else flags, human underwriter review if the case is in the middle band, automatic decline if the score is low). Automation then executes the decision, notifying the applicant, updating the record, and triggering whatever follow-on workflows are required.

The layers share work through structured interfaces. Automation produces event records in a standard format. Deep learning produces structured extractions (named entities, specific fields, classifications) that machine learning and rule-based AI can consume as inputs. Machine learning produces scores and probabilities that rule-based AI can evaluate against thresholds. Rule-based AI produces decisions that automation can execute. Each layer takes inputs from the layers that produce information it needs and produces outputs that the next stage of the workflow consumes.

A second example from a different professional domain shows how the same layered architecture applies across different kinds of work. Consider a contract review system that a legal team uses to process incoming vendor agreements. Automation receives the contract through the firm's document intake system, creates a matter record, assigns a workflow state, and notifies the responsible lawyer that a new contract has arrived. Deep learning reads the full text of the contract and extracts structured information from it (the parties, the term length, the governing law, the liability caps, the termination provisions, the payment terms), producing a structured summary of the contract's key provisions. Machine learning compares the extracted provisions against the firm's historical contract database, identifying provisions that deviate from the firm's typical patterns (unusually long term lengths, liability structures that differ from the firm's standard, payment terms the firm has previously negotiated against) and flagging them for closer review. Rule-based AI applies the firm's explicit policies to the extracted provisions, identifying hard issues that always require attention (certain jurisdictions the firm avoids, specific clauses the firm never accepts, contract values above the partner's signing authority). Automation then presents the review dashboard to the responsible lawyer, with the deep learning extraction at the top, the machine learning flags next, the rule-based policy issues highlighted, and a suggested redline draft ready for the lawyer's judgment.

This example shares the same four-layer structure as the loan application example but operates in an entirely different domain. The specific work each layer does changes based on the task. The overall architecture holds because it reflects the distinct capabilities of each layer. Automation handles movement and workflow. Deep learning handles the unstructured input (the contract text). Machine learning handles pattern recognition (comparison against historical examples). Rule-based AI handles explicit policy. Humans handle judgment. This pattern recurs across professional work because it reflects a fit between the available technology and the distinct cognitive requirements of different parts of a professional task.

Humans review the outputs of each layer where judgment is required. They approve decisions that rule-based AI routes for review rather than automatic handling. They correct extractions from deep learning where accuracy matters for the specific case. They update policies in the rule-based layer when the business rules change. They validate the training data that machine learning draws from and approve retraining cycles when the model is updated. The four-layer system does not remove humans from professional work. It focuses human attention on the decisions that require judgment, while handling the routine work at the layer best suited to it.

The different layers also have different failure modes, which matter for how humans review the system's outputs. Automation fails loudly when its inputs are outside its expected patterns, and the failure is usually easy to detect. Rule-based AI fails silently when the rules do not cover a situation they should have covered, and the failure can be hard to detect without domain expertise. Machine learning fails in ways that depend on the training data and can be hard to detect for individual cases (a particular prediction may be wrong without the system flagging it). Deep learning, and large language models specifically, fails through hallucination and through confident outputs that are not factually grounded, which is the hardest failure mode to detect because the outputs look normal. The professional evaluation disciplines developed later in the programme address these different failure modes in different ways.

Understanding the layered architecture also helps practitioners communicate with technical teams about what a given tool is actually doing. When a vendor describes a product as "AI-powered," the practitioner can ask which layers the product uses, what happens at each layer, and how the layers combine. A product that is mostly automation with a small machine learning component has different properties from a product that is primarily a large language model with some rule-based controls around it. The marketing language rarely makes this distinction. The technical details, when examined, typically do. A practitioner who can ask intelligent questions about a tool's architecture can evaluate the tool's likely strengths and weaknesses before committing to use it.