REPORT
Can You Teach AI to Understand Bank Statements?
We tested every major frontier model on real South African bank statements. The findings will change how you think about financial AI. Here is what 18 months of empirical research revealed.
8.5 min: average frontier model processing time per statement
15-25%: accuracy drop in production vs. benchmark
R1.9M+: monthly API cost at 65,000 statements
Read by CTOs, CROs, and Credit Executives across South African financial services
1. Hallucination Is Structural, Not a Bug
We demonstrate that hallucination is inherent to transformer architectures and show exactly how it manifests on South African bank statements.
2. The Real Cost of Using Frontier Models at Scale
At R219 per extraction and 8.5 minutes per statement, the economics collapse fast. We show the full cost picture at 65,000 statements per month, including what the benchmarks deliberately hide.
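To make that scaling concrete, here is a minimal sketch of the volume arithmetic. The 65,000-statement volume, 8.5-minute latency, and R219 per-extraction rate are the figures quoted in this section; everything else, including the function name, is illustrative.

```python
def monthly_footprint(statements_per_month: int,
                      minutes_per_statement: float,
                      cost_per_statement_zar: float) -> dict:
    """Scale per-statement latency and cost up to a monthly volume."""
    return {
        # Total spend for the month at the given per-statement rate.
        "monthly_cost_zar": statements_per_month * cost_per_statement_zar,
        # Sequential wall-clock hours; parallel workers spread the time
        # across machines, but not the cost.
        "processing_hours": statements_per_month * minutes_per_statement / 60,
    }

# The report's headline figures: 65,000 statements/month, 8.5 minutes each,
# R219 per extraction.
print(monthly_footprint(65_000, 8.5, cost_per_statement_zar=219.0))
# -> roughly R14.2M of extraction spend and ~9,200 machine-hours per month
```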
3. Trust Is an Architecture Problem, Not a Model Problem
The path to regulated-grade accuracy is not a better model. We explain the parallel consensus architecture that achieves 95 to 99% production accuracy, and why it matters under the National Credit Act.
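The report covers the architecture in detail; as a rough intuition only, a parallel consensus check can be sketched as below, where several independent extractors read the same statement and only fields they agree on pass through automatically. The function names, the voting rule, and the two-vote threshold are illustrative assumptions, not the report's implementation.

```python
from collections import Counter
from typing import Callable

# An extractor maps raw statement bytes to a flat {field_name: value} dict.
Extractor = Callable[[bytes], dict]

def consensus_extract(statement: bytes,
                      extractors: list[Extractor],
                      min_agreement: int = 2) -> tuple[dict, dict]:
    """Run several independent extractors and keep only the fields on which
    at least `min_agreement` of them return the same value.

    Returns (accepted, disputed). Disputed fields are routed to human review
    instead of passing a single model's guess downstream unchecked.
    """
    results = [extract(statement) for extract in extractors]  # can run concurrently
    all_fields = {field for result in results for field in result}

    accepted, disputed = {}, {}
    for field in all_fields:
        votes = Counter(result[field] for result in results if field in result)
        value, count = votes.most_common(1)[0]
        if count >= min_agreement:
            accepted[field] = value
        else:
            disputed[field] = dict(votes)  # keep every candidate for the reviewer
    return accepted, disputed
```

The design choice that matters for a regulated workflow is that disagreement becomes an explicit review queue rather than a silent error rate.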
What You Will Walk Away With
The clearest picture of AI in regulated financial services available today
The empirical benchmark data from testing GPT-5.5 Pro, Claude Opus 4.7, and Gemini 3.1 Pro on real SA bank statements
A side-by-side comparison of three extraction architectures and their risk, cost, and compliance implications
The four validation layers every regulated financial AI deployment must have
Strategic recommendations tailored to customer-facing versus background processing workflows
The cost-per-error framework that replaces the cost-per-page conversation with your board (see the sketch below)
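On that last point, the sketch below illustrates the cost-per-error framing in its simplest form: total cost is the per-page processing spend plus the expected downstream cost of the errors that slip through. Every number in it is a placeholder assumption for illustration, not a figure from the report.

```python
def total_monthly_cost(pages: int, cost_per_page: float,
                       error_rate: float, cost_per_error: float) -> float:
    """Per-page processing spend plus the expected cost of undetected errors."""
    return pages * cost_per_page + pages * error_rate * cost_per_error

# Placeholder assumptions only: a cheap pipeline with a high error rate versus
# a more expensive pipeline with a low one, at the same downstream cost per error.
PAGES = 65_000
cheap = total_monthly_cost(PAGES, cost_per_page=2.00, error_rate=0.20, cost_per_error=500.0)
careful = total_monthly_cost(PAGES, cost_per_page=10.00, error_rate=0.02, cost_per_error=500.0)
print(f"cheap per page:  R{cheap:,.0f} per month")
print(f"higher per page: R{careful:,.0f} per month")
```

Under these hypothetical inputs the per-page price is the smaller term, which is the point of the framing.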
