Auditability

Making a financial system provably trustworthy through immutable trails, tamper evidence, and the ability to reconstruct any past state.

intermediate 46 min read in Ledger Foundations

audit-trailauditabilityevent-sourcingimmutabilityregulatory-recordkeepingtamper-evidence

Learning outcomes

Every financial system you build will eventually be asked a question it did not expect: what exactly happened to this account on this day, who touched it, and can you prove the answer has not been changed since. That question comes from a customer dispute, an internal investigation, an external auditor, or a regulator running an examination, and the system either has a clean, provable answer or it does not. Auditability is the property that determines which. It is not a feature you bolt on at the end. It is a structural commitment you make at the beginning, and almost all of it follows from one idea: you do not overwrite the past.

After studying this page, you can:

Say what makes a financial system auditable, and explain why immutability is the foundation that every other guarantee rests on.
Distinguish the audit trail from the ledger itself, and explain what each one records and why you need both.
Describe append-only logs and event sourcing as the substrate of auditability, and how a write-ahead log gives you the same property at the storage layer.
Explain who-did-what-when access logging, and why capturing the actor, the action, the time, and the context is as important as capturing the data change.
Explain how hash chains, WORM storage, and cryptographic sealing produce tamper evidence, and state precisely how tamper-evident differs from tamper-proof.
Reconstruct the state of an account as of any past moment by replaying an immutable log, and say why that capability is the real test of an audit trail.
Name the regulatory drivers that make all of this mandatory in the United States, including SEC Rules 17a-3 and 17a-4, WORM retention, and SOX internal-control requirements, and explain what each one actually demands.
Diagnose the ways an audit trail fails in production: gaps, mutable history, missing context, and clock disagreement, and the controls that catch each one.

Before we dive in

You do not need an audit or compliance background to start. We will use a small set of words and define each one the first time it appears.

A record is any piece of information a financial firm is required to keep: a trade, an order, a customer communication, a configuration change. An audit trail is the ordered history of events that lets you reconstruct what happened and prove it, separate from the current state of the data. Immutability means a record, once written, is never changed or deleted in place; corrections are made by writing new records, not by editing old ones. Tamper evidence is the ability to detect after the fact that data was altered; tamper resistance or being tamper-proof is the much stronger and rarely achievable property of preventing alteration outright. WORM, write once read many, is storage that physically or logically forbids overwriting a record once written. A hash is a fixed-length fingerprint of some data, computed so that any change to the data changes the fingerprint; we use it to seal records. Provenance is the documented origin and chain of custody of a record: where it came from and every hand it passed through.

Two regulators and one law recur throughout. The SEC is the US Securities and Exchange Commission. FINRA is the Financial Industry Regulatory Authority, the self-regulatory body that examines broker-dealers. SOX is the Sarbanes-Oxley Act of 2002, the law that made executives personally accountable for the integrity of financial reporting and its controls. Hold the picture of a record that is written once, sealed so any later change would show, and kept where no one can quietly overwrite it. Everything below is an elaboration of that picture.

Mental Model

The wrong model, and it is a comfortable one, is that auditability means keeping good logs. You imagine a log file the application writes to, maybe shipped to a search system, and you reason that as long as you log enough, you are auditable. Logging feels like the whole job.

It is not, and the gap is exactly where audits fail. An ordinary application log is written by the same process that mutates the data, stored in a system where an operator with the right access can edit or delete entries, and rotated or expired on a schedule designed for debugging, not for proof. It answers “what did the program roughly do recently” for an engineer. It does not answer “what happened to this customer’s money on this date, who authorized it, and can you prove this answer is the same one the record had a year ago when the event occurred.” The first is a convenience. The second is evidence, and evidence has requirements that convenience does not: it must be complete (no gaps), ordered (you can place events in sequence), attributed (each event names its actor), durable (kept for years under a retention rule), and tamper-evident (any later change is detectable).

Here is the model to hold instead. Picture a court reporter sitting in the room as everything happens, writing every statement in indelible ink into a bound book with numbered pages, where each page is stamped with a fingerprint of the page before it. No page can be torn out without leaving a stub and breaking the sequence of stamps. No line can be erased; a correction is a new line that references the old one. The reporter does not decide what is important and summarize; they record everything, in order, with who said it and when. Auditability is building that court reporter into your system. The logs your engineers read are the rough notes; the audit trail is the bound book, and the whole discipline is making sure the book cannot be quietly rewritten.

Breaking it down

The core teaching runs in twelve steps. The first six build the technical substrate of auditability from first principles, the way an engineer must understand it. The next six connect it to the regulatory, operational, and organizational reality that forces those engineering choices, because the mechanisms are nothing without the obligations that make them mandatory and the audiences that test them.

1. What auditability actually means and why immutability is the foundation

Auditability is the property that lets an independent party reconstruct what happened in a system and trust the reconstruction. Pull that apart and it has two halves. The first is reconstructability: the system retained enough information, in enough order and detail, that you can rebuild the sequence of events and the state at any past moment. The second is trustworthiness: you can show the reconstruction reflects what actually happened and was not altered after the fact. A system that retains everything but lets anyone edit the history is not auditable, because the reconstruction proves nothing. A system that protects its records but throws most of them away is not auditable either, because there is nothing to reconstruct. You need both halves, and immutability is what makes the second one possible.

Immutability is the foundation because trust in a historical record reduces, ultimately, to one question: could this have been changed since it was written. If the answer is yes, every record is suspect, and an auditor has to fall back on other evidence to corroborate yours. If the answer is no, or more honestly, if any change would be detectable, then the record can stand on its own. You cannot get trustworthiness from a mutable store no matter how much you log, because the very act of logging happens in a place that can be edited. Immutability moves the records to a place, or a structure, where editing is either impossible or self-announcing. Everything else in auditability, the hashing, the WORM storage, the access trail, the retention rules, exists to establish, defend, or extend that one property: the past does not change.

Notice this is the same instinct that makes a ledger store immutable postings and derive balances rather than overwriting a balance in place. A ledger refuses to overwrite the truth so it can always reconstruct a past balance and so no one can silently rewrite money. Auditability generalizes that refusal from money to everything the firm must answer for. The ledger is auditable because it is append-only; auditability is what you get when you apply the same append-only discipline to the whole system of record and then prove it cryptographically.

The same correction, mutable history vs immutable history

A fee was posted at $25 by mistake; it should have been $15. An operator opens the record and edits the amount to $15. The data is now correct, but the history is gone: there is no trace that $25 was ever charged, who changed it, or when. A customer who screenshotted the $25 charge can no longer be answered from the system, and an auditor cannot tell an honest fix from a cover-up.

The Toggle is the whole chapter in miniature. Correctness of the current value is necessary but not sufficient. An auditable system also preserves how the value came to be correct, because the question an audit asks is almost never just “what is the value now” but “what was it, when did it change, who changed it, and can you prove that story has not itself been edited.”

2. The audit trail is not the ledger

Engineers new to this conflate two things that live at different layers, and the conflation causes real design mistakes, so we separate them carefully. The ledger records financial facts: postings, balances, the movement of value. The audit trail records what happened to the system and its records: who logged in, who approved a transaction, who changed a configuration, who exported a report, who edited a customer’s address. The ledger answers “what is the financial state and how did it get there.” The audit trail answers “what did people and processes do, to what, when, and on whose authority.”

They overlap but are not the same, and you need both. An append-only ledger is itself a form of audit trail for money: it preserves every posting in order, so you can reconstruct any balance. But the ledger does not, on its own, record that a back-office analyst manually adjusted a customer’s limit at 2 a.m., or that an admin exported the entire customer table, or that someone changed the interest-rate configuration that then drove a thousand correct-looking postings. Those are not financial movements; they are actions on the system, and they are exactly what an investigation needs. A ledger with no surrounding access trail can show you that money moved and that the books balanced, while telling you nothing about who caused it or whether they were allowed to.

flowchart TB
  subgraph trail["Audit trail: actions on the system"]
    AC["Logins, approvals,<br/>config changes,<br/>exports, manual edits<br/>(who, what, when, why)"]
  end
  subgraph ledger["Ledger: financial facts"]
    LG["Immutable postings,<br/>derived balances,<br/>movement of value"]
  end
  AC -->|"a manual rate change<br/>drives many postings"| LG
  LG -->|"a posting needs its<br/>authorizing action"| AC

The practical rule is that a fact in the ledger should be traceable to the action in the audit trail that caused it, and an action in the audit trail that touched money should be traceable to its effect in the ledger. When an examiner picks a suspicious posting and asks “why did this happen and who is responsible,” you answer by walking from the ledger entry to the audit-trail event that authorized it, and from there to the human or system that performed it. A firm that keeps a pristine immutable ledger but a thin, mutable, gap-ridden audit trail has solved half the problem and will fail the half the examiner actually probes.

3. Append-only logs and event sourcing as the substrate

The mechanical substrate underneath an audit trail is the append-only log: a sequence of records you only ever add to the end of, never modify or delete in the middle. Appending is the cheapest, simplest, and most defensible write pattern there is, and it gives auditability almost for free. Because you only append, the log is naturally ordered (later records come after earlier ones), naturally complete (nothing is removed), and naturally easy to seal (you can fingerprint each record as it lands without ever revisiting earlier ones). Every richer construct in this page is an append-only log with something added on top.

Event sourcing is the architectural pattern that makes the append-only log the source of truth for application state, not just a side record. Instead of storing the current state and mutating it, you store the sequence of events that produced it (account opened, deposit made, limit raised, account frozen) and derive the current state by folding over those events. The current state becomes a cache; the event log is the truth. This is auditability as a first-class architecture rather than an afterthought, because the history is not a copy of what happened, it is what happened, and the state is downstream of it. You can never have a state change with no corresponding event, because the event is what produced the change.

Three related constructs people often confuse

The write-ahead log deserves a note because engineers reach for it as evidence of auditability and it is only partly that. A database WAL records every change before it is applied, in order, append-only, for durability and crash recovery. That makes it a faithful change history at the storage layer, and you can mine it (this is what change-data-capture does) to feed an audit pipeline. But a WAL records physical or logical row changes, not business meaning. It tells you a row in the limits table changed from 1000 to 5000; it does not tell you which human raised the limit, through what workflow, citing what justification, under whose approval. A business audit trail must capture the actor and the intent, which live above the storage engine. The WAL is a useful source, not a substitute.

4. Who did what when and the access trail

The single most important content of an audit trail is the answer to who-did-what-when, and getting all four fields right on every recordable action is most of the work. A complete access-trail event names the actor (a specific authenticated identity, never just “system” or a shared service account if you can avoid it), the action (what they did, in business terms: approved a wire, changed a limit, exported a report), the target (the specific record or resource affected), the time (a trustworthy timestamp, which we will see is harder than it sounds), and ideally the context: the source IP or device, the session, the authorization that permitted it, and a reason where the workflow can capture one.

The reason all four-plus fields matter is that an audit is reconstruction, and reconstruction fails on the weakest field. A trail that records the data change but not the actor cannot answer accountability questions. A trail that records the actor but not the precise target cannot answer scope questions (“which accounts did this person access during the period in question”). A trail with unreliable timestamps cannot answer sequence questions, and sequence is often the whole case, because fraud and error are usually about something happening before or after something else. The discipline is to treat every privileged action, especially every manual override of an automated control, as a recordable event that captures actor, action, target, time, and reason at the moment it happens, synchronously, by the same code path that performs the action, so the trail cannot diverge from reality.

sequenceDiagram
  participant U as Analyst (actor)
  participant App as Application
  participant Auth as AuthZ check
  participant Trail as Append-only audit trail
  participant DB as Business data
  U->>App: Request: raise limit on account 4471 to 5000
  App->>Auth: Is this actor permitted?
  Auth-->>App: Permitted (role + approval)
  App->>Trail: Append event {actor, action, target, time, reason, approval}
  App->>DB: Apply change
  Note over Trail,DB: The trail is written in the same path as the change,<br/>so a change can never occur without its audit event.

A subtle but critical design point hides in that diagram: the audit event must be written in the same transactional path as the change it describes, so the two cannot diverge. If you write the audit event in a separate, best-effort call, then a crash between the two writes leaves you with a change and no record of it, or a record of a change that never happened. Both are audit failures. The robust pattern is to make the audit event part of the same atomic commit as the change (or to derive the audit trail from an event-sourced log where the event is the change), so completeness is structural, not hopeful.

Check yourself

A privileged admin action is recorded by a nightly batch job that scans the database for rows that changed that day. Why is this not a sound audit trail for who-did-what-when?

5. Tamper evidence with hash chains WORM and cryptographic sealing

Having captured complete, attributed, ordered events, you now have to defend them against later alteration, and there are two complementary strategies: make alteration detectable (tamper evidence) and make alteration physically hard (tamper resistance). Most real systems lean on the first, because it is cheaper and, done well, sufficient for proof.

The workhorse of tamper evidence is the hash chain. Compute a cryptographic hash (a fixed-length fingerprint where any input change changes the output unpredictably) of each record together with the hash of the previous record. Each record’s seal therefore depends on every record before it. Change one byte of any historical record and its hash changes; that breaks the next record’s hash, which breaks the next, and so on to the end of the chain. To alter one past record undetected, an attacker would have to recompute every hash after it, which is exactly what publishing or sealing the latest hash periodically prevents. This is the same structure a blockchain uses, and the same structure Git uses for commits, and you do not need a blockchain to get it: a single hash chain over an append-only log gives you tamper evidence directly.

Change one character and the seal changes completely

A log record's contents

SHA-256

Type into that box and watch the hash change unpredictably with every keystroke. That is the avalanche property doing the work: there is no small edit you can make to a sealed record that produces a small, plausible change to its fingerprint. Now picture each record’s hash folded into the next, and you have a chain where tampering anywhere downstream of a published checkpoint is detectable.

The animation shows the whole hash chain at once: a sequence of records, each sealed with a hash of itself and the previous seal, and a periodically published checkpoint that freezes the chain up to that point. Watch how editing an old record breaks every seal after it.

Tamper evidence tells you something was changed; it does not stop the change. The complementary defense is WORM storage, write once read many, which makes overwriting physically or logically impossible. Historically WORM was optical media you literally could not rewrite; today it is far more often software-enforced: storage with an immutability lock or legal-hold flag that the storage system itself refuses to override until a retention period expires, so even an administrator with full credentials cannot delete or modify the object early. WORM and hash chains are complementary: WORM raises the cost of tampering by removing the easy path to it, and the hash chain makes any tampering that does occur detectable.

Cryptographic sealing with digital signatures adds a third property: non-repudiation. A hash chain proves the data is internally consistent; a digital signature, where a trusted party signs the records or the checkpoint hashes with a private key, proves who attested to them and lets anyone verify it with the public key. A signed, periodically published checkpoint over a hash chain stored on WORM media is close to the strongest practical combination: complete (append-only), ordered (the chain), tamper-evident (the hashes), tamper-resistant (WORM), and attributable (the signature).

6. Reconstructing state as of any moment

The real test of an audit trail is not whether you can read it but whether you can answer the question that matters: what was the state of this thing at this past moment. This is point-in-time reconstruction, and it is where append-only logs and event sourcing pay off concretely. Because the log preserves every event in order, you reconstruct the state as of any timestamp by replaying every event up to that timestamp and stopping. The state as of last March 14th at noon is the fold over all events with a timestamp at or before that instant. Nothing is guessed; it is computed from the immutable record.

-- Reconstruct an account's balance as of a past instant by replaying
-- the immutable event log up to that timestamp and no further.
SELECT account_id,
       SUM(direction * amount) AS balance_as_of
FROM   postings
WHERE  account_id = $1
  AND  effective_at <= $2   -- the as-of moment
GROUP  BY account_id;

That query is the whole idea, and it generalizes far beyond balances. Any state that is derived from an append-only event log can be reconstructed as of any moment by filtering the log to events at or before that moment and folding. This is why auditors and investigators care so much about immutability: a system that overwrites state in place can only ever show you the present, and it has thrown away the ability to answer as-of questions about the past. A system that stores events and derives state can answer them on demand, which is precisely the capability an examination depends on.

Answering 'what was the limit on account 4471 on March 1' from an event log

Find the relevant eventsScan the append-only log for every event affecting account 4471's limit, in order: opened with limit 1000, raised to 5000 on Feb 10, raised to 8000 on March 15.

Step 1 of 4

A practical wrinkle separates two kinds of time, and serious systems track both. Effective time is when an event took effect in the business world; recording time (or system time) is when the system learned about it. A deposit dated for Friday but entered on Monday has a Friday effective time and a Monday recording time. This is bitemporal modeling, and it matters for audit because the honest answer to “what did the books show on Friday” can differ from “what do the books now say Friday’s state was,” once a back-dated correction lands. An auditable system can answer both, because it never overwrites: it records the correction as a new event with its own recording time and the effective time it applies to, and a query can reconstruct the books either as they were believed on a date or as they are now understood to have been.

7. The regulatory drivers that make this mandatory

Everything so far is good engineering, but in financial services it is also the law, and a senior engineer should know which rules force which mechanisms, because the rules are specific and examiners cite them by number. In the United States, the foundational books-and-records rules for broker-dealers are SEC Rules 17a-3 and 17a-4 under the Securities Exchange Act of 1934. Rule 17a-3 specifies what records a broker-dealer must make: blotters, ledgers, order tickets, account records, and more, capturing the firm’s activity. Rule 17a-4 specifies how long those records must be preserved and in what manner, and it is the rule that drives immutable storage. It requires many records to be kept for specified periods (commonly six years for some categories and three years for others, with the first portion in an easily accessible place), and crucially, where records are kept electronically, it has historically required them to be preserved in a non-rewriteable, non-erasable format, the regulatory definition of WORM. That single phrase is why broker-dealers buy WORM storage: a regulator wrote immutability into the rule.

The SEC amended Rule 17a-4 in 2022 to add an audit-trail alternative: a firm may use an electronic recordkeeping system that maintains a complete audit trail of changes (preserving the original and every modification, with the ability to recreate the original) instead of strictly non-rewriteable media, recognizing that a well-designed, tamper-evident audit trail achieves the same protective goal. This is the regulatory blessing of exactly the hash-chained, append-only approach this page describes: you can satisfy the rule either by storing on WORM or by proving, through a complete and verifiable audit trail, that no record was altered without a preserved history.

flowchart LR
  E17a3["SEC Rule 17a-3<br/>WHICH records<br/>to make"] --> Keep["The firm's<br/>books and records"]
  Keep --> E17a4["SEC Rule 17a-4<br/>HOW LONG and<br/>HOW to preserve"]
  E17a4 --> WORM["Non-rewriteable<br/>non-erasable (WORM)"]
  E17a4 --> ATA["OR a complete<br/>audit-trail system<br/>(2022 amendment)"]
  SOX["Sarbanes-Oxley<br/>internal controls<br/>over reporting"] --> Keep

A different driver operates on public companies generally, not just broker-dealers: the Sarbanes-Oxley Act of 2002, enacted after the Enron and WorldCom accounting scandals. SOX Section 404 requires management to assess and report on the effectiveness of internal control over financial reporting, and the external auditor to attest to it. Auditability is the technical backbone of that requirement: to prove a control is effective, you must show that every transaction affecting the financial statements is captured, that changes are tracked with who-did-what-when, that nobody can alter records without a trail, and that access to financial systems is controlled and logged. SOX Section 802 separately makes it a crime to knowingly alter, destroy, or falsify records with intent to obstruct an investigation, and sets long retention requirements for audit work papers. SOX is why a public fintech treats its general ledger and the systems feeding it as in-scope for rigorous, audited, tamper-evident change control.

The major US drivers and what each one actually demands

Defines WHICH books and records a broker-dealer must create: blotters, ledgers, order memoranda, account records, trial balances. It is about completeness of capture: if it happened, there must be a record of it.

These US rules are the most cited, but the principle is global and the pattern repeats. Bank regulators expect equivalent record-keeping; data-protection regimes add their own retention and access-logging expectations; market-conduct rules in many jurisdictions require time-stamped, reconstructable order and trade trails. The specific rule numbers are institution-and-jurisdiction conventions; the underlying principle, that a regulated firm must keep complete, attributed, time-ordered, tamper-evident records for years and produce them on demand, is the fundamental that does not vary.

8. Retention legal hold and the lifecycle of a record

Auditability is not only about keeping records intact; it is about keeping them for the right length of time and, sometimes, longer than the schedule says. Every record class has a retention period driven by a rule (the six-year and three-year windows under 17a-4, multi-year windows under SOX and tax law, and so on). The system must keep the record, intact and producible, for at least that long, and a well-run firm also defines when records may be disposed of, because keeping everything forever is its own risk: more data to secure, more surface for a breach, and more material to produce in litigation. Retention is therefore two-sided: you must not delete too early (a compliance violation, and under SOX 802 potentially a crime if done to obstruct), and you generally should not keep far past the requirement without reason.

Legal hold is the override that suspends the disposal side of that lifecycle. When litigation or an investigation is reasonably anticipated, the firm has a duty to preserve all potentially relevant records, which means freezing the normal retention clock for the affected data so nothing expires or is deleted while the matter is live. Failing to do this is spoliation, the destruction of evidence, and courts punish it harshly, sometimes by instructing a jury to assume the destroyed evidence was unfavorable. Engineering for legal hold means the retention and deletion machinery must accept a hold that supersedes the schedule, apply it precisely to the right records (and no more, to limit cost and exposure), and itself be auditable: who placed the hold, when, on what scope, and when it was released.

stateDiagram-v2
  [*] --> Active: record created (immutable, sealed)
  Active --> RetentionMet: retention period elapses
  RetentionMet --> Disposed: routine disposal per policy
  Active --> Held: legal hold placed
  RetentionMet --> Held: legal hold placed
  Held --> Active: hold released, clock resumes
  Held --> RetentionMet: hold released after period
  Disposed --> [*]
  note right of Held
    Under hold, disposal is suspended.
    Deleting held records is spoliation.
  end note

The lifecycle diagram captures a tension worth internalizing: immutability and disposal are not opposites, they are different phases. A record is immutable for its whole life (you never edit it in place), but it is not necessarily permanent (after retention, and absent a hold, it can be disposed of in full). WORM storage models this directly with a retention lock that forbids modification or deletion until the period expires, then permits disposal, while a hold flag can extend the lock indefinitely. The discipline is to make immutability the default during life, disposal a deliberate policy-driven act, and hold an auditable override that always wins over routine disposal.

9. The three audiences internal audit external auditors and the regulator

Auditability serves three distinct audiences with different mandates, and designing for it means understanding what each one does with your records, because they ask different questions and apply different pressure. Getting this wrong, building only for one, leaves you exposed to the others.

The three audiences and what each one tests

The common thread is that all three perform the same core act: they sample, then they trace and reconstruct, then they look for breaks. They pick a transaction or a period, ask the system to produce the full supporting trail, and check that it is complete, ordered, attributed, and unaltered. A pleasant surprise for engineers is that the system property they are testing is exactly the one this page has been building: point-in-time reconstruction from an immutable, attributed, tamper-evident log. If you can answer “show me everything that happened to this account in this window, who did it, when, and prove it has not changed,” you can satisfy all three audiences, because that is the question all three are really asking, in different vocabularies.

A worked example makes the exam concrete. An examiner selects a wire transfer from last quarter and asks the firm to demonstrate that it was properly authorized and recorded. The firm walks the trail: the ledger postings for the wire, the audit-trail event showing which operator initiated it and which supervisor approved it, the access logs showing both authenticated, the configuration in effect at the time (reconstructed as-of), and the hash-chain or WORM evidence that none of these records has been altered since. If every link is present and provable, the finding is clean. If the approval event is missing, or the records sit in a mutable store with no integrity proof, or the timestamps disagree about sequence, the examiner has a finding, and findings escalate into remediation, fines, or worse.

10. Engineering for auditability without strangling the system

Knowing what auditability requires, the engineering question is how to build it so it is reliable without crushing performance or developer velocity. Several patterns recur, and they trade off against each other.

The first principle is capture at the source, synchronously, in the same path as the change, as section four established, so the trail can never diverge from reality. The cleanest realization is an event-sourced core for the parts of the system that must be auditable: the event is the change, so completeness is structural. Where you cannot fully event-source, the next-best pattern is the transactional outbox: write the business change and its audit event in one local database transaction to an outbox table, then a separate process reliably ships the audit event onward. Because both writes share one transaction, you never get a change without its event or an event without its change, which is the exact property a thin best-effort audit log fails to guarantee.

The second principle is to keep the audit trail’s integrity machinery cheap by exploiting append-only. Hashing each record as it lands costs almost nothing and never requires touching old records. Publishing a checkpoint (the latest chain hash, possibly signed, possibly to an external or WORM destination) periodically rather than per-record keeps the integrity cost off the hot path. Reconstruction can be made fast with snapshots: periodically persist a derived state with the log offset it corresponds to, so an as-of query replays only the events since the nearest snapshot rather than from the beginning of time, while the full log remains the authority.

How far back a point-in-time query must replay

Snapshot interval7 days

0 days30 days

Worst-case events to replay at about 2000 per day14,000

Balanced: cheap reconstruction, modest snapshot overhead

The slider makes the snapshot trade-off tangible: snapshot too often and you pay storage and write overhead for state you rarely query; snapshot too rarely and every as-of question replays a long tail of events. The log itself is never the thing you tune away, because it is the source of truth; you tune how much derived state you cache alongside it to make reconstruction fast.

The third principle is separation and least privilege around the trail itself. The audit trail should be written by the application path but not freely modifiable by it; ideally it lands in storage where the application’s own credentials cannot rewrite history (WORM, an append-only sink, a separate account with write-but-not-overwrite permissions). The classic failure is giving the same service that mutates business data unrestricted delete rights over the audit store, which means a compromise of that service, or a malicious insider operating it, can erase its own tracks. Auditability against insiders specifically requires that the people and processes who can act cannot also silently rewrite the record of their actions.

11. Failure modes and the difference between tamper-evident and tamper-proof

An audit trail can be present and still fail, and knowing the failure modes is what separates an engineer who trusts the structure blindly from one who builds the right controls. Each failure below is a real way audits go wrong.

How an audit trail fails in production

The deepest distinction in this whole topic is the one between tamper-evident and tamper-proof, and conflating them is a common and dangerous error. Tamper-evident means alteration is detectable after the fact: a hash chain does not stop you from editing an old record, it guarantees that if you do, the broken seals reveal it (provided someone holds an earlier trusted checkpoint to compare against). Tamper-proof, or tamper-resistant, means alteration is prevented: WORM media physically or logically refuses the overwrite, so the change cannot happen in the first place. They are different guarantees with different costs and different failure modes.

Tamper-evident vs tamper-proof

Alteration is not prevented, it is detectable. Anyone can edit a record, but the broken hashes downstream reveal it, AS LONG AS a trustworthy earlier checkpoint exists to compare against. Cheap, flexible, and the basis of the SEC's 2022 audit-trail alternative. The weakness: if the attacker controls everything including all published checkpoints, they can rebuild a fully consistent fake chain, so the proof depends on an independent anchor of trust.

The honest position is that almost nothing is truly tamper-proof; even WORM depends on correct configuration, working hardware, and an administrator who does not have a back door, and a hash chain depends on an independent, trustworthy checkpoint the attacker cannot also forge. Mature systems therefore layer the two and add a third anchor: publish or replicate checkpoints to an independent party (a separate cloud account, a notary, a regulator filing, in extreme cases a public chain) so that no single actor controls every copy of the truth. Tamper evidence is only as strong as the independence of the anchor you verify against.

12. Scaling auditability from a startup to a regulated institution

Auditability is not a fixed cost; it grows with the firm, and the right amount of it at each stage is different. Building the full apparatus on day one is over-engineering; ignoring it until an examiner asks is a crisis. The path has recognizable stages.

For an early-stage startup, the foundation is cheap and worth doing immediately because retrofitting it is expensive: make the ledger and the core records append-only from the start, store money correctly, and capture who-did-what-when on privileged actions synchronously. You do not yet need WORM hardware, signed checkpoints, or a formal retention schedule, but you must not build a system that overwrites history, because that decision is the one that is genuinely hard to reverse later. An append-only core is the seed everything else grows from.

As the firm takes on regulated activity, the obligations attach and the apparatus must catch up: explicit retention schedules mapped to the rules that apply, tamper evidence (hash chaining, often WORM storage for the records 17a-4 covers), formal access controls with the audit trail kept away from the acting services, legal-hold capability, and the ability to produce reconstructions on demand for examiners. At full institutional scale, auditability becomes an organization, not just a system: a dedicated internal audit function, SOX control testing, evidence-capture pipelines feeding a compliance archive, e-discovery tooling for legal hold, and regular examinations whose document requests the system must satisfy in days.

flowchart TB
  subgraph s1["Stage 1: startup foundation"]
    A["Append-only core, money stored correctly,<br/>who-did-what-when on privileged actions.<br/>The hard-to-reverse decision, made right early."]
  end
  subgraph s2["Stage 2: regulated growth"]
    B["Retention schedules mapped to rules,<br/>hash chaining and WORM, access controls,<br/>legal hold, on-demand reconstruction"]
  end
  subgraph s3["Stage 3: institution"]
    C["Internal audit function, SOX control testing,<br/>evidence pipelines, e-discovery, recurring exams"]
  end
  s1 --> s2 --> s3

The constant across every stage is the one decision that is expensive to undo: do not overwrite the past. A startup that internalizes that gets to add WORM, signatures, retention schedules, and audit functions incrementally as obligations arrive, because each of those is a layer on top of an append-only foundation. A startup that builds a mutable, overwrite-in-place system gets to rebuild its core under regulatory time pressure, the worst possible condition for that work. Auditability scales gracefully if and only if its foundation, immutability, was laid first. That is why this page treats it not as a compliance feature but as a first-principles property of how you store the truth.

Mastery Questions

Your team keeps detailed application logs in a centralized search system, retained for 90 days, that record every significant action with a user id and timestamp. Leadership says the firm is “fully auditable” because of this. As the engineer, what gaps would you raise, and what would you change?

Answer. Detailed logs are a useful input but they are not an audit trail in the sense an examiner means, and several gaps follow. First, retention: 90 days is a debugging window, while the applicable rules (for example 17a-4’s three-and-six-year windows, or SOX retention) demand years, so records that must survive an exam are being deleted long before. Second, mutability: a centralized search store is editable and deletable by operators, so even if nothing was altered you cannot prove it, and the logs have no tamper evidence (no hash chain, no WORM, no signed checkpoints) to establish integrity. Third, completeness and coupling: if the logging is a separate best-effort call rather than written in the same transaction as the change, gaps are invisible and you cannot guarantee every recordable action produced an event. Fourth, context and reconstruction: ordinary logs often record that something changed but not the full who-action-target-time-reason in business terms, and a search index is not built to reconstruct an account’s exact state as of a past instant. The changes I would push: make the core records append-only and event-sourced so completeness is structural; capture the audit event synchronously in the same path as the change; add a hash chain with periodic signed checkpoints and, for the records the rules cover, WORM storage; set retention schedules mapped to the actual rules with legal-hold override; and make point-in-time reconstruction a first-class capability rather than relying on a 90-day search index. The logs stay for engineering convenience; the audit trail becomes a separate, durable, tamper-evident, reconstructable system of record.
An auditor argues your hash-chained audit log proves your records have never been tampered with. A skeptical security engineer says it proves no such thing. Who is right, and what would make the proof actually hold?

Answer. The security engineer is right that the claim is too strong, and the precise reason is the difference between tamper-evident and tamper-proof. A hash chain does not prevent alteration; it makes alteration detectable, but only relative to a trusted reference point. If an attacker has write access to the entire log and to every copy of the checkpoint hashes, they can edit a historical record and then recompute every subsequent hash and every checkpoint, producing a chain that is perfectly internally consistent and shows no break. Internal consistency of the chain alone proves nothing, because the attacker can manufacture it. What makes the proof actually hold is an independent anchor of trust that the attacker cannot also forge: checkpoint hashes published or replicated to a party outside the attacker’s control (a separate account with different credentials, an external notary or timestamping authority, a regulator filing, or in the extreme a public ledger), captured at the time the records were created. Then to alter the past undetectably, the attacker would have to also alter the independently held checkpoint, which they cannot reach. The honest statement is therefore: the records have not been altered since the last independently anchored checkpoint without detection, provided that anchor is genuinely independent. Layering WORM storage underneath raises the bar further by removing the easy overwrite path. The auditor’s confidence is justified only to the strength of the independent anchor, not from the chain alone.
During an examination, a regulator selects a single back-office limit increase from eight months ago and asks you to prove who authorized it, when, what the limit was before and after, and that the record has not changed since. Walk through how an auditable system answers this, and name what would make each part fail.

Answer. This question is the whole page applied to one event, and an auditable system answers it by walking the immutable, attributed, tamper-evident trail. To prove who and when, you produce the audit-trail event captured synchronously at the moment of the action, naming the actor, the action, the target account, the timestamp, and the approval that authorized it; this fails if the event was a best-effort side write that was skipped (a gap) or if it recorded the data delta but not the actor and approval (missing context). To prove the before-and-after limit, you reconstruct the account’s limit as of just before and just after the event by replaying the append-only event log up to each instant and folding, which yields not just the numbers but the events that produced them; this fails if the system overwrote the limit in place and kept no event history, leaving it able to show only the present value. To prove the record has not changed since, you verify the event’s position in the hash chain against an independently held checkpoint from around that time, and show it sits on WORM storage under a retention lock; this fails if the records live in a mutable store with no integrity proof, or if the only checkpoints are ones the firm could itself have rewritten. If every link holds, the answer is complete, attributed, reconstructed, and provably unaltered, which is a clean finding. The places it fails map exactly onto the failure modes: a gap, mutable history, missing context, or an integrity proof with no independent anchor.

Sources & evidence17 claims · 8 cited

Grounded in SEC Exchange Act Rules 17a-3 and 17a-4 (including the 2022 audit-trail-alternative amendment), Sarbanes-Oxley Sections 404 and 802, and standard event-sourcing/append-log/hash-chain engineering practice; mirrors the double-entry exemplar's structure and voice. Gap: exact retention-period numbers under 17a-4 are stated as commonly-three-and-six-year windows rather than enumerated per record class, and non-US regimes are described in principle only.

SEC Rule 17a-3 specifies which books and records a broker-dealer must make (blotters, ledgers, order memoranda, account records, trial balances).verified
SEC Rule 17a-4 specifies how long broker-dealer records must be preserved and in what manner, commonly six years for some categories and three years for others, with the most recent portion kept readily accessible.verified
Rule 17a-4 historically required electronic records to be preserved in a non-rewriteable, non-erasable (WORM) format.verified
The SEC amended Rule 17a-4 in 2022 to add an audit-trail alternative: a system preserving the original record plus every modification and able to recreate the original satisfies the rule in place of strictly non-rewriteable media.verified
The Sarbanes-Oxley Act of 2002 was enacted following the Enron and WorldCom accounting scandals.verified
SOX Section 404 requires management to assess and report on the effectiveness of internal control over financial reporting, with external-auditor attestation for public companies.verified
SOX Section 802 criminalizes knowingly altering, destroying, or falsifying records with intent to obstruct an investigation, and sets long retention requirements for audit work papers.verified
A cryptographic hash has the avalanche property: any change to the input changes the output unpredictably, so a small record edit cannot produce a small plausible change to its fingerprint.stable common knowledge
A hash chain seals each record with a hash of itself and the prior record's hash, so altering any past record breaks every subsequent seal; the same structure underlies blockchains and Git commits.stable common knowledge
Event sourcing stores the sequence of events as the source of truth and derives current state by folding over them, so there can be no state change without a corresponding recorded event.stable common knowledge
A database write-ahead log records changes in append-only order before applying them for durability and crash recovery, but captures physical/logical row changes rather than business actor and intent.stable common knowledge
Tamper-evident means alteration is detectable after the fact relative to a trusted reference point; tamper-proof/tamper-resistant means alteration is prevented (e.g. WORM refuses the overwrite until retention expires).internal reasoning
A hash chain alone does not prove non-tampering: an attacker controlling the log and all checkpoints can rebuild an internally consistent fake chain, so the proof requires an independent trusted anchor.internal reasoning
Bitemporal modeling distinguishes effective time (when an event took effect) from recording/system time (when the system learned of it), enabling reconstruction of the books as believed on a date versus as now understood.stable common knowledge
Legal hold suspends routine retention-based disposal when litigation is reasonably anticipated; failing to preserve evidence is spoliation, which courts may sanction including with adverse-inference instructions.verified
Capturing the audit event in the same atomic transaction as the change (or via event sourcing or a transactional outbox) makes completeness structural, preventing changes with no record or records with no change.internal reasoning
Internal audit reports to the board's audit committee independent of management; external auditors opine on financial statements and (for public companies) on internal controls; regulators examine specific obligations with enforcement power.stable common knowledge

Cited sources

Securities Exchange Act Rule 17a-3 and Rule 17a-4 (Records to be made and preserved by certain exchange members, brokers and dealers) · U.S. Securities and Exchange Commission
Electronic Recordkeeping Requirements for Broker-Dealers, Security-Based Swap Dealers, and Major Security-Based Swap Participants (2022 amendments to Rule 17a-4) · U.S. Securities and Exchange Commission
Sarbanes-Oxley Act of 2002 · U.S. Congress
Handbook of Applied Cryptography · Menezes, van Oorschot, Vanstone
Event Sourcing pattern · Martin Fowler
PostgreSQL Documentation: Write-Ahead Logging (WAL) · PostgreSQL Global Development Group
Bitemporal data modeling (system time vs valid/effective time) · Richard T. Snodgrass
Federal Rules of Civil Procedure (preservation duties and spoliation, including Rule 37(e)) · U.S. Courts / Administrative Office