§ Library

Kindred ideas

One relational map of TRACE — its own domain instances alongside independent external work that arrives at the same conclusions. Every entry is read through a single lens: the layers, principles, and metrics it engages, where TRACE adds measurable rigour, and what it should borrow back.

The engages chips reference the same L1–L4, T-R-A-C-E + parsimony, and trust-metric definitions used across this site.

Framework

TRACE itself — the anchor every other entry maps to.

TRACE

Framework

Trustworthy, Reasoned, Accountable, Context-bound, Escalated AI

Zabolotnii et al., 2026

A four-layer architecture (L1 deterministic core · L2a/L2b right-sized learned components · L3 tiered orchestration · L4 human supervision) that makes AI behaviour in operationally critical domains measurable, auditable, and safe by design. Six design principles (five in the acronym plus Model Parsimony) and seventeen trust metrics turn trust properties into measured quantities.

layers

L1L2aL2bL3L4

principles

TRACEMP

metrics

RCR · RCI · UTC · CRP · CFI · IPSR · EP · TCC · FPA · RBI · OvR · SNR · ETC · CE · ABC · OSI · CPR

Anchor of the library. Paper 1 (framework synthesis) — arXiv:2605.03838 (submitted May 2026); this site is the companion.

traces.solutions →Paper 1 · arXiv:2605.03838 →

Instance

A domain deployment of TRACE.

Clinical decision support

Instance · Clinical decision support

Physician as final authority (Instance A) · foundational

Zabolotnii, Holinko, Antonenko

The foundational instance. Rule-oriented clinical logic (L1, FUTURE-AI-aligned traceability); risk scores, vital-sign time series and lab classifiers plus clinical-note NER (L2a); an LLM validator that checks free-text notes for coherence against the patient's anamnesis (L2b); routine cases via calibrated L2a, borderline → L2b, joint high-risk ∧ high-confidence or L2a/L2b inconsistency → mandatory clinician handoff (L3); physician as final reviewer, aligned with the TRIAD human–AI framework (L4).

layers

L1L2aL2bL3L4

principles

TRACEMP

metrics

ETC · OvR · CE · CFI · CRP · EP · FPA · RBI · ABC · OSI

Contributes

·Empirical anchor for the framework — the first validated instantiation.
·Concrete mapping of TRACE onto external clinical-AI standards: FUTURE-AI traceability and the TRIAD human–AI collaboration framework.

Open / to reconcile

·Paper 0 under review — the metrological validation is written up, not yet peer-accepted.

Foundational instance. Paper 0 — IEEE Instrumentation & Measurement Magazine special issue "A Measure of Trust in Healthcare"; under review (Sep 2026).

arXiv preprint →

Industrial multi-domain — upstream oil & gas

Instance · Industrial multi-domain

Driller / supervisor / HSE escalation (Instance B) · patent pending

Shcherban (lead), Zabolotnii

A foundational instance running three sub-domains at once (technology, operations, administrative). A physics-informed control core (L1); classical ML dominant over MWD/LWD and equipment signals — anomaly detection, predictive maintenance, residual-life (L2a); LLM validators for incident narratives and contract diff (L2b); cost-tiered per-sub-domain routing with sustained-anomaly and SLA-breach escalation (L3); driller / supervisor / HSE officer and lawyer / procurement reviewers under DNV-RP-0671 governance (L4). The dominant layer shifts with the type of evidence.

layers

L1L2aL2bL3L4

principles

TRACEMP

metrics

RCR · RCI · EP · TCC · FPA · OvR · RBI · ABC · OSI · ETC · CPR

Contributes

·An adaptive non-Gaussian statistical core placed ahead of any ML stage (model parsimony in practice), validated on the open Utah FORGE geothermal dataset.
·Independent industrial origin (2024–2026) that structurally motivated the TRACE formalisation.
·Strongest validation of Model Parsimony — L2a-dominant across structured-signal sub-domains.
·Sub-domain × layer matrix showing the dominant layer shifting by evidence type.
·Where CPR gets its first empirical evaluation on a real platform (Paper 2).

Open / to reconcile

·Domain-specific methodology and operational statistics are reported separately (Paper 2, Shcherban-led) — not all public yet.

Foundational instance. Patent pending — UA u 2025 04038; U.S. Copyright Office deposit (Mar 2026). Paper 2 planned Q3 2026.

Judicial decision support

Instance · Judicial decision support

Supreme Court of Ukraine · Legal Positions Database (Instance C)

Zabolotnii, with the Supreme Court of Ukraine, funded by Expertise France

A v2 research roadmap: seven applied directions plus a cross-cutting EU-integration dimension. TRACE's value concentrates in L3, reframed as a machine-checkable normative act of the Court rather than a technical config. Direction 4.7 makes L4 oversight empirically measurable (telemetry + injected-error red-teaming + NASA-TLX).

layers

L1L2aL2bL3L4

principles

TREC

metrics

ETC · OvR · EP · CRP · CFI · UTC · RCI · IPSR · RBI · SNR · ABC · CE · OSI · FPA

Contributes

·L3 as a machine-checkable normative act — the strongest L3 formalisation across all instances.
·Direction 4.7 operationalises L4 metrics (OvR, RBI, SNR): meaningful-oversight measurement against rubber-stamping.
·Explicit out-of-scope catalogue: predictive justice, automated procedural decisions, training-data contamination.

Open / to reconcile

·Seventeen metrics under-used in the draft — addressed via the metrics-mapping companion.
·Staged autonomy (ABC) absent; natural fit: pilot → category expansion with accumulated stability data.

Internal TRACE instance. Target venues per roadmap: JURIX 2027, Artificial Intelligence and Law, Law/Innovation/Technology.

Assistant for government-agency websites

Instance · Customs & government

State Customs Service of Ukraine · tender architecture (4th domain)

Zabolotnii, 2026 (tender engineering package)

The most operationally concrete instance: named models (GliNER, XGBoost, e5-large, bge-reranker, GPT-5 Mini/Nano), a T1–T5 tier-ladder delivery, and bottom-up TCO. Fully aligned with current canon — it uses the L2a/L2b split, CPR, and the six-principle set as defined in trace.js, rather than extending them.

layers

L1L2aL2bL3L4

principles

TRACEMP

metrics

RCR · UTC · CRP · CE · EP · TCC · RBI · OvR · ETC · OSI · CPR

Contributes

·Anti-pattern catalogue (§9): LLM-by-default, TAO-conflation, confidence-only escalation, cosmetic L4, learned L1, stateless L3, L2→L4 direct — TRACE's negative space, reusable in canonical docs.
·Tier-ladder T1→T5: the clearest concrete operationalisation of Staged Autonomy / ABC (authority earned through accumulated stability data, incremental delivery).
·Task-by-task L2a-vs-L2b selection table — a worked template for the right-sized-learned principle.
·Honest CPR economics (Annex B v2): at GPT-5 prices CPR is a parsimony / overhead diagnostic, not a 'cheaper-than-baseline' slogan.

Open / to reconcile

·Surfaces a 4th domain (customs / government) absent from the site's DOMAINS — clinical, upstream, legal only.
·Annex B v1's 'CPR ≈ 0.10 = 10× cheaper' framing was retracted in v2; keep the quality + compliance-avoidance framing.

Applied / commercial instance (tender bid). Not a peer-reviewed source.

Kindred idea

Independent external work that corroborates or complements TRACE.

Orchestrated AI — neuro-symbolic enterprise governance

Kindred idea

Trisotech · Knowledge Worker Co-pilot

Denis Gagné & Tom — webinar "Retake Enterprise AI in 2026"

Three architectures — prompt-centric → integration-led → orchestrated — where only model-driven orchestration 'scales towards governance'. A neuro-symbolic split puts business rules in versioned BPMN/CMMN/DMN (not prompts), keeps a mandatory human in the loop, and treats the model as replaceable (bring-your-own-AI). Independent industry corroboration of TRACE's central thesis.

layers

L1L2bL3L4

principles

metrics

ETC · OvR

Contributes

·BPMN/CMMN/DMN as an executable substrate for L1/L3 — TRACE specifies layer properties; this offers a concrete carrier.
·'Iceberg / shadow-processes' narrative — a strong opener for talks.
·'Case = strategic map, BPMN = tactical routes' metaphor for explaining L3 orchestration.

Where TRACE adds rigour

·Governance is qualitative — no quantified metrics; TRACE's seventeen measure what they assert.
·No staged autonomy (ABC) — authority is static, not earned from stability data.
·Bounded context as a safety envelope is absent — the case file grows unbounded for months (a CFI risk, not a guardrail).
·A single escalation role vs TRACE's differentiated final-authority roles per domain.

Cite as industry corroboration only — NOT a methodological primary source (commercial, sellable-trust framing).

NotebookLM notebook →

CQL / ELM — executable substrate for clinical L1

Kindred idea

HL7 Clinical Quality Language · ANSI/HL7 CQLANG R1-2020

HL7 International (CQL v1.5.3) over FHIR R4

Computable clinical guidelines expressed as deterministic rules: human-readable CQL compiles to machine-readable ELM. The CQL↔ELM duality is exactly 'rule ↔ serialized audit trail'. A ready, standardized executable carrier for the clinical L1 layer — and an independent normative (ANSI/HL7) validation of the deterministic-core thesis; the clinical twin of BPMN/DMN for L3.

layers

principles

TCMP

metrics

ETC · CPR · RCR · UTC · CRP · CFI

Contributes

·A standardized, executable substrate for clinical L1 — TRACE specifies the layer property, CQL/ELM is a concrete carrier over typed FHIR R4 resources.
·Draws the CPR line L1↔L2b in engineering terms: what is formalizable goes to CQL, not to an LLM.
·ELM as a literal data → logic → decision chain — a direct ETC artifact.

Where TRACE adds rigour

·Removes syntactic, not clinical, ambiguity — necessary but not sufficient.
·No abstention or escalation (L3) and no measurement of oversight (L4).
·A component, not an alternative to the stack.

Independent normative (ANSI/HL7) validation of the clinical L1 thesis. Pattern-twin of Trisotech↔L3.

HL7 CQL specification →

Unlocking ROI From AI in Energy — SPE DSEATS 2026

Kindred idea

Industry corroboration for the upstream instance

Journal of Petroleum Technology, May 2026 (SPE DSEATS executive forum)

A synthesis of an SPE DSEATS executive forum at CERAWeek 2026: AI in energy fails not at the model level but at ROI attribution to board metrics, network-scale execution, decision-first data selection, hybrid physics + data-driven modelling over sparse data, domain-expert power users, and formal engineering assurance before production. Almost verbatim the problem TRACE formalizes as L1/L2a/L3/L4 plus its metric core.

layers

L1L2aL3L4

principles

ARCMP

metrics

CPR · OSI · CFI · CRP · ABC · OvR · RBI · SNR

Contributes

·Strong motivation for Paper 2: the industry frames the problem as how to attribute, scale and sustain AI value, not whether AI works.
·A business-outcome → TRACE-measurement-chain table linking ROI dashboards to the trust-metric suite.
·L4 as a concrete operating model — the domain-expert power user (drilling / reservoir / production / HSE SME with AI fluency), not an abstract reviewer.

Where TRACE adds rigour

·Not peer-reviewed and not an empirical validation of TRACE — industry signal only.
·No formal cost model — CPR must be framed as overhead diagnostic, not 'X% cheaper'.
·ROI dashboard and trust-metric suite are different layers of measurement and must not be conflated.

Open / to reconcile

·Chatham House Rule — claims anonymized, examples illustrative.

Cite as upstream industry corroboration / motivation only. Do not reveal private Instance B details.

JPT · Unlocking ROI From AI →

Pacific AI Governance Policy Suite

Kindred idea

Pacific = the WHAT · TRACE = the HOW TO MEASURE

Pacific AI, Inc., 2026-A (CC BY-NC-SA 4.0)

The most comprehensive public policy template aggregating enacted AI legislation across 30+ jurisdictions — nine modular policies plus an Adopt → Implement → Attest cadence. It covers the normative L1 substrate and declaratively requires the TRACE mechanics (three lifecycle checkpoints, human oversight & override, traceability logging). Governance corroboration: the regulatory community already mandates oversight and traceability at the policy level.

layers

L1L4

principles

TRA

metrics

ETC · UTC · OvR · ABC · CE

Contributes

·A regularly updated, enacted-legislation-only corpus — a ready policy baseline for L1 in any instance (crosses domains, unlike CQL).
·Three lifecycle checkpoints (Go/No-Go → pre-deployment → annual) as a policy formulation of staged autonomy (ABC); annual review gives metrics a longitudinal frame (OSI).
·Risk-tier → who-tests scaling (internal group vs independent third party) — the same pattern as the customs tier-ladder.

Where TRACE adds rigour

·No metrology — it requires observation (e.g. monitoring overrides) without specifying measurement, method or uncertainty. Proto-OvR, not OvR.
·No parsimony principle — no requirement to use the simplest adequate model; CPR absent.
·No L2a/L2b distinction (except GPAI by FLOPs) and no runtime L3 — checkpoints are design-time gates.

Open / to reconcile

·Vendor template under CC BY-NC-SA, not a standard; value is in the aggregation of binding requirements, not its own authority.

Governance corroboration. Use as evidence that policy bodies require oversight/traceability — and that without a metrological model these obligations are unverifiable.

Pacific AI →

W&B Weave rai-toolkit — evidence-backed verification gate

Kindred idea

Operationalized ETC · the fork TRACE-RAI-toolkit builds on this

K. Nisar, 2026 (wandb/rai-toolkit, Apache 2.0)

A working open-source governance toolkit that frames deployment approval as an evidence-backed verification gate: automated assessments, red-team traces, versioned policy-as-code, and the human reviewer's signed decision bound into a single content-hashed Weave record. The closest known working prototype of operationalized Evidence Trail Completeness; the fork TRACE-RAI-toolkit extends it with the fork's metrology.

layers

L1L4

principles

TCE

metrics

ETC · RCR · UTC · TCC · ABC

Contributes

·A working operationalized ETC — evidence + human decision + hash in one trace; 'screenshots do not survive audit' is the anti-ETC stated plainly.
·Four independent gates joined by logical AND (anti-pattern: a single blended score); worst-case red-team gating echoes joint risk ∧ confidence.
·'Un-assessed ≠ neutral pass' — the GUM spirit in MLOps; and a forkable harness for the Medical Brain gate.

Where TRACE adds rigour

·A design-time verification gate, not a runtime architecture — no stateful L3 (re-invocation, accumulated confidence) and no operational L4 (RBI/OvR/SNR over a decision stream).
·Its LLM-judge scorers are themselves uncalibrated — 'who calibrates the judge?' (CE/IPSR for validators) is unaddressed; the fork adds exactly this.
·No parsimony — it assesses the model it is given; CPR absent.

Open / to reconcile

·A vendor article about its own product; the Weave trace layer pulls toward the W&B platform (mitigate with self-hosted Weave or a backend abstraction).

Tooling corroboration. The fork TRACE-RAI-toolkit (github.com/SZabolotnii/TRACE-RAI-toolkit) develops it as the TRACE verification-gate reference implementation.

wandb/rai-toolkit →Operationalizing AI Governance →