Illustrative example
Decision record
support-router-prod
Confidence
- Winning owner: Customer Ops Platform
- Budget: CX Automation
- Action: Shape recommendation
Computed inside the customer environment. Raw telemetry never leaves.
Platform
Your AI bill shows spend. It does not show ownership. Venturi builds an attribution graph inside the customer environment, resolving every model invocation to its service, code owner, identity, org, and budget.
No tags. No code-ownership spreadsheet. No gateway-only blind spot.
What the platform resolves
Token and cost totals show that spend moved. Venturi resolves why: which service produced it, which code owner is accountable, which identity triggered it, and which org and budget should absorb it.
Illustrative example
Six-layer chain
| Layer | Evidence source | Example value |
|---|---|---|
| Model invocation | model usage telemetry | anthropic.claude-sonnet |
| Service attribution | route, trace, vendor project | support-router |
| Code ownership | CODEOWNERS, deploy SHA | Customer Ops Platform |
| Identity resolution | Okta group, service account trace | svc-support-router |
| Org hierarchy | SSO group and budget hierarchy | CX Operations |
| Budget attribution | finance budget path | CX Automation |
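The six-layer chain can be pictured as an ordered list of links, each pairing a layer with its evidence source and resolved value. The sketch below is only an illustration under assumptions: `ChainLink` and `first_gap` are hypothetical names, not Venturi's schema, and the values are copied from the illustrative example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ChainLink:
    layer: str             # which of the six layers this link covers
    source: str            # evidence the layer draws on
    value: Optional[str]   # resolved value, or None when unresolved


# Values come from the illustrative example; the structure is an assumption.
CHAIN = [
    ChainLink("Model invocation", "model usage telemetry", "anthropic.claude-sonnet"),
    ChainLink("Service attribution", "route, trace, vendor project", "support-router"),
    ChainLink("Code ownership", "CODEOWNERS, deploy SHA", "Customer Ops Platform"),
    ChainLink("Identity resolution", "Okta group, service account trace", "svc-support-router"),
    ChainLink("Org hierarchy", "SSO group and budget hierarchy", "CX Operations"),
    ChainLink("Budget attribution", "finance budget path", "CX Automation"),
]


def first_gap(chain):
    """Return the name of the first unresolved layer, or None if the chain is complete."""
    for link in chain:
        if link.value is None:
            return link.layer
    return None
```

A chain with a gap at any layer would be reported by `first_gap`, which is one way to keep incomplete records visible instead of silently attributing them.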
Decision-time path
The decision-time interceptor reads from the attribution index and adds context when it is available. Fail-open is absolute: when enrichment cannot finish within its latency budget, the request proceeds unmodified. Optimizer, never enforcer.
01. Unknown or incomplete records stay visible, but they are not ready for chargeback or enforcement.
02. The default path enriches approvals, review queues, and budget conversations with advisory recommendations.
03. Evidence is strong enough across model, service, code owner, identity, org path, and budget to support finance review.
04. Gate requires explicit customer configuration for a workload and policy class. It is never enabled by default.
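The fail-open behavior of the decision-time path can be sketched as a lookup with a time limit. This is a minimal illustration, not Venturi's interceptor: `enrich_fail_open`, the `lookup` callable, the `attribution` key, and the 20 ms default are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor, TimeoutError as FutureTimeout

_POOL = ThreadPoolExecutor(max_workers=4)


def enrich_fail_open(request: dict, lookup, budget_s: float = 0.02) -> dict:
    """Attach attribution context if `lookup` finishes within `budget_s` seconds.

    On timeout, error, or missing context the original request is returned
    unmodified, so enrichment can never block or alter the workload.
    """
    future = _POOL.submit(lookup, request)
    try:
        context = future.result(timeout=budget_s)
    except FutureTimeout:
        future.cancel()  # best effort; an already running lookup is abandoned
        return request
    except Exception:
        return request   # lookup errors also fail open
    if context is None:
        return request
    return {**request, "attribution": context}
```

Returning the same `request` object on every failure path makes the fail-open guarantee easy to check: the caller either receives an enriched copy or exactly what it sent.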
How attribution is computed
The graph is derived state. Corrections create new events; they never overwrite existing ones. The interceptor reads a precomputed attribution index rather than traversing the graph at decision time.
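A small sketch of the derived-state idea, assuming an append-only event log that is replayed into a lookup index. `AttributionEvent`, `build_index`, and the initial mis-attribution to CX Operations are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AttributionEvent:
    seq: int                    # position in the append-only log
    service: str
    owner: str
    kind: str = "observation"   # or "correction"


def build_index(events):
    """Replay the log in order; later events supersede earlier ones in the index only."""
    index = {}
    for event in sorted(events, key=lambda e: e.seq):
        index[event.service] = event.owner
    return index


log = [AttributionEvent(1, "support-router", "CX Operations")]
# A correction is appended as a new event; the original event stays in the log.
log.append(AttributionEvent(2, "support-router", "Customer Ops Platform", kind="correction"))
index = build_index(log)
```

Because the index is rebuilt from the log, the earlier attribution remains auditable, and a decision-time reader only needs a dictionary lookup.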
Illustrative example
Evidence stack
Direct match
vendor project, route, service ID
Strong evidence
Resolved
Temporal correlation
deploy window and usage spike
Supporting evidence
Plausible
Naming convention
service account and budget prefix
Weak supporting evidence
Plausible
Historical pattern
prior owner and workload cadence
Supporting evidence
Contested
Service account tracing
shared key fanout
Contested evidence
Contested
Proportional allocation
usage share across owners
Allocation prior
Shape recommendation
Operational confidence
HRE preserves unresolved and contested evidence. Corrections create new events, and the attribution graph remains derived state. Confidence is calibrated, capped at 0.95, and not additive: stronger and weaker evidence are reconciled, never summed.
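The text states that confidence is capped at 0.95 and reconciled, not summed, but it does not give the reconciliation rule. The sketch below therefore uses an assumed rule purely for illustration: take the strongest supporting score and discount it by the strongest contesting score. The function name and the scores are hypothetical.

```python
CONFIDENCE_CAP = 0.95  # cap stated in the text


def reconcile(supporting, contesting=()):
    """Combine evidence scores without summing them.

    Assumed rule (not from the source): start from the strongest supporting
    score, discount it by the strongest contesting score, then apply the cap.
    """
    if not supporting:
        return 0.0
    best = max(supporting)
    penalty = max(contesting, default=0.0)
    return min(CONFIDENCE_CAP, best * (1.0 - penalty))
```

Under this rule three weak signals of 0.6, 0.5, and 0.4 yield 0.6, not 1.5, and even a near-certain direct match never exceeds the cap.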
Decision records
A decision record is useful only when reviewers can inspect why an owner won, which candidates lost, which evidence was used, and what action class follows from the confidence score.
Illustrative example
Decision record
agent-eval-runner
req_c448af · openai.gpt-4.1
Operational confidence
Winning owner
AI Platform
Budget
Model Quality Programs
Service
quality-evals
Request class
evaluation batch
Time basis
deployment window
Illustrative example
Decision record
shared-notebook-key
req_unknown · gemini.1.5-pro
Operational confidence
Winning owner
Unknown
Budget
Unassigned AI spend
Service
unknown service
Request class
ad hoc analysis
Time basis
month-to-date
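The two illustrative records above can be modeled as one small data structure. This is a sketch under assumptions: the `DecisionRecord` class, its field names, and the `losing_candidates` and `evidence` fields are hypothetical, chosen to mirror what a reviewer is described as inspecting.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DecisionRecord:
    workload: str
    winning_owner: str
    budget: str
    service: str
    request_class: str
    time_basis: str
    losing_candidates: tuple = ()   # hypothetical: owners considered and rejected
    evidence: tuple = ()            # hypothetical: evidence labels that were used

    def is_attributed(self) -> bool:
        """True when an owner was resolved; 'Unknown' records stay visible but unassigned."""
        return self.winning_owner != "Unknown"


resolved = DecisionRecord(
    "agent-eval-runner", "AI Platform", "Model Quality Programs",
    "quality-evals", "evaluation batch", "deployment window",
)
unknown = DecisionRecord(
    "shared-notebook-key", "Unknown", "Unassigned AI spend",
    "unknown service", "ad hoc analysis", "month-to-date",
)
```

Keeping the unknown record in the same structure as the resolved one means unattributed spend is reported alongside attributed spend instead of being dropped.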
Why Venturi
Venturi asks a simple question in the decision path: which advisory action minimizes total risk-adjusted cost given the available evidence?
Plain English
Venturi places attributed context in the decision path. Shape mode recommends and enriches by default. Gate mode is available only when a customer explicitly opts in for a workload.
Decision Rule
Shape mode returns recommendations, enriched context, and review evidence. Gate mode can block only for workloads where the customer has explicitly enabled that workload and policy path.
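The decision rule amounts to an explicit allow-list check. The sketch below assumes a set of opted-in (workload, policy class) pairs; `resolve_mode` and the `budget-overrun` policy class are hypothetical names used only for illustration.

```python
def resolve_mode(workload: str, policy_class: str, gate_enabled=frozenset()) -> str:
    """Return "gate" only for explicitly enabled (workload, policy class) pairs.

    Everything else, including the empty default configuration, resolves to "shape".
    """
    return "gate" if (workload, policy_class) in gate_enabled else "shape"


# Hypothetical customer opt-in for a single workload and policy class.
opt_in = frozenset({("support-router-prod", "budget-overrun")})
```

With no configuration at all, every workload resolves to shape mode, which matches the stated default.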
Actions
The available moves.
Model choice, routing decision, region, and configuration. Venturi evaluates the viable options before the workload decision is made.
TRAC
Total Risk-Adjusted Cost.
Direct cost plus a confidence risk premium. Not just what the request costs, but how much uncertainty remains in the owner, budget, and service chain.
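A worked sketch of TRAC as direct cost plus a confidence risk premium. The source does not define the premium, so the form used here, `(1 - confidence) * exposure`, is an assumption, as are the function names, the `exposure` parameter, and the example figures.

```python
def trac(direct_cost: float, confidence: float, exposure: float) -> float:
    """Total Risk-Adjusted Cost = direct cost + confidence risk premium.

    Assumed premium: (1 - confidence) * exposure, where exposure is the cost
    of acting on a wrong owner, budget, or service attribution.
    """
    return direct_cost + (1.0 - confidence) * exposure


def best_action(candidates):
    """Pick the candidate name with the lowest TRAC.

    candidates: iterable of (name, direct_cost, confidence, exposure).
    """
    return min(candidates, key=lambda c: trac(c[1], c[2], c[3]))[0]


# Hypothetical options: the cheaper one carries more attribution uncertainty.
options = [
    ("model-a", 10.0, 0.5, 20.0),
    ("model-b", 12.0, 0.9, 20.0),
]
```

Here `model-a` has the lower direct cost, but its premium of about 10 pushes its TRAC to roughly 20, while `model-b` comes out near 14, so `best_action(options)` selects `model-b`.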
Deployment path
The first deployment should prove attribution coverage before any advisory workflow turns on. Shape recommendations reach reviewers only after the customer trusts the evidence. Gate stays opt-in and workload-specific.
Connector inventory
| Group | Connectors | Data used | Permission | Excluded | Sync | Contribution |
|---|---|---|---|---|---|---|
| Cloud | AWS, Azure, GCP | billing, cost APIs, usage tags where present | read-only cost and monitor APIs | resource modification, secrets, production writes | hourly or daily export | budget and vendor project confidence |
| CI/CD | GitHub Actions, GitLab CI, Jenkins, Buildkite | deployment metadata, service IDs, release timestamps | metadata read | deployment mutation | event or scheduled sync | service and release attribution |
| Source control | GitHub, GitLab, Bitbucket | repository metadata, ownership, CODEOWNERS | metadata read | source code content | scheduled sync | code ownership confidence |
| Identity | Okta, Azure AD, Google Workspace | users, groups, service identities | directory read | identity writes and password data | scheduled sync | identity and org path confidence |
| HCM and org | Workday, Oracle HCM, generic HCM | org hierarchy and team membership | limited org read | salary, performance, sensitive HR data | daily sync | budget and adoption projection context |
| Telemetry | model usage, token usage, latency, cost signals | usage totals, routes, latency, request classes | read-only telemetry export | prompt and output export | stream or batch | invocation and service attribution |
Projections, not modules
Cost attribution and adoption intelligence are two projections of the same attribution graph, resolved to one confidence-scored decision record. They answer different operating questions without becoming separate product modules. Every component runs inside the customer environment behind a read-only boundary; the full trust boundary is detailed on the Security page.
Design partner review
No production data is required for the first conversation. Start with one workload, one unclear owner or budget path, and the decision your team cannot make confidently today.
Useful starting points