Why register tools per-org instead of giving every agent every tool?

Two reasons: cost and security. Every registered tool inflates the system prompt with its description, which costs tokens on every turn. And if the supervisor knows about a Salesforce tool the org has not connected, the LLM will hallucinate Salesforce calls; the user sees a 401 and blames the AI. Binding only what the org has paid for and integrated removes both problems at the registration layer.

How fast is per-org agent instantiation?

Tool factory caches by {orgId, integrations_hash}. Cold path: ~150ms (3 Firestore reads + tool wiring). Warm: ~5ms (in-memory map lookup). Cache is invalidated via Pub/Sub on integration change, with <1-second propagation. A naive implementation that rebuilds the supervisor from scratch every call would add 150ms to every request — at scale that is your latency budget gone.

How is tenant isolation actually enforced?

Three layers. (1) Firestore path scoping: all data lives under organizations/{orgId}/... and a query without the path segment fails at dispatch. (2) TenantContextMiddleware: reads the immutable Firebase custom claim, fetches the permissions doc, attaches a request-scoped context. Downstream code reads from request.context — never from the token, URL, or client headers. (3) Per-tool permission check inside the handler. Cross-tenant access requires bypassing all three; the failure mode is "loud build-time error", not "silent runtime leak".

What does the document intelligence pipeline actually do?

Eight stages: Parser (text extraction from PDF/DOCX/XLSX) → Classifier (document type detection) → Evidence Extractor (supporting text per field) → Extractor (structured field extraction via Gemini 2.5 Pro) → Validator (cross-field consistency: dates, parties, amounts) → Scorer (per-field confidence) → Cross-Doc Linker (entity reconciliation across docs) → Graph Writer (Neo4j upsert). Confidence below threshold routes to a human-review queue. The graph then links companies → investments → documents → obligations.

Does this generalize beyond family offices?

Yes. The pattern — supervisor router + per-org tool registration + zero-trust tenant isolation + document intelligence pipeline — is the right shape for any multi-tenant AI SaaS where (a) customers connect different integrations, (b) data isolation is non-negotiable, and (c) unstructured documents need to become structured data. We ship this pattern for legal, regulated finance, and enterprise knowledge SaaS.

Multi-Agent SaaS Case Study: 7 Agents on Google ADK

Q: How is tenant isolation actually enforced?

Three layers. (1) Firestore path scoping: all data lives under organizations/{orgId}/... and a query without the path segment fails at dispatch. (2) TenantContextMiddleware: reads the immutable Firebase custom claim, fetches the permissions doc, attaches a request-scoped context. Downstream code reads from request.context — never from the token, URL, or client headers. (3) Per-tool permission check inside the handler. Cross-tenant access requires bypassing all three; the failure mode is "loud build-time error", not "silent runtime leak".

Q: What does the document intelligence pipeline actually do?

Eight stages: Parser (text extraction from PDF/DOCX/XLSX) → Classifier (document type detection) → Evidence Extractor (supporting text per field) → Extractor (structured field extraction via Gemini 2.5 Pro) → Validator (cross-field consistency: dates, parties, amounts) → Scorer (per-field confidence) → Cross-Doc Linker (entity reconciliation across docs) → Graph Writer (Neo4j upsert). Confidence below threshold routes to a human-review queue. The graph then links companies → investments → documents → obligations.

Q: Does this generalize beyond family offices?

Yes. The pattern — supervisor router + per-org tool registration + zero-trust tenant isolation + document intelligence pipeline — is the right shape for any multi-tenant AI SaaS where (a) customers connect different integrations, (b) data isolation is non-negotiable, and (c) unstructured documents need to become structured data. We ship this pattern for legal, regulated finance, and enterprise knowledge SaaS.

Outcome metrics

Client: A multi-family office platform serving high-net-worth families with diligence, portfolio operations, document intelligence, and graph-based entity analytics.

7agents (Financial, Legal, Knowledge, Document, Calendar, Email, Echo)

Specialist agents behind one supervisor

5ms (cold path: 150ms)

Supervisor warm-path instantiation

was naive rebuild: 150ms every request before

70+endpoints

FastAPI endpoints behind tenant middleware

60+permissions across 5 roles

Permission scopes (wildcard-aware)

0(structurally enforced via path scoping)

Cross-tenant data leakage incidents

<1second (integration change → next request)

Real-time sync propagation

was polling: 60-second worst-case before

The Challenge

A multi-family-office SaaS had to consolidate QuickBooks financials, Google Workspace (Drive, Calendar, Gmail), and investment documents into a single AI-driven platform. Family offices juggle financial statements, legal documents (PPMs, SPAs, SAFEs), cap tables, emails, and calendars across disconnected systems. Manual data extraction from investment documents is error-prone and slow. There was no unified view of portfolio, entities, and obligations — and the multi-tenant requirements made naive "register every tool and let the LLM choose" approaches both expensive (tokens per turn) and unsafe (hallucinated calls to non-integrated tools).

The Solution

A supervisor-router agent on Google ADK 1.15 dispatches to 7 specialist agents based on intent. The supervisor is instantiated per-request from a factory that reads the org-level RBAC and integration status before binding tools — so each org's supervisor only sees the tools that org has paid for and connected. A document intelligence pipeline (parser → classifier → evidence → extraction → validation → scorer → graph writer) auto-extracts structured data from investment documents using Gemini 2.5 Pro, writing the results into a Neo4j knowledge graph linking companies, investments, and documents. Zero-trust tenant isolation is enforced at the Firestore path layer (organizations/{orgId}/...) and via TenantContextMiddleware on every request.

Implementation

Supervisor-router with per-org tool registration

Every request hits TenantContextMiddleware which reads the immutable Firebase custom claim, fetches organizations/{orgId}/permissions, and attaches the merged context to request.context. The supervisor factory takes that context and assembles a fresh Agent: which specialists to register, which tools to bind to each, which system-prompt fragments to splice in. The output is cached by {orgId, integrations_hash} — cold path ~150ms (Firestore reads + tool wiring), warm ~5ms (in-memory map lookup). Pub/Sub invalidates the cache <1s after any integration change.

RBAC at two layers

Layer one (tool registration): the factory only binds tools the org is allowed to use. The LLM literally does not know the others exist — system prompt is shorter, hallucination cannot reach into a non-connected Salesforce. Layer two (per-tool permission check inside the handler): every tool starts with assert_permission(request.context, "salesforce:read"). Defense in depth — the factory layer can be bypassed accidentally; the tool layer is enforced last. Together they cover both gaps.

Document Intelligence pipeline

Parser (PDF / DOCX / XLSX text extraction) → Classifier (document type via Gemini 2.0 Flash) → Evidence Extractor (supporting text per field) → Extractor (structured field extraction via Gemini 2.5 Pro) → Validator (cross-field consistency: dates, party names, amounts) → Scorer (confidence per field) → Graph Writer (Neo4j upsert with cross-document entity linking). The pipeline handles PPMs, SPAs, SAFEs, and cap tables. Confidence below threshold triggers a human-review queue rather than silent low-quality writes.

Zero-trust multi-tenant Firestore

Every collection lives under organizations/{orgId}/. There is no top-level documents collection — only organizations/{orgId}/documents. A query that omits the orgId path segment fails at dispatch. The system has no way to "accidentally" query across tenants because the path itself enforces scope. 60+ permissions follow the format "{resource}:{action}:{scope}" — examples: "documents:read:org", "documents:*:org", "billing:read:platform". Five roles bundle them; wildcards expand at check-time, not at storage.

Real-time sync via webhooks + Pub/Sub

Gmail push notifications land in a Cloud Pub/Sub topic the backend subscribes to; new threads index into the per-org Vertex AI Search engine within ~1 second. Google Drive uses watch-channel webhooks with polling fallback (Drive's webhook reliability is good-but-not-perfect; the poll catches missed deliveries). QuickBooks uses scheduled syncs on Cloud Scheduler — 8 jobs cover financials, transactions, and entity reconciliation.

“The supervisor binds only the tools each org has actually paid for and integrated — the LLM doesn't even know the other tools exist. That single design choice eliminated a class of hallucinated API calls we were dreading.”

CTO, multi-family office (anonymized)

TL;DR

How Cognilium built a multi-tenant AI SaaS with 7 specialist agents on Google ADK, per-org tool registration, and zero-trust Firestore isolation.

Seven specialist AI agents (Financial, Legal, Knowledge, Document, Calendar, Email, Echo) behind a supervisor router, with per-org tool registration and zero-trust multi-tenant Firestore isolation. The architecture that ships a production multi-tenant AI SaaS without forking agents per customer.

Google ADK case studymulti-tenant agentssupervisor routerzero-trust Firestoredocument intelligenceNeo4j knowledge graphfamily office AIRBAC wildcard permissions

A multi-family-office SaaS had to consolidate QuickBooks financials, Google Workspace, and a 10+ year archive of investment paperwork into a single AI-driven platform. The architectural challenge was not building the agents — it was building a multi-tenant agent platform where each org saw only the tools they had paid for and integrated, where data isolation was structurally enforced rather than remembered, and where unstructured documents (PPMs, SPAs, SAFEs, cap tables) became structured data without an army of analysts.

Why a single all-tools agent does not work

The naive approach — register every tool, let the LLM ignore the irrelevant ones — fails on two axes. Cost: every tool description in the system prompt costs tokens on every turn; for 7 agents with 12 tools each, that is roughly 84 tool descriptions on every request. Security: the LLM hallucinates a Salesforce call for an org without Salesforce, the user sees a 401 and blames the AI. The supervisor has to be instantiated per-request with org-aware tool binding.

The agent factory

Every request goes through TenantContextMiddleware: reads the immutable Firebase claim, looks up organizations/{orgId}/permissions, attaches the merged context to request.context. The supervisor factory takes that context and assembles a fresh ADK Agent — which specialists to register, which tools to bind, which prompt fragments to splice in. The factory caches by {orgId, integrations_hash}; warm path is ~5ms, cold path ~150ms. Pub/Sub-driven invalidation means a new integration is visible to the next request within a second.

Document intelligence as a pipeline, not a prompt

Investment documents are not "summarize this PDF" tasks — they have schema. A PPM has named parties, monetary amounts, jurisdictions, dates, voting rights. A SAFE has valuation cap, discount, MFN. A cap table has share classes, share counts, ownership percentages that must sum to 100. The pipeline runs eight stages in order with confidence scoring at each — low-confidence fields route to human review instead of corrupting the graph.

What we measured

7 specialist agents behind 1 supervisor — ~5ms warm-path instantiation, ~150ms cold
70+ FastAPI endpoints, every one tenant-isolated at the path layer
60+ permission scopes, 5 roles, 15 Firestore composite indexes
Pub/Sub-driven cache invalidation: <1 second from integration change to next request
Zero cross-tenant data leakage incidents — structurally enforced, not procedurally

What we would do differently

The 60+ permissions ended up needing a wildcard expansion layer ("documents:*:org" expands to read+write+delete+list at check time) because flat enumeration produced unwieldy role definitions. Build the wildcard expansion in from day one — retrofitting it after roles are already assigned to users is a migration headache.

Where this generalizes

Any multi-tenant AI SaaS where customers connect different integrations, data isolation is non-negotiable, and unstructured documents need to become structured data. Legal, regulated finance, enterprise knowledge management — the pattern transfers.

Technologies used

Google ADK 1.15Gemini 2.0 FlashGemini 2.5 ProFastAPIPython 3.11Next.js 16React 19Neo4j AuraFirestoreVertex AI SearchCloud RunFirebase AuthTerraformCloud Pub/Sub

How a Multi-Family-Office SaaS Consolidated 7 AI Agents on Google ADK with Per-Org Tool Registration

Outcome metrics

The Challenge

The Solution

Implementation

Supervisor-router with per-org tool registration

RBAC at two layers

Document Intelligence pipeline

Zero-trust multi-tenant Firestore

Real-time sync via webhooks + Pub/Sub

Why a single all-tools agent does not work

The agent factory

Document intelligence as a pipeline, not a prompt

What we measured

What we would do differently

Where this generalizes

Technologies used

Share this case study

Frequently Asked Questions

Why register tools per-org instead of giving every agent every tool?

How fast is per-org agent instantiation?

How is tenant isolation actually enforced?

What does the document intelligence pipeline actually do?

Does this generalize beyond family offices?

Still have questions?

Related Case Studies

How a K-12 EdTech Publisher Saves Teachers 22 Hours/Week with an AI Writing Co-Pilot

Technical deep-dives behind this result

Supervisor-Router on Google ADK with Per-Org Tool Registration

Zero-Trust Multi-Tenant Firestore: Middleware, Claims, and 60+ Wildcard Permissions

Surviving Partial Failure in a 3,300-Call Agent Pipeline

Want a result like this for your team?