{
  "id": "gcp",
  "name": "Google Cloud AI Infrastructure",
  "subtitle": "Mapped to the 4+1 Layer AI Infrastructure Model",
  "version": "v1.2 - Managed-Service Authority Reconciliation",
  "date": "June 20, 2026",
  "source": "Google Cloud Next 2026 (Apr 22\u201324), GTC 2026, NVIDIA partnership, Forrester, SiliconANGLE, The New Stack, analyst coverage. v1.2 (instrument reconciliation): 2A GKE Retained\u2192Delegated \u2014 managed K8s behind a standard interface is Delegated, not Retained. Cloud Storage remains Ceded (GCS-native API).",
  "status": "complete",
  "commercialRelationship": true,
  "summary": {
    "title": "Summary Finding",
    "paragraphs": [
      "Google Cloud is the only vendor in this assessment series that owns a frontier foundation model \u2014 and that single fact restructures the entire 4+1 analysis. Google built Gemini, trains Gemini on its own TPUs, optimizes its silicon for Gemini\u2019s training requirements, and weaves Gemini\u2019s intelligence into every layer of its cloud platform. This creates a model-integrated stack: an architecture where the frontier model is not a component plugged into infrastructure but the intelligence that pervades the infrastructure.",
      "No other vendor assessed possesses this vertical integration. Google owns every layer of the 4+1 model with proprietary IP: custom silicon (TPUs), custom networking (Virgo), proprietary storage (Colossus/Spanner/BigQuery), its own runtime and frameworks (JAX/Pathways), its own frontier models (Gemini), and a unified orchestration surface (Gemini Enterprise Agent Platform).",
      "The DAPM implication is not merely that every layer is Ceded \u2014 it is that every layer is ceded to a unified intelligence. With AWS, authority is distributed across multiple vendors\u2019 judgment (AWS infrastructure, Anthropic model reasoning, ISV application logic). That distribution creates complexity but also structural checks. With Google Cloud + Gemini, the enterprise concentrates authority in one vendor\u2019s judgment across every layer \u2014 from silicon to application.",
      "This is the deepest expression of vertical integration in enterprise technology since the mainframe era. The enterprise gains end-to-end optimization that no multi-vendor assembly can match. But the 4+1 framework makes visible what the integration obscures: the enterprise has no fallback position at any layer. Google Distributed Cloud (GDC) addresses data sovereignty without addressing judgment sovereignty \u2014 GDC still runs Google\u2019s software stack and Google\u2019s models.",
      "The structural question: does concentrating all layers of authority and all layers of model judgment in a single vendor deliver enough value to justify the governance position \u2014 and has the enterprise made that concentration explicit rather than inheriting it by default?"
    ]
  },
  "layers": [
    {
      "id": "layer0",
      "label": "Layer 0",
      "title": "Compute & Network Fabric",
      "purpose": "Raw compute, networking, and acceleration fabric",
      "status": "strong",
      "statusLabel": "Ceded to Google",
      "nvidia": [
        {
          "component": "NVIDIA GPU Silicon",
          "detail": "Vera Rubin NVL72, Blackwell B200/B300, H100/H200. 1M+ NVIDIA GPUs. NVIDIA instances serve third-party models that can\u2019t run on TPU."
        },
        {
          "component": "NVIDIA NIXL + Networking",
          "detail": "NIXL for disaggregated inference. ConnectX/BlueField for GPU networking. Google manages the NVIDIA integration layer."
        }
      ],
      "gap": "No Layer 0 capability gap \u2014 Google\u2019s portfolio is the broadest of any single cloud provider. The gap is governance: the enterprise has no authority over any Layer 0 component beyond selecting instance types.\n\nThe silicon-model feedback loop is structurally unique: Google designs TPUs to train Gemini, not primarily to sell cloud compute. TPU roadmap decisions reflect Gemini\u2019s training topology, not enterprise customer workload requirements. The enterprise inherits optimization it didn\u2019t direct. AWS\u2019s Trainium is designed for customer workloads. NVIDIA designs for the broadest market. Google designs for Gemini and makes TPUs available to customers.\n\nThe multi-accelerator matching problem (TPU vs NVIDIA vs Axion CPU) creates a workload-to-silicon decision that recurs per-workload in cloud vs once at procurement on-prem. No productized policy engine automates that matching. Fluid Compute (Layer 2A) begins to address it but doesn\u2019t consult governance metadata.\n\nGDC follows the same inverted operating model as AWS AI Factories: Google operates infrastructure the customer houses. Unlike Dell PowerRack or HPE ProLiant (enterprise-owned hardware), GDC is Google-operated even when customer-hosted.",
      "borrowedJudgment": "The silicon-model feedback loop: Google\u2019s TPU roadmap is driven by Gemini\u2019s training requirements. If Google decides TPU 9 should optimize for MoE architectures because that\u2019s where Gemini is heading, every enterprise TPU workload inherits that architectural bet. Borrowed judgment at the silicon layer \u2014 a concept with no parallel in the Dell or HPE assessments.\n\nVirgo as borrowed network judgment: the enterprise inherits Google\u2019s network optimization decisions without visibility or control. Cannot audit bandwidth sharing across tenants or prioritization of Google\u2019s own Gemini training traffic.",
      "notes": "The dual-architecture hedge (TPU + NVIDIA) gives Google pricing leverage and architectural independence. The enterprise benefits indirectly but does not control whether NVIDIA GPU instances remain first-class citizens as Google optimizes for its own silicon.",
      "components": [
        {
          "component": "Google Custom Silicon (TPUs)",
          "detail": "TPU 8t (training): 9,600-chip superpods, ~3x processing power vs Ironwood. TPU 8i (inference): 288GB HBM, 384MB on-chip SRAM, ~80% better perf/dollar. Designed for agentic AI, MoE models, large-scale RL. Google designs TPUs to train Gemini \u2014 the enterprise benefits from silicon optimized by one of the world\u2019s most demanding ML workloads but does not direct the optimization priorities.",
          "dapm": "Ceded"
        },
        {
          "component": "NVIDIA GPUs on Google Cloud",
          "detail": "A5X bare-metal on Vera Rubin NVL72 (among first cloud providers to deploy). A4 Ultra (NVL72 preview Q2 2026), A3/A3 Mega/A3 Ultra. Fractional GPUs (G4 VMs, industry-first RTX PRO 6000 Blackwell vGPU). Scale: 80,000 Rubin GPUs single-site, 960,000 across multisite. Third-party models (Claude, Llama, Mistral) run on NVIDIA, not TPU \u2014 the playing field is structurally uneven.",
          "dapm": "Ceded"
        },
        {
          "component": "Virgo Network Fabric",
          "detail": "Purpose-built AI-optimized DC fabric. 134,000 TPU 8t chips connected at 47 Pb/s non-blocking bi-section bandwidth per DC. 4x bandwidth per accelerator, 40% lower unloaded latency vs prior gen. Also available for A5X. Designed for Gemini\u2019s training topology.",
          "dapm": "Ceded"
        },
        {
          "component": "Google Distributed Cloud (GDC)",
          "detail": "On-prem deployment of Google Cloud services. Connected and air-gapped configs. 4 racks to hundreds. NVIDIA Blackwell GPUs + Gemini Flash models on-prem. Managed GDC Provider initiative (Clarence, Gulf Energy, T-Systems, WWT). NATO deployment. Customer provides facility; Google provides and operates HW+SW. Addresses data sovereignty but not judgment sovereignty.",
          "dapm": "Delegated"
        }
      ]
    },
    {
      "id": "layer1a",
      "label": "Layer 1A",
      "title": "Data Storage & Governance",
      "purpose": "Durable, governed data foundation \u2014 the Governance Catalog that Layer 2C queries",
      "status": "strong",
      "statusLabel": "Ceded \u2014 Model-Powered Governance",
      "nvidia": [],
      "gap": "Knowledge Catalog with Smart Storage represents the most ambitious attempt to solve the metadata-to-agent grounding problem. No other vendor has a production system that automatically extracts business semantics from raw data, builds a context graph, and serves that context to agents in real-time with governance enforcement.\n\nThe recursive dependency is the central finding: Knowledge Catalog uses Gemini to perform semantic extraction and build the context graph. When Knowledge Catalog determines what context an agent receives, and that context graph was built by Gemini, the enterprise is consuming Gemini\u2019s judgment about what its own data means \u2014 at the governance layer, before any application-level inference occurs.\n\nNo other vendor has this pattern. Dell\u2019s MetadataIQ indexes deterministically. AWS\u2019s Glue doesn\u2019t use Nova to enrich its catalog. HPE\u2019s Ezmeral doesn\u2019t use a foundation model for data semantics. VAST\u2019s catalog is storage-native, not model-powered.\n\nThe 1A\u21922C connection: Knowledge Catalog is not a passive registry. It makes decisions about what context agents receive, how data assets are ranked for retrieval, and which governance policies apply. The context graph determines agent grounding \u2014 an orchestration function, not a storage function.\n\nGovernance gap: Knowledge Catalog is optimized for GCP. Third-party integrations federate INTO Knowledge Catalog, not out of it. An enterprise running PowerScale + S3 + GCS cannot use Knowledge Catalog as a federated surface across all three without making GCP the metadata authority.",
      "borrowedJudgment": "The context graph as borrowed judgment: when Knowledge Catalog builds entity relationships and business meanings, every agent that queries the graph inherits its representation of reality. If Gemini\u2019s semantic extraction misclassifies a data asset, every agent grounded in that context acts on the incorrect interpretation. Borrowed judgment at the governance layer \u2014 before application-level reasoning.\n\nSmart Storage as ingest-time judgment: a single Gemini classification at ingest (\u2018this document is about Project X\u2019) becomes a persistent governance fact. Limited mechanisms to audit or correct model-generated metadata at scale.\n\nComparison to AWS: AWS classifies 1A as Delegated because Lake Formation enforces customer-defined policies. Google\u2019s 1A is Ceded because Knowledge Catalog generates governance intelligence using Gemini. The enterprise on AWS retains governance judgment. The enterprise on Google inherits it.",
      "notes": "The Layer 1A / 2C boundary question: Knowledge Catalog\u2019s context graph is architecturally Layer 1A (data catalog) but functionally Layer 2C (determines agent grounding and context routing). Model-powered governance layers that make orchestration decisions are functionally 2C even when architecturally 1A.",
      "components": [
        {
          "component": "Cloud Storage + Rapid Tier (Colossus)",
          "detail": "Standard object storage at planetary scale. Rapid tier uses Colossus \u2014 Google\u2019s internal distributed storage platform (previously powering Search, Gmail, YouTube, Gemini training). Sub-millisecond read/write. The enterprise gets the same storage engine that holds Google\u2019s training data.",
          "dapm": "Ceded"
        },
        {
          "component": "Smart Storage",
          "detail": "Automatically analyzes unstructured data and generates metadata/context on ingest using Gemini. Auto-tags images, PDFs. First recursive point: Smart Storage uses Gemini to enrich the data that will eventually be served to Gemini-powered agents. The model enriches the data that feeds the model.",
          "dapm": "Ceded"
        },
        {
          "component": "Knowledge Catalog (Gemini-Powered)",
          "detail": "Universal business context and governance. Gemini-powered semantic extraction, entity relationship mapping, dynamic context graph construction. Sub-second semantic search for agent retrieval. Aggregates metadata across BigQuery, AlloyDB, Spanner, Cloud SQL, Firestore, Looker. Third-party integrations (Atlan, Collibra, Datahub). Enterprise Connectivity federates context from Salesforce, Palantir, Workday, SAP, ServiceNow. Column-level lineage (GA).",
          "dapm": "Ceded"
        },
        {
          "component": "BigQuery Storage",
          "detail": "Serverless columnar storage for structured/semi-structured. Managed Iceberg tables. Separates storage and compute. BigQuery spans storage, analytics, ML, and governance in a single service.",
          "dapm": "Ceded"
        }
      ]
    },
    {
      "id": "layer1b",
      "label": "Layer 1B",
      "title": "Context Management & Retrieval",
      "purpose": "Low-latency retrieval for RAG \u2014 vector/hybrid search, context windows",
      "status": "strong",
      "statusLabel": "Ceded \u2014 Model-Powered Prep",
      "nvidia": [],
      "gap": "No meaningful capability gap. Most mature Layer 1B in the assessment series. BigQuery ML eliminates data-to-model handoff. LookML Agent automates semantic model construction. Data Agent Kit enables agent-driven pipeline development.\n\nThe gap is governance over model-generated data artifacts. When LookML Agent generates a semantic model, when Data Agent Kit writes a pipeline, when BigQuery ML trains a model \u2014 who reviews the output for correctness? These are AI-generated artifact governance problems. Google provides no productized capability for governing model-generated data artifacts at scale.\n\nData Agent Kit\u2019s explicit support for Claude Code and non-Google tooling is strategically significant \u2014 the one point in the stack where third-party model access is genuinely equal.",
      "borrowedJudgment": "The semantic model as borrowed judgment: when LookML Agent generates definitions, every analytics query and agent interaction using those definitions inherits Gemini\u2019s interpretation of business logic. Powerful (automates weeks of manual semantic modeling) and risky (embeds model judgment in the analytical foundation).\n\nThe pipeline-building agent as borrowed judgment: Data Agent Kit agents write Dataflow jobs and BigQuery transformations. The enterprise inherits the agent\u2019s data engineering judgment \u2014 join strategies, filter logic, null handling. Previously human expertise, now model-generated.",
      "notes": "The comparison to AWS SageMaker Unified Studio: AWS provides a single governed environment across services (service-wide integration). Google achieves integration through BigQuery spanning storage, analytics, ML, and governance (service-deep integration). Google = tighter integration at cost of BigQuery lock-in. AWS = service diversity at cost of integration complexity.",
      "components": [
        {
          "component": "BigQuery ML",
          "detail": "In-database ML training and inference. Supports linear/logistic regression, K-means, time series, XGBoost, DNNs, imported TF/PyTorch models. Collapses the boundary between data preparation (1B) and AI runtime (2B) by running ML directly in the warehouse.",
          "dapm": "Ceded"
        },
        {
          "component": "Dataflow + Dataproc",
          "detail": "Managed Apache Beam (batch+stream) and managed Spark/Hadoop for large-scale processing. Open-source frameworks, Google-managed execution.",
          "dapm": "Ceded"
        },
        {
          "component": "LookML Agent (Gemini-Powered, Preview)",
          "detail": "Derives semantic models from documentation using Gemini \u2014 automates the business logic capture that traditionally requires manual data engineering. Second Gemini recursion point: Gemini interprets what data MEANS at the business logic layer, generates the semantic model, and agents query through that model. If the interpretation of \u2018revenue\u2019 is subtly wrong, every agent inherits the error.",
          "dapm": "Ceded"
        },
        {
          "component": "Data Agent Kit (Open-Source)",
          "detail": "MCP-based agents packaged as tools and skills. Supports Claude Code, Gemini CLI, Codex, VS Code. Enables intent-driven development: practitioners define goals, agents handle implementation. Creates governance recursion: the agent builds the pipeline that prepares the data that feeds the agent.",
          "dapm": "Delegated"
        },
        {
          "component": "Vertex AI Feature Store",
          "detail": "Managed feature serving for online/offline ML models and agents. Consistent feature serving across training and inference. 1B\u21922B bridge.",
          "dapm": "Ceded"
        }
      ]
    },
    {
      "id": "layer1c",
      "label": "Layer 1C",
      "title": "Data Movement & Pipelines",
      "purpose": "Move/transform data \u2014 ETL/ELT, lineage, cost-aware movement, KV cache tiering",
      "status": "strong",
      "statusLabel": "Ceded",
      "nvidia": [],
      "gap": "Cross-Cloud Lakehouse is the most strategically important Layer 1C capability in the assessment series. It represents a fundamentally different approach to data gravity: extend compute to wherever data sits rather than moving data to compute.\n\nThe DAPM implication: if Cross-Cloud Lakehouse delivers, the enterprise doesn\u2019t need to move data to GCP. That reduces data lock-in at storage. But it increases lock-in at the query layer \u2014 analytical capabilities depend on Google\u2019s query engine reaching across clouds. Data sovereignty improves. Analytical sovereignty does not.\n\nThe data movement / enrichment coupling: Smart Storage\u2019s Gemini enrichment at ingest means Layer 1C (movement) and Layer 1A (governance) are coupled through model inference. Moving files into Cloud Storage triggers model inference generating metadata that propagates into the context graph. No other vendor couples data movement with model-powered enrichment.\n\nCross-Cloud Lakehouse is in preview. Performance, cost, and governance characteristics at enterprise scale are unproven.",
      "borrowedJudgment": "Cross-Cloud Lakehouse as borrowed query optimization: when Google\u2019s engine optimizes a cross-cloud query, the enterprise inherits Google\u2019s optimization judgment. Opaque and unchallengeable \u2014 cannot tune the cross-cloud query plan or audit how Google\u2019s engine accesses data in a competing cloud provider\u2019s storage.\n\nThe enrichment coupling: data engineers moving files into Cloud Storage unknowingly trigger model inference. Convenient (automatic enrichment) and opaque (the engineer may not know Gemini is interpreting their data on arrival).",
      "notes": "The asymmetry between data plane federation (Cross-Cloud Lakehouse) and control plane federation (absent at 2C) is a structural finding. Google invests in making data accessible across clouds but not in making agent governance portable across clouds. Data accessibility without orchestration portability draws workloads toward GCP as the governance center.",
      "components": [
        {
          "component": "Cloud Storage Rapid Tier (Colossus)",
          "detail": "Sub-millisecond caching layer between persistent storage and compute. Previously internal-only, now customer-accessible.",
          "dapm": "Ceded"
        },
        {
          "component": "Managed Lustre",
          "detail": "10 TB/s bandwidth (10x YoY, claimed 20x faster than other hyperscalers), 80 PB capacity. RDMA-enabled. Training data movement layer between Cloud Storage and TPU/GPU clusters.",
          "dapm": "Ceded"
        },
        {
          "component": "Cross-Cloud Lakehouse (Preview)",
          "detail": "Agentic AI workflows access data across AWS and Azure without egress \u2014 querying data in place rather than copying. Eliminates ETL and cross-platform data movement costs. Extends compute to wherever data sits. Google\u2019s query engine becomes the universal data access layer regardless of physical data location.",
          "dapm": "Ceded"
        },
        {
          "component": "BigLake",
          "detail": "Unified data fabric across data lake (Cloud Storage) and warehouse (BigQuery). Single schema, multiple processing engines. Row/column-level governance via BigQuery Storage API across all access paths including open-source engines. Multi-cloud via BigQuery Omni.",
          "dapm": "Ceded"
        },
        {
          "component": "Smart Storage Ingest Enrichment",
          "detail": "Data entering Google Cloud is auto-enriched by Gemini with metadata and context tags. Data is not just moved \u2014 it is interpreted on arrival. No parallel in other vendors\u2019 data movement layers.",
          "dapm": "Ceded"
        }
      ]
    },
    {
      "id": "layer2a",
      "label": "Layer 2A",
      "title": "Infrastructure Orchestration",
      "purpose": "GPU scheduling, quotas, RBAC, fair-share scheduling, utilization optimization",
      "status": "strong",
      "statusLabel": "Delegated (GKE / K8s); GDC Ceded",
      "nvidia": [
        {
          "component": "NVIDIA GPU Operator (on GKE)",
          "detail": "Available for NVIDIA instances on GKE. Google manages the GPU integration layer."
        }
      ],
      "gap": "GKE is the most mature managed K8s for AI workloads. GKE Agent Sandbox has no equivalent in Dell, HPE, or AWS portfolios \u2014 300 sandboxes/second is built for agentic workload density.\n\nFluid Compute sits at the 2A/2C boundary. Its dynamic workload shifting is more than capacity acquisition \u2014 runtime decisions about which compute type serves which workload. But less than full 2C \u2014 routes on workload characteristics, not business context (data residency, compliance tags, cost targets). The Fluid Compute \u2192 Knowledge Catalog connection does not exist: workload placement does not consult governance metadata. Same Infrastructure Layer 2C gap every vendor has.\n\nGDC: the full sovereignty analysis reveals that data sovereignty \u2260 judgment sovereignty. Knowledge Catalog on GDC uses Gemini. Smart Storage on GDC uses Gemini. Agent Platform on GDC uses Google\u2019s runtime. The enterprise gains physical sovereignty but retains the same judgment concentration.\n\nThe \u2018self-driving cloud\u2019 narrative implies Gemini-powered infrastructure operations \u2014 autonomous root-cause analysis on infrastructure telemetry. If the Reasoning Plane is itself Gemini-powered, the operational intelligence and application intelligence are the same intelligence.",
      "borrowedJudgment": "GKE consumed via the standard Kubernetes interface: the enterprise's manifests and operators lift to any Kubernetes (Delegated — managed K8s service; the enterprise could switch without rebuilding). Google's optional Gemini-driven scheduling optimization is the only borrowed-judgment layer, and only if the enterprise opts in.\n\nGDC as borrowed judgment in sovereign packaging: physical control over facility, Google\u2019s judgment in software, model, governance, and operations. Data sovereignty with judgment concentration.\n\nFluid Compute as proto-2C borrowed judgment: when Fluid Compute routes agent work to CPU vs GPU, that routing is Google\u2019s judgment about optimal compute matching. Enterprise doesn\u2019t configure the routing policy.",
      "notes": "Data sovereignty vs judgment sovereignty: the 4+1 framework should distinguish between where data resides (which GDC addresses) and whose model\u2019s reasoning shapes the AI system (which GDC does not address). An enterprise running GDC air-gapped has data sovereignty while fully ceding judgment sovereignty.",
      "components": [
        {
          "component": "GKE + GKE Agent Sandbox",
          "detail": "Managed K8s with AI-era extensions. Agent Sandbox: gVisor-based secure isolation, 300 sandboxes/second/cluster with sub-second time to first instruction. Infrastructure built for the agentic era, not retrofitted. GKE is a managed service behind the standard Kubernetes API — manifests lift to another conformant cluster, so Delegated (operation delegated to Google; the standard interface keeps the opinions portable).",
          "dapm": "Delegated"
        },
        {
          "component": "Fluid Compute",
          "detail": "GCE + GKE dynamically shifting workloads in real-time. CPUs for branchy agent logic, secure sandboxes, RL, SLM inference, RAG. GPU/TPU for training and large-model inference. Proto-Layer 2C: routes based on workload characteristics, not business context.",
          "dapm": "Ceded"
        },
        {
          "component": "GDC (Sovereignty Analysis)",
          "detail": "On-prem Google Cloud services: GKE, Agent Platform, managed storage, Gemini Flash, Blackwell GPUs. Air-gapped for sensitive workloads. Addresses data sovereignty (where computation happens) but NOT judgment sovereignty (whose model drives computation). GDC runs Gemini on-prem \u2014 same recursive dependency, inside the enterprise perimeter.",
          "dapm": "Delegated"
        },
        {
          "component": "Capacity Management",
          "detail": "CUDs (1yr/3yr), on-demand, preemptible/spot, Dynamic Workload Scheduler. Same pattern as AWS \u2014 capacity acquisition (2A), not workload placement (2C).",
          "dapm": "Ceded"
        }
      ]
    },
    {
      "id": "layer2b",
      "label": "Layer 2B",
      "title": "Application Runtime & Execution",
      "purpose": "Model serving, agent execution, inference APIs, distributed inference",
      "status": "strong",
      "statusLabel": "Ceded \u2014 Model-Integrated Stack",
      "nvidia": [
        {
          "component": "NVIDIA GPU Instances",
          "detail": "A3/A3 Mega/A3 Ultra/A5X for third-party model inference. CUDA ecosystem required for non-Gemini models."
        }
      ],
      "gap": "Layer 2B is the center of gravity for the model-integrated stack. The model provider, runtime provider, and infrastructure provider are the same company. When the enterprise runs Gemini on Agent Platform on TPU, it borrows Google\u2019s judgment at the model layer, runtime layer, framework layer, and silicon layer simultaneously. A single entity\u2019s priorities shape the entire execution path.\n\nThe Agent Platform collapses Layers 2B, 2C, and 3 into a single product surface: Agent Runtime (2B infrastructure), Agent Identity/Gateway/Registry/Orchestration/Observability (2C governance), Agent Studio/ADK/Antigravity (Layer 3 development). The product boundary does not align with the architectural boundary.\n\nAWS separates these: Bedrock (model access) is distinct from AgentCore Runtime (agent execution) is distinct from AgentCore Policy (governance). AWS\u2019s separation preserves architectural boundaries the enterprise can independently govern. Google\u2019s collapse optimizes integration but prevents swapping the governance layer (2C) while keeping the runtime (2B).\n\nThe NVIDIA dependency at 2B is optional for Gemini (TPU-native) but required for third-party models. The enterprise using Claude on Google Cloud pays a structural performance tax \u2014 Claude runs on NVIDIA GPUs through a runtime designed for Gemini. Model Garden\u2019s 200+ models are API-equal but not silicon-equal.",
      "borrowedJudgment": "The model-integrated runtime: Gemini on TPU inherits Google\u2019s judgment at model layer (training data, alignment, safety), runtime layer (scheduling, scaling, session management), framework layer (JAX/Pathways optimization), and silicon layer (TPU architecture). Most concentrated borrowed judgment in the assessment series.\n\nThe open-source hedge: llm-d, TorchTPU, vLLM, ADK provide genuine open alternatives. Google opens components that reduce adoption friction (frameworks, SDKs) while keeping authority-concentrating components closed (Agent Runtime infrastructure, Agent Gateway, Pathways). Production deployment pulls open tools into Google\u2019s managed surface where authority shifts from Retained to Ceded.",
      "notes": "The 2B/2C collapse prevents the enterprise from independently governing the orchestration layer. An enterprise that wants Google\u2019s Agent Registry and Agent Identity but AWS\u2019s Bedrock for model access and its own governance engine for policy enforcement cannot compose that architecture. The components are bundled.",
      "components": [
        {
          "component": "Gemini Enterprise Agent Platform",
          "detail": "Unified platform for building, scaling, governing, optimizing agents. Subsumes all Vertex AI services. Agent Studio (low-code), ADK (code-first, Python/Go/Java/TypeScript, model-agnostic, open-source), Model Garden (200+ models incl Gemini, Claude, Llama, Gemma), Agent Runtime, Agent-to-Agent Orchestration, Agent Identity (GA), Agent Gateway, Agent Observability, Agent Registry, Memory Bank, Antigravity (desktop app + CLI).",
          "dapm": "Ceded"
        },
        {
          "component": "The House Model Advantage",
          "detail": "Gemini on TPU: silicon designed for this model, networking designed for its topology, distributed runtime (Pathways) built for its coordination, inference optimized for its architecture, governance (Knowledge Catalog) powered by it, orchestration defaults to it. No other vendor achieves this degree of vertical optimization. Third-party models (Claude, Llama) run on NVIDIA GPUs \u2014 supported but not co-optimized. The playing field is structurally tilted.",
          "dapm": "Ceded"
        },
        {
          "component": "Frameworks (JAX, Pathways, TorchTPU, vLLM)",
          "detail": "JAX: Google\u2019s ML framework optimized for TPU. Pathways: distributed runtime for superpod-scale training. TorchTPU: full PyTorch support on TPUs (concession to ecosystem). vLLM: optimized across GPU + TPU. llm-d: open-source K8s-native inference serving (multi-vendor project).",
          "dapm": "Ceded"
        }
      ]
    },
    {
      "id": "layer2c",
      "label": "Layer 2C",
      "title": "Agentic Infrastructure \u2014 The Reasoning Plane",
      "purpose": "Policy-driven placement and resource coordination \u2014 the Autonomy Layer",
      "status": "strong",
      "statusLabel": "Ceded \u2014 Productized but Captive",
      "nvidia": [
        {
          "component": "No NVIDIA Layer 2C Dependency",
          "detail": "All Layer 2C components are Google IP. NVIDIA does not control governance, policy, or reasoning in Google\u2019s stack."
        }
      ],
      "gap": "Google\u2019s Intelligence Layer 2C is the most complete productized offering in the assessment series: Agent Identity + Gateway + Registry + Orchestration + Observability + Memory Bank. Together they constitute a genuine control plane for agent governance.\n\nInfrastructure Layer 2C \u2014 the autonomous placement engine \u2014 is NOT built as a customer-configurable product. The capacity primitives (Fluid Compute, CUDs, DWS) are building blocks, but they don\u2019t compose into a policy-driven placement engine querying Knowledge Catalog governance metadata. Same gap as AWS and every other vendor.\n\nGoogle\u2019s implicit Layer 2C is the most sophisticated in the assessment: managed services make autonomous placement, scaling, routing, and capacity decisions invisibly. The enterprise cannot see, configure, audit, or override these decisions.\n\nThe model-integrated Reasoning Plane: if Google\u2019s \u2018self-driving cloud\u2019 uses Gemini for infrastructure decisions, then the model powering the enterprise\u2019s agents (Layer 3) is the same model governing agent orchestration (Intelligence 2C) is the same model deciding where agents run (Infrastructure 2C). One model\u2019s judgment pervades every decision surface.\n\nCross-cloud orchestration gap: Google federates the data plane (Cross-Cloud Lakehouse) but NOT the control plane. Agent Platform governs GCP agents only. Enterprise running agents across multiple clouds has no cross-platform agent governance surface \u2014 unless all agents route through Google\u2019s Agent Gateway, which cedes cross-cloud governance to Google.\n\nThe captive-but-best dilemma: this is evidence the control plane CAN be built as a coherent capability. The enterprise architect who wants it has one option: adopt Google Cloud. The federated alternative does not exist.",
      "borrowedJudgment": "The captive control plane: enterprise inherits Google\u2019s orchestration model \u2014 deterministic routing, Google-managed identity, Google-governed protocols. Well-engineered but unchallengeable \u2014 cannot substitute alternative orchestration logic within the Agent Platform boundary.\n\nThe model-powered control plane: if the Reasoning Plane uses Gemini for infrastructure decisions, a model judgment error at the control plane layer is invisible to the enterprise, with no fallback to human decision-making.\n\nIntelligence 2C: Low borrowed judgment in the sense that the components are productized and configurable. High borrowed judgment in the sense that the governance logic itself (Agent Gateway protocol decisions, Agent Identity authentication model, Orchestration routing patterns) is Google\u2019s, not the enterprise\u2019s.",
      "notes": "Google\u2019s 2C proves the Control Plane Working Notes thesis: the control plane can be built. The question is whether it can be liberated from the vendor boundary \u2014 and whether the model-integrated dimension (control plane powered by the same model it governs) is a pattern to replicate or to avoid.\n\nThe asymmetry: data plane federates (Cross-Cloud Lakehouse), control plane does not. This serves Google\u2019s strategic interest \u2014 data accessibility without orchestration portability draws workloads toward GCP as governance center.",
      "components": [
        {
          "component": "Agent Identity (GA)",
          "detail": "Agents as identity principals with authentication, authorization, audit. Control plane function: determines which agents exist as governed entities.",
          "dapm": "Ceded"
        },
        {
          "component": "Agent Gateway",
          "detail": "Protocol-level governance for MCP and A2A communications. Security partner integrations (Broadcom, Check Point, Cisco, CrowdStrike, F5, Netskope, Okta, Palo Alto, Zscaler). Spans Layer 0 (networking), 2B (runtime), and 2C (orchestration).",
          "dapm": "Ceded"
        },
        {
          "component": "Agent-to-Agent Orchestration",
          "detail": "Deterministic multi-agent workflow routing. Control plane function: determines which agent handles which subtask.",
          "dapm": "Ceded"
        },
        {
          "component": "Agent Registry",
          "detail": "Catalog of agents with ownership, capabilities, protocols, invocation details. Administrator-controlled discoverability. Control plane function: determines which agents are available and who can use them.",
          "dapm": "Ceded"
        },
        {
          "component": "Agent Observability",
          "detail": "Monitoring, tracing, debugging across production agent populations. Feedback loop for detecting faulty reasoning and intervening.",
          "dapm": "Ceded"
        }
      ]
    },
    {
      "id": "layer3",
      "label": "Layer 3 (+1)",
      "title": "AI Application Layer \u2014 The Value Plane",
      "purpose": "AI-powered business capabilities \u2014 business logic, workflow automation",
      "status": "strong",
      "statusLabel": "Open Model Layer, Captive Platform",
      "nvidia": [
        {
          "component": "NVIDIA Models via Model Garden",
          "detail": "NVIDIA Nemotron and other NVIDIA models available alongside all other providers."
        }
      ],
      "gap": "No meaningful capability gap. Broadest model catalog. Most portable agent development framework (ADK). Application surfaces from no-code through code-first.\n\nGoogle deliberately keeps Layer 3 more open than any other layer \u2014 while ensuring every Layer 3 application is gravitationally pulled toward Agent Platform (2B/2C). By keeping Layer 3 open, Google maximizes platform adoption: enterprises wanting Claude on Google Cloud still consume Agent Platform\u2019s runtime, identity, gateway, registry, observability. The model is portable; the platform is captive.\n\nConsistent with the 4+1 model\u2019s prediction that vendor lock-in concentrates at Layer 2B/2C, not Layer 3. Google has understood this prediction and built strategy accordingly.\n\nCode portability vs operational portability: ADK is open-source and model-agnostic \u2014 agent code CAN run on AWS or on-prem K8s. But Agent Registry, Memory Bank, Agent Identity, Agent Gateway, Agent Observability are Google Cloud services with no portable equivalents. Agent code is an asset the enterprise owns. Agent operations are an asset it rents.\n\nAntigravity 2.0 deepens the Layer 3 gravitational pull toward Google's platform. The desktop app + CLI + SDK creates a development surface that integrates directly with Agent Platform (2B/2C): agents built in Antigravity inherit Agent Platform's identity, gateway, registry, and observability. The Gemini CLI deprecation (June 18, 2026) forces migration to Antigravity CLI \u2014 consolidating Google's developer AI surface into one opinionated platform. The SDK enabling custom agent templates in AI Studio means Antigravity is not just a coding tool but an agent construction platform that feeds directly into the Gemini Enterprise Agent Platform.\n\nCompare to AWS Kiro (spec-driven, methodology-opinionated, Bedrock-native) and GitHub Copilot (IDE-embedded, multi-model, GitHub-native). Google's differentiator is multi-agent parallel orchestration \u2014 Antigravity coordinates multiple agents simultaneously rather than single-agent sequential interaction. This maps to the 4+1 model's Layer 2C vision: orchestrating multiple agents is a control plane function that Antigravity surfaces through a developer tool.\n\nThe consumer-enterprise feedback loop extends to Antigravity: Google is using Antigravity's capabilities in consumer Search to generate real-time custom UIs as part of search answers. Developer tool innovations flow to consumer products and back \u2014 a flywheel no other vendor in the assessment possesses.",
      "borrowedJudgment": "Gemini as borrowed Layer 3 judgment: alignment changes affect agents (Layer 3), governance enrichment (Layer 1A via Knowledge Catalog), semantic models (Layer 1B via LookML Agent), and potentially infrastructure operations (Layer 2C via self-driving cloud). A single alignment decision propagates across the entire model-integrated stack.\n\nPlatform defaults: Agent Studio and Agent Designer default to Gemini. Enterprise that adopts without explicitly selecting alternatives inherits Google\u2019s model preference as a default rather than a decision.\n\nStrategic openness as borrowed judgment about lock-in location: Google\u2019s decision to keep Layer 3 open and concentrate lock-in at 2B/2C is itself borrowed judgment the enterprise inherits. Evaluating Google on model diversity without evaluating platform captivity accepts Google\u2019s framing of where portability matters.",
      "notes": "The consumer-enterprise feedback loop has no parallel in the assessment. Model improvements from billions of consumer interactions benefit enterprise workloads \u2014 but consumer-driven alignment may constrain enterprise use cases. If Google tightens content policies for consumer safety, enterprise agents inherit that tightening.\n\nThe Gemini CLI \u2192 Antigravity CLI forced migration is a significant authority move. Over 100,000 GitHub stars on Gemini CLI \u2014 all those developers must migrate to Antigravity by June 18, 2026. This concentrates Google's developer AI surface into one platform and one billing model (AI Ultra at $100/month). The deprecation timeline is aggressive but consistent with Google's pattern of consolidating developer tools around Gemini.\n\nAntigravity 2.0's scheduled tasks capability (agents running automatically in the background) converts the developer tool from a single-turn interaction to a persistent automation pipeline. This blurs the boundary between Layer 3 (application) and Layer 2C (orchestration) \u2014 when Antigravity schedules background agents to perform tasks autonomously, who governs those agents? The answer is Agent Platform \u2014 reinforcing the Layer 2C gravitational pull.",
      "components": [
        {
          "component": "Gemini Model Family",
          "detail": "Gemini 3.1 Pro, Gemini 3.5, Gemini Flash. The model the entire stack was designed around. Gemma open-weight models for self-hosting (the one offering where enterprise can Retain model authority).",
          "dapm": "Ceded"
        },
        {
          "component": "Model Garden (200+ Models)",
          "detail": "Gemini, Claude Opus/Sonnet/Haiku, Meta Llama, Gemma, open-source models. Model Evaluation service. Broadest model catalog of any cloud provider. Model-agnostic claim genuine at Layer 3 \u2014 more so than any other layer.",
          "dapm": "Delegated"
        },
        {
          "component": "Application Surfaces",
          "detail": "Agent Studio (low-code), Agent Designer (no-code in Gemini Enterprise app), ADK (code-first, open-source, model-agnostic). Gemini Enterprise app for agent discovery and the Deep Research agent.",
          "dapm": "Retained / Delegated"
        },
        {
          "component": "Consumer-Enterprise Feedback Loop",
          "detail": "Gemini powers Google Search, Gmail, Docs, Photos, Android, Chrome. Workspace Intelligence uses Gemini for agentic work. Model improvements from billions of consumer interactions directly benefit enterprise workloads. But: consumer-driven alignment and safety tuning may not align with enterprise needs.",
          "dapm": "Ceded"
        },
        {
          "component": "Google Antigravity 2.0 (Agent-First Development Platform)",
          "detail": "Announced I/O 2026. Standalone desktop app + CLI + SDK \u2014 a full developer platform built around agent orchestration. Multi-agent parallel execution: orchestrate multiple agents and execute tasks simultaneously. Dynamic subagent workflows and scheduled background automation. Antigravity CLI (Go-based, replacing Gemini CLI \u2014 deprecated June 18, 2026) for terminal-native multi-agent workflows. Antigravity SDK for building custom agents with templates in AI Studio. Powered by Gemini 3.5 Flash (co-developed using Antigravity). Native voice command support. Ecosystem integrations: Google AI Studio, Android, Firebase. Export tool for AI Studio \u2192 local development. Search integration: real-time custom UI generation within Google Search answers. AI Ultra plan ($100/month, 5x usage limits). Google's most aggressive move in the agentic coding market \u2014 positioned as the hub for multi-agent development workflow orchestration, not just code assistance.",
          "dapm": "Ceded"
        }
      ]
    }
  ]
}