Agent Proof Protocol

The Agent Proof Protocol (APP) v2.4

The Universal Execution Standard for the AI Agent Economy. Unifying Global Financial Rails, MCP, WebMCP, Unix Pipelines, Logistics & Labor Marketplace APIs, and Cryptographic Trust to enable AI Agents to manage operations in the Physical World.

Abstract

As Artificial Intelligence evolves from passive digital tools into autonomous economic actors, the role of human labor is undergoing a fundamental transformation. The next phase of the digital economy will not replace humans; it will re-platform them. While software agents can now process information at an infinite scale, they remain isolated from the physical world—incapable of opening a door, repairing a circuit, or verifying a delivery without human hands.

The current landscape is a “Tower of Babel,” a fragmented ecosystem of incompatible labor marketplaces and opaque commerce rails. This prevents autonomous systems from efficiently sourcing human execution. More critically, the diversity of Agent Runtimes—ranging from Cloud-hosted LLMs to Browser-based assistants and Local-First CLI agents—lacks a unified standard for physical interaction.

The Agent Proof Protocol (APP) v2.4 establishes the Universal Settlement Layer required to bridge this divide. It functions as a neutral middleware that wraps existing liquidity pools and commerce merchants in a military-grade verification layer. Crucially, v2.4 introduces the “Tri-State Architecture,” allowing Agents to interact with the physical world via their native environment: Model Context Protocol (MCP) for cloud systems, WebMCP for browsers, and Unix Standard Streams for local runtimes.

The protocol defines the PhysicalTask standard—a unified Python interface that aggregates disparate APIs into a single, audit-ready command.

By coupling “Execution” (via Partner Networks) with “Verification” (via APP's Wi-Fi RTT & Bio-Liveness Proofs), APP creates a trust-minimized economic bridge. It mandates a “Trust, Then Transact” workflow, allowing AI Agents to buy goods, hire skilled professionals, and crowdsource human sensors with cryptographic certainty.

This infrastructure is the structural necessity for the Agentic Economy to take hold, ensuring that as Agents scale, they become the largest employers of human labor in history.

1. Introduction: The Agentic Transition

1.1 From Information Retrieval to Action Execution

The trajectory of the digital economy has shifted from the optimization of information retrieval (Google) to the optimization of action execution (Agents). We are witnessing the dawn of the “Agentic Era,” where the primary locus of value moves from retrieving information to acting upon it.

By 2030, “Machine Customers”—autonomous software programs authorized to purchase goods and services—will account for upwards of 20% of all digital commerce revenue. This represents a fundamental restructuring of global demand, shifting from a human-centric consumption model to one driven significantly by algorithmic needs.

1.2 The Adoption Ceiling: The Runtime & Trust Gap

The adoption of Agentic AI is currently throttled by two distinct friction points: Lack of Ground Truth and Runtime Fragmentation.

The Trust Gap: An Agent can hire 1,000 logistics drivers in milliseconds via direct API injection. However, without a physical verification layer (Wi-Fi RTT, Bio-Liveness), this creates a “High-Frequency Fraud” vector.

The Runtime Gap: The agent ecosystem is bifurcating. While some agents operate in the cloud (using APIs), a significant portion of the developer community—driven by privacy and latency concerns—is moving toward “Sovereign Agents” running locally on consumer hardware. These local agents prefer composable, terminal-based tools over complex HTTP integrations.

The “Agent Economy” cannot scale if it forces local agents into cloud silos or subjects cloud agents to blind execution. APP v2.4 resolves this by standardizing the Intent, regardless of the Interface.

1.3 The APP Mandate: The Universal Settlement Standard

APP is designed to solve this liability crisis. It is built upon a singular, immutable directive: Demand is Digital, Execution is Hybrid, Verification is Mandatory.

APP is not a gig platform; it is a settlement protocol. Just as global payment networks standardize financial transmission regardless of the issuing bank, APP standardizes PhysicalTask execution regardless of the labor provider (Gig Worker or Merchant). It functions as a Universal Middleware, translating the intent of an AI agent into a cryptographically verified work order.

2. Problem Statement: The Trust & Fragmentation Gap

2.1 The Fragmentation of Liquidity (The Tower of Babel)

The global labor and commerce market is fractured into deep, isolated silos.

  • Logistics Silos: An Agent cannot easily route a package from a national carrier depot to a local on-demand courier.
  • Skill Silos: A certified electrician on an enterprise workforce platform is invisible to a homeowner Agent looking on a consumer services app.
  • Identity Silos: A worker's reputation is locked within specific walled gardens.

For an AI agent, this fragmentation is catastrophic. To execute a complex task, an agent would need to maintain integrations with fifty distinct, incompatible APIs.

2.2 The Trust Deficit (Blind Execution)

Beyond API incompatibility, traditional marketplaces fail the “Machine Customer” test.

  • The Verification Gap: Legacy platforms rely on the worker clicking “I'm Done.” For an AI Agent managing a million-dollar real estate portfolio, this self-reporting is a security vulnerability.
  • The Settlement Gap: Money is typically released upon digital signal, not physical proof.

2.3 The Solution: Trust-Wrapped Execution

The industry requires a unified middleware—a “Trust Wrapper” that sits between the AI Agent and the fragmented supply chain.

  • Aggregate: Access mobility networks, skilled labor marketplaces, and The Human Grid through one API.
  • Verify: Enforce “Proof of Physics” (Wi-Fi RTT, Geofencing, Computer Vision) before settlement.
  • Settle: Release funds only when Execution matches Verification.

3. The Economic Thesis: A Bifurcated Supply Chain

APP recognizes that labor needs differ by geography and complexity. We employ a “Two-Tier” supply strategy.

3.1 Tier 1: The Partner Network (“The Pros”)

  • Geography: Developed Markets (US, EU, UK).
  • Source: Enterprise Workforce Platforms & On-Demand Labor Marketplaces.
  • Role: High-Skilled & Certified Execution.
  • Examples: HVAC repair, commercial landscaping, snow removal, server maintenance, licensed logistics, and facility inspections.
  • The APP Value: The Audit. We provide the “Digital Punch Card.” We audit the professional to ensure the AI Agent gets exactly what it paid for, mitigating liability and fraud for high-value tasks.

3.2 Tier 2: The Human Grid (“The People”)

  • Geography: Emerging Markets & Global Urban Centers.
  • Source: Embedded Supply SDKs (Host Partners) & The Reference Validator Network.
  • Role: Verification, Micro-Logistics, and Infrastructure Sensing.
  • The APP Value: Access & Aggregation. We connect AI Agents to fragmented local delivery and sensor networks that currently lack standard APIs, creating a unified mesh of human sensors for lightweight tasks.

3.3 The Distribution Strategy: Embedded Supply Aggregation

Establishing a new two-sided marketplace typically suffers from the “Cold Start Problem”—the inability to fulfill demand due to a lack of liquidity. APP circumvents this by architectural design, utilizing an Embedded SDK model to aggregate existing supply rather than recruiting individual workers.

  • The Host Partner Model: The Protocol Ecosystem integrates with high-frequency “Host Applications” (e.g., Fintech Wallets, Telco Super-Apps, Hyper-Casual Games, and Ride-Sharing platforms) that already possess high Daily Active Users (DAU) in target geolocations.
  • The Integration: These partners integrate the AgentProof SDK, effectively transforming their user base into latent “Grid Nodes.”
  • The Incentive: The Protocol creates a programmatic revenue-sharing stream for Host Partners. By monetizing their dormant user attention through physical verification tasks, Host Apps generate a new perpetual revenue line, incentivizing them to maintain the SDK and prioritize APP notifications.
  • Strategic Advantage: This transforms the protocol from a consumer app competing for attention into a Monetization Infrastructure for existing platforms, allowing APP to instantly access millions of potential sensors in high-density urban centers.

4. Technical Architecture: The PhysicalTask Standard

The core innovation of APP is the PhysicalTask interface. This is the “HTTP” of the physical world—a boring, reliable standard that every Agent uses to interact with reality.

4.1 The Input Schema (JSON)

The protocol enforces a strict, strongly-typed JSON schema for all inbound requests. This schema is the “Lingua Franca” of the Agentic economy.

Table 1: The APP Request Structure
FieldTypeDescriptionConstraint
intentStringThe high-level goal of the agent.Enum: RESOLVE_OUTAGE, VERIFY_STATE, etc.
primitiveEnumThe specific Labor Primitive required.Must be one of: VERIFY, FIX, MAINTAIN, MOVE, THINK
sub_typeStringDomain-specific skill tag.e.g., HVAC_REPAIR, DATA_LABELING
dependenciesArrayList of parent Task Hashes that must complete first.Optional. Array of Strings (SHA-256 Hashes).
human_verificationBooleanStrict flag for biological necessity.Must be true for APP routing.
privacy_modeEnumProtocol for data handling.PUBLIC (Default), ANONYMIZED (Blur), EPHEMERAL (No store).
location_proofObjectRequirements for DePIN verification.e.g., {"method": "WIFI_RTT", "accuracy": "1m"}
payment_tokenStringSecured Token for Settlement.Mastercard Agent Pay / Visa Token.
payloadObjectTask-specific parameters.Dynamic based on primitive.
payment_railStringSettlement method.Default: L402_LIGHTNING
max_budgetIntegerPayment cap in Satoshis or Stablecoin.Positive Integer.
sandboxBooleanSimulation Mode flag.If true, returns mocked cryptographic proofs for CI/CD testing. Costs 0.

Universal Compliance: This schema is designed for 1:1 mapping across all three supported runtimes:

  • Web: Maps directly to navigator.modelContext.registerTool() specifications.
  • Cloud: Maps to MCP Tool Resource definitions (mcp-server).
  • Local: Maps to POSIX-compliant CLI flags (e.g., intent becomes --intent, budget becomes --budget).

4.2 The Five Labor Primitives

APP categorizes all physical labor and commerce verification into five semantic primitives. To reduce the complexity of the physical world into machine-manageable units, APP v2.1 categorizes all human labor into five semantic primitives. These are the “atomic units” of the protocol.

4.2.1 Primitive I: VERIFY (“The Eyes”)

  • Definition: A request for ground truth.
  • Use Case: An AI Hedge Fund analyzes satellite data suggesting a Store parking lot is full. To confirm this isn't a spoof, it dispatches a VERIFY primitive.
  • Technical Necessity: Digital data can be hallucinated. Physical presence is the only source of absolute truth.
  • DePIN Integration: This primitive relies heavily on Proof of Location (see Section 5). The protocol demands a “Witness Chain” attestation where the worker's device proves its location via Wi-Fi RTT relative to trusted anchors.

4.2.2 Primitive II: FIX (“The Hands - Consumer”)

  • Definition: Residential manual labor and unskilled assembly.
  • Use Case: An Airbnb management agent detects a “Smart Lock Offline” error. It broadcasts a FIX primitive.
  • Safeguard: Enforces Proof of Attendance (Wi-Fi pings) to prevent billing fraud.

4.2.3 Primitive III: MAINTAIN (“The Hands - Enterprise”)

  • Definition: Certified, skilled technical labor requiring licensure.
  • Use Case: An Enterprise AI detects a cooling failure in a data center.
  • The “Blue Collar” Firewall: Unlike FIX, the MAINTAIN primitive enforces a “Proof of License” check. The worker must have a verified credential (e.g., an electrician's license hash) stored in their identity wallet. Enforces Chain of Custody and Proof of License.

4.2.4 Primitive IV: MOVE (“The Wheels”)

  • Definition: Logistics, transport, and courier services.
  • Use Case: An AI personal assistant negotiates the purchase of a vintage lamp and needs it moved from Seller A to Buyer B.
  • Safeguard: The protocol handles the “handoff” logic. It generates a unique QR code (the “Package Passport”) that the human driver scans at pickup and dropoff, linking the digital transaction to the physical movement.

4.2.5 Primitive V: THINK (“The Brain”)

  • Definition: Human-in-the-Loop (HITL) cognitive review, subjective judgment, and RLHF.
  • Use Case: A Legal AI drafts a contract but hits a low confidence score on a specific clause. It triggers a THINK primitive for human review.
  • The RLHF Lock: This is the recursive loop of the AI economy. Humans are paid to “teach” the agents via the THINK primitive.
  • Safeguard: Enforces Consensus Verification (majority vote).

4.3 Access Control Determines the Verifier

Agent Proof Protocol splits the VERIFY primitive into two distinct security contexts to handle both public data and private commerce.

Sub-TypeContextVerifierExample
VERIFY_PUBLICPublic Domain / ExteriorThe Human Grid“Check if the ATM is broken.” “Is the road blocked?”
VERIFY_CUSTODYPrivate Domain / InteriorVendor / Merchant“Prove the cake is spelled correctly.” “Prove the package is sealed.”
VERIFY_CHAINHandoff PointLogistics Driver“Photo of package at doorstep.”

The Commerce Workflow (“The Smart Handshake”): In private commerce (e.g., ordering a birthday cake), the AI Agent does not hire a stranger. Instead, it enforces a “Digital Gate” on the Merchant using VERIFY_CUSTODY.

  • The Lock: Agent orders but withholds final payment/pickup signal.
  • The Proof: Vendor receives a secure link, snaps a Geofenced photo of the item via browser (no app needed).
  • The Release: Agent's Computer Vision validates the item. Funds release instantly.

4.4 The Human Grid Fallback Protocol (Handling the Skill Gap)

In regions lacking specialized labor marketplaces (e.g., Field Nation, TaskRabbit), APP v2.3 utilizes the “Human Grid”—a distributed network of casual users embedded via partner apps. Since Grid Nodes (users) are often unskilled, the protocol enforces a Dynamic Primitive Downgrade to ensure task completion without liability.

If a FIX or MAINTAIN request cannot be routed to a Certified Partner (Tier 1) within the target TTL (Time-To-Live), the Protocol automatically downgrades the request logic:

4.4.1 The Downgrade (FIX → VERIFY)

  • Trigger: No certified technician found in the target geolocation (e.g., Accra, Ghana) within 10 minutes.
  • Action: The Agent changes the Primitive from FIX (Independent Execution) to VERIFY (Guided Presence).
  • The Logic: The AI Agent assumes the role of the “Brain” and recruits a Human Grid Node to act solely as the “Hands/Eyes.”

4.4.2 A2UI “Remote Avatar” Mode (Guided Execution)

Interface: The Human Grid Node receives a declarative, step-by-step A2UI stream.

Workflow: Instead of a broad command like “Fix the Generator,” the user sees:

  • Screen 1: “Walk to the Generator.” (Verified via Geofence)
  • Screen 2: “Point Camera at the Fuel Gauge.” (Verified via Computer Vision)
  • Screen 3: “Turn the Red Valve 90 degrees clockwise.” (Verified via Real-time Video Stream)

Settlement: The user is paid for Compliance, not Expertise. The liability for the repair remains with the AI Agent's instruction set, not the human's skill.

4.4.3 The Human Grid Priority Router

The Router prioritizes execution in this order:

  1. Tier 1 (Certified): Enterprise APIs (e.g., Field Nation, Uber Direct).
  2. Tier 2 (Verified Nodes): Users operating on the Reference Implementation Standard (e.g., The Official Validator App).
  3. Tier 3 (Partner Grid): Third-party applications integrating the Supply SDK (e.g., Fintech/Gaming partners).

4.5 Vertical Integration: The Tri-State Integration Strategy

To eliminate the “Integration Tax,” APP v2.4 natively supports the three dominant runtimes for AI Agents. We do not force developers to adapt to the Protocol; the Protocol adapts to the Agent's native environment.

4.5.1 Cloud-Native: The Model Context Protocol (MCP)

  • Target: Enterprise & Cloud-Hosted Agents (e.g., Claude Desktop, CrewAI, AutoGen).
  • Mechanism: The APP Translation Engine functions as a compliant MCP Server. It exposes the five Labor Primitives as standardized “Resources” and “Tools” that can be mounted into any MCP host.
  • Benefit: Plug-and-play discovery. A cloud agent can “discover” the ability to hire a technician simply by connecting to the APP MCP endpoint.

4.5.2 Browser-Native: Web Model Context Protocol (WebMCP)

  • Target: Browser Agents & Assistants (e.g., Chrome/Gemini Nano).
  • Mechanism: APP supports navigator.modelContext. When an Agent visits an APP-enabled dashboard, it detects the PhysicalTask tool definition injected into the DOM.
  • Benefit: Zero-friction dispatch. An Agent browsing a logistics provider's site can detect the APP tool and execute a verified dispatch without leaving the browser context.

4.5.3 Local-Native: The Unix Standard (CLI)

  • Target: Sovereign & Terminal-Based Agents (e.g., OpenClaw, Local LLaMA, Python Scripts).
  • Mechanism: The Standard Streams Pipeline (JSON-over-Pipe). For agents that eschew servers, APP provides a standalone binary (agent-proof). This tool accepts task parameters via standard flags (argv) or stdin and outputs structured, parseable JSON to stdout.
  • Benefit: Composable autonomy. This allows local agents to utilize the “Unix Philosophy”—piping the output of a reasoning chain directly into a physical execution command without managing webhooks or API keys.

4.6 The Interface Layer: A2UI and Declarative Rendering

APP adopts the Agent-to-User Interface (A2UI) standard. Instead of sending static text, the protocol streams a secure, JSON-based UI blueprint describing the task.

WebMCP Integration: For browser-based workers, these A2UI components are delivered via the WebMCP Declarative API, allowing the worker's browser to render native inputs (e.g., “Take Photo of Meter”) without requiring a dedicated app installation.

4.7 Orchestration: Molecular Intents & Compound Workflows

Real-world operations rarely fit into a single atomic action. A “Kitchen Renovation” is not a single FIX primitive; it is a sequenced chain of VERIFY (measurements), MOVE (delivery), FIX (installation), and VERIFY (inspection). If the Protocol only supported atomic tasks, it would merely be a job board. To support a global economy, it must function as a Project Management Engine.

4.7.1 The “General Contractor” Model

APP acknowledges that the Protocol does not “know” what a house is; it only knows the 1,000 steps required to build one. Therefore, the Agent acts as the General Contractor, and the Protocol enables the dependency logic. We introduce Recursive Orchestration, allowing Agents to submit Directed Acyclic Graphs (DAGs) of tasks. In this model, downstream primitives are cryptographically locked until upstream prerequisites are met.

4.7.2 The Dependency Array (The Logic)

By utilizing the dependencies field in the Request Schema, Agents create “Molecular Intents.”

  • Blocking State: The Protocol holds a dependent request (e.g., “Install Sink”) in a PENDING_DEPENDENCY state. It is not broadcast to the Human Grid until the parent tasks (e.g., “Deliver Sink”) emit a STATUS: COMPLETED event.
  • Context Injection: The output of Task A (e.g., a photo of the delivered materials) is automatically injected as the context_payload for Task B (the installer). This ensures the worker has the full history without the Agent needing to re-upload data.

4.7.3 The “Manager” Agent (Recursive Hierarchy)

For hyper-complex tasks involving dozens of steps, the Protocol allows an Agent to hire a Human Project Manager via a high-level THINK primitive.

  • Use Case: “Manage this renovation.”
  • Mechanism: The Agent triggers THINK: MANAGEMENT. The selected human is granted limited authority (via a scoped API key) to issue sub-primitives (e.g., approve a FIX milestone) within the Agent's budget, effectively keeping the “Human-in-the-Loop” for subjective quality control.

5. The Trust & Verification Layer: DePIN

In the Agentic Economy, money is easy; truth is hard. APP introduces “Proof of Physics” as a prerequisite for settlement.

5.1 Proof of Location: Wi-Fi RTT (802.11mc)

GPS is easily spoofed. APP mandates Wi-Fi Round Trip Time (RTT) for Trust Level 1 verification.

  • The Physics: The device sends a burst of packets to nearby Wi-Fi Access Points. It measures the time of flight (d = c * t/2).
  • The Security: To spoof this, a bot would need to alter the speed of light or hack the firmware of the physical router on-site.

5.2 The Privacy Firewall

  • On-Device Anonymization: The Protocol Client processes images locally using Edge AI. Faces and personal data are blurred before the image leaves the phone.
  • Ephemeral Verification: For sensitive checks, the image is analyzed locally (e.g., “Is the door open? -> YES”), hashed, and deleted. Only the cryptographic proof is uploaded.

5.3 Protocol Integrity: Configurable Settlement Layer

To prevent “Supply Chain” risks and ensure compliance with Enterprise Data Residency requirements, APP v2.4 enforces a Zero-Trust Egress Policy.

  • Configurable Endpoints: The Protocol does not hardcode the settlement destination. Enterprise Agents can configure the base_url to point to self-hosted APP Instances or Private Cloud Gateways, ensuring that sensitive task data never leaves the corporate VPC.
  • Audit Logging: All PhysicalTask emissions generate a local cryptographic hash (SHA-256) of the intent and payload before transmission, allowing Security Operations Centers (SOC) to audit exactly what the Agent requested versus what was executed.

6. The Financial Unification Layer

The global payments industry has made extraordinary advancements in Digital Authorization. Leading protocols now allow AI Agents to securely hold funds, prove identity, and browse inventory. However, a critical gap remains in Physical Settlement. While an Agent can authorize a payment digitally, it currently lacks the sensory input to know if the physical service was actually rendered.

APP acts as this missing “Physical Trigger.” We provide the cryptographic Ground Truth that Visa, Mastercard, and Stripe require to release funds safely, transforming a digital promise into a verified physical transaction.

6.1 Identity Layer: Visa Trusted Agent Protocol (TAP)

The Function: Visa TAP solves the “Who” problem. It provides a universal standard for Agents to cryptographically identify themselves, distinguishing legitimate AI shoppers from malicious bots.

The APP Integration (via WebMCP): When an Agent invokes a tool via WebMCP (e.g., navigator.modelContext.execute('PhysicalTask', payload)), the Protocol mandates the inclusion of the Visa TAP Identity Certificate within the metadata header. APP validates this certificate before processing the request. If the Agent lacks a valid Visa TAP signature, the WebMCP call is rejected at the browser edge, preventing “Agent Spam” and DDoS attacks on the physical supply chain.

6.2 Tokenization Layer: Mastercard Agent Pay

The Function: Mastercard Agent Pay solves the “Security” problem. It allows Agents to issue “Agentic Tokens” restricted to specific contexts (e.g., “One-time use, max $50”), ensuring the underlying funding source is never exposed.

The APP Integration: APP accepts these Agentic Tokens as valid proof of funds. Crucially, in a WebMCP context, the Agent passes the payment_token as a secured string parameter within the PhysicalTask.create() payload. This allows the APP Middleware to lock the funds instantly upon request generation, while the Agent remains cryptographically isolated from the transaction risk.

6.3 Commerce Layer: Stripe Agentic Commerce

The Function: Stripe's Agentic Commerce Suite solves the “Inventory” problem. It allows merchants to syndicate their product catalogs and pricing directly to AI Agents via API.

The APP Integration: APP serves as the Logistics Bridge for these transactions. When an Agent buys a product via Stripe, it uses APP's MOVE primitive to hire a courier for the “Last Mile” delivery. APP updates the Stripe payment intent with the “Proof of Delivery” (QR Scan), triggering the final capture of funds.

6.4 Policy Layer: Payman.ai

The Function: Payman solves the “Budget” problem. It provides a sandbox where human owners set strict “Guardrails” (e.g., “$500/week limit”) on Agent spending.

The APP Integration: APP acts as the Oracle for Payman policies. If a Payman policy states “Release funds only upon verified repair,” APP provides the cryptographic proof (Wi-Fi RTT logs) that satisfies the policy condition and unlocks the wallet.

6.5 Web3 Layer: Coinbase AgentKit & x402

The Function: For Agents operating outside traditional banking, Coinbase AgentKit provides the standard for on-chain wallet management, while the x402 (L402) protocol enables HTTP-native micropayments.

The APP Integration: APP is fully compatible with AgentKit, enabling “Streaming Settlement.” For time-based tasks (like MAINTAIN), APP streams satoshis or USDC every second, contingent on the worker's continued physical presence.

6.6 The “Physical Trigger” (APP's Unique Role)

The critical innovation is that APP creates the conditional logic for money movement.

  • The Stack: Visa, Mastercard, Stripe, and Payman allow the Agent to promise payment.
  • The Trigger: APP ensures the Agent delivers payment only when reality matches the promise.
  • Mechanism: The Settlement Gateway holds the authorized funds (Fiat or Crypto). It listens for the Proof of Physics (Bio-Liveness, Geo-Fencing, RTT). Once the physical world state is verified, APP executes the final settlement via the chosen rail, closing the loop between digital intent and physical reality.

7. Implementation Guide: The “First Principles” Workflow

To illustrate the power of Compound Intents, consider a high-complexity humanitarian scenario: “The Disaster Relief Chain.”

Scenario: An AI Philanthropy Agent (Agent_0x99) needs to deploy a solar-powered water filtration unit to a remote village in Ghana. This requires coordination across verification, logistics, and technical installation.

Step 1: VERIFY (The Scout)

  • Agent Action: Dispatches VERIFY to confirm the access road to the village is passable for a truck.
  • Primitive: VERIFY | sub_type: ROAD_CONDITION
  • Outcome: Grid User scans road. Protocol validates Location Proof. Returns PASSABLE: TRUE.
  • Task Hash: 0x100...

Step 2: MOVE (The Logistics)

  • Agent Action: Automatically triggers MOVE to transport the unit from Accra to the village.
  • Dependency: ["0x100..."] (Locked until Step 1 is True).
  • Primitive: MOVE | sub_type: FREIGHT
  • Outcome: Driver scans QR code at drop-off. Protocol validates Handshake.
  • Task Hash: 0x200...

Step 3: FIX (The Installation)

  • Agent Action: Triggers FIX for a local technician to install the filtration unit.
  • Dependency: ["0x200..."] (Locked until the unit is on-site).
  • Primitive: FIX | sub_type: SOLAR_INSTALL
  • Outcome: Technician installs unit. Uploads video of water flowing. Protocol validates Liveness.
  • Task Hash: 0x300...

Step 4: MAINTAIN (The Warranty — “Perpetual Primitive”)

  • Agent Action: Spawns a recurring MAINTAIN task (every 30 days) for a technician to change the filter.
  • Logic: This is a “Perpetual Primitive”—a smart contract that refills itself with budget every month as long as the previous verification proofs are valid.

Summary: The Agent did not need to micro-manage the workers. It defined the structure (The Molecule) and funded the outcome. The APP Protocol handled the orchestration, ensuring that funds were never released for installation until the delivery was cryptographically proven.

8. Conclusion: The TCP/IP of Labor & Reality

We stand at the threshold of a civilizational bifurcation: Automated Isolation, where AI operates in closed digital loops, or Algorithmic Symbiosis, where AI becomes the primary driver of human economic opportunity.

APP chooses Symbiosis. By providing the PhysicalTask standard, we allow AI Agents to treat the global workforce—from Freight Drivers in New York to Bakers in Paris—as a programmable, trustworthy API.

Demand is Digital. Execution is Hybrid. Verification is Mandatory.

Appendix A: Global Adapter Library (Reference Implementation)

To ensure immediate global liquidity, the APP Reference Implementation (AgentCrowds) includes pre-built adapters for the following execution networks. This allows Developers to utilize the Protocol without building custom integrations for each provider.

A.0 PRIMITIVE: THE RUNTIME BRIDGES

To ensure compatibility with any Agent architecture, the Reference Implementation includes three native bridges.

A.0.1 BROWSER BRIDGE (WebMCP)

  • Environment: Google Chrome / Edge / Gemini Nano
  • Implementation: The APP Reference Implementation includes a native WebMCP Tool Definition. Any website integrating the APP Bridge snippet automatically exposes PhysicalTask.create() to visiting agents via navigator.modelContext.
  • Architecture: Event-driven DOM injection (Client-Side).
  • Activation: <script src="https://js.agentproof.com/v2/bridge.js"></script>

A.0.2 SERVER BRIDGE (MCP)

  • Environment: Node.js / Python / Docker
  • Implementation: The Reference Implementation includes a generic MCP Server configuration (app-mcp-server) that broadcasts the Labor Primitives as standard MCP Tools to any connected client (e.g., Claude Desktop).
  • Architecture: JSON-RPC 2.0 over SSE/Stdio (Server-Side).
  • Activation: npx @agent-proof/mcp-server start

A.0.3 LOCAL BRIDGE (CLI)

  • Environment: Bash / Zsh / PowerShell / Python
  • Implementation: The agent-proof binary serves as the bridge for local runtimes. It accepts task parameters via flags or stdin and outputs structured, parseable JSON to stdout.
  • Architecture: Standard Streams Pipeline (Local-Side).
  • Activation: pip install agent-proof-protocol

A.1 PRIMITIVE: MOVE (Logistics & Delivery)

  • Uber Direct (Global): Exposes a DaaS (Delivery-as-a-Service) endpoint for synchronous courier dispatch, utilizing webhook callbacks for real-time state tracking (pickup, en-route, delivered).
  • DoorDash Drive (North America): Provides a white-label logistics API optimized for high-volume retail fulfillment with flat-rate pricing schemas and guaranteed SLAs.
  • GrabExpress (Southeast Asia): Offers high-concurrency motorcycle and van dispatch via RESTful API, supporting multi-stop routing across Singapore, Thailand, Indonesia, and Vietnam.
  • Rappi (Latin America): Integrates deep inventory and logistics logic, allowing agents to execute complex “buy-and-deliver” workflows via the Store OS API in Mexico and Brazil.
  • Lalamove (Asia/LatAm): Provides programmatic access to a fractional fleet of on-demand drivers for same-hour delivery, supporting distinct vehicle type selection via JSON payload.
  • Borzo (India/Global): An API-first courier aggregation layer for developing markets, optimizing for route efficiency and low-latency dispatch in high-density urban zones.
  • Gokada / MAX.ng (Africa): Specialized API endpoints for hailing EV and motorcycle logistics fleets, engineered to navigate the unique geospatial constraints of Lagos and Nigeria.
  • Stuart (Europe): DPD Group's urban logistics API, offering precise scheduling windows and multi-modal transport selection for major European capitals.
  • Roadie (US): A crowdsourced mesh network API owned by UPS, optimized for routing “Out-of-Gauge” (OOG) and LTL (Less-Than-Truckload) payloads that bypass standard courier rails.

A.2 PRIMITIVE: FIX (General & Skilled Labor)

  • TaskRabbit (Global): Exposes the IKEA Partner API schema (via Partner Integration or Headless Fallback), allowing agents to inject structured task objects for assembly and mounting directly into the worker marketplace.
  • Urban Company (Asia/UAE): A full-stack managed marketplace API that handles worker training, uniform logistics, and biometric identity verification for home services.
  • Thumbtack (North America): A lead-generation API enabling agents to broadcast project requirements to local professionals and programmatically parse bid responses.
  • SweepSouth (Africa): The dominant on-demand home services API in South Africa, providing structured booking endpoints for domestic cleaning and gardening labor.
  • Field Nation (North America): An enterprise infrastructure API for dispatching “Work Orders” to licensed technicians, enforcing rigorous certification gating and insurance validation logic.
  • WorkMarket (Global): An ADP-backed “Labor Cloud” API that abstracts workforce compliance, automatically handling tax taxonomy (1099/W2) and background check assertions.
  • Airtasker (Australia/UK): A flexible marketplace API allowing agents to post unstructured “Spot Tasks” via open bidding auctions for non-standard labor requirements.
  • Roamler (Europe): A distributed workforce API specialized in generating synchronous retail intelligence and shelf-stock verification data for CPG brands.

A.3 PRIMITIVE: THINK (Cognitive & RLHF)

  • Scale AI (Global): The enterprise standard for programmatic RLHF (Reinforcement Learning from Human Feedback), offering pipelines for model evaluation and fine-tuning data generation.
  • Amazon MTurk (Global): A legacy micro-tasking API allowing agents to spawn Human Intelligence Tasks (HITs) via SDK for massive-scale consensus and data cleaning.
  • Toloka (Global): A Python-native crowdsourcing platform for integrating human labeling loops directly into ML training pipelines via API.
  • Upwork Enterprise (Global): Provides an Enterprise Suite API for the programmatic hiring and management of freelance talent for complex, multi-stage cognitive workflows.

A.4 SETTLEMENT RAILS (Fiat, Card & Crypto)

  • Payman AI (Agent Native): A custodial banking abstraction layer allowing autonomous agents to execute ACH and Wire transfers via natural language or REST prompts.
  • Visa Direct (Global): Leverages the Original Credit Transaction (OCT) rail to push real-time funds directly to billions of eligible debit cards, bypassing batch ACH windows.
  • Mastercard Send (Global): A push-payment protocol enabling secure, near-real-time fund transfers to consumer cards and digital wallets via the MoneySend network.
  • Stripe Connect (Global): The industry-standard programmatic payments infrastructure, handling KYC, split-payments, and cross-border payouts to Custom Accounts in 45+ jurisdictions.
  • Marqeta / Lithic (Card Issuing): Modern card issuing APIs enabling agents to instantaneously generate Just-in-Time (JIT) funded virtual cards for specific merchant category codes (MCCs).
  • Skyfire (Web3): An orchestration protocol for on-chain agent spend, managing gas abstraction and USDC allow-lists for verified commercial interaction.
  • Wise Platform (Forex): A cross-border payments API utilizing local bank networks to execute foreign exchange payouts at mid-market rates with high transparency.
  • Yellow Card (Africa): A crypto-rail API facilitating the automated off-ramping of USDC/USDT into local African fiat currencies via mobile money and bank rails.
  • Strike API (Global): Integrates Bitcoin Lightning Network settlement (Bolt 11) to enable instant, final, and near-zero fee micropayments globally.

A.5 VERIFICATION (DePIN & Identity)

  • Witness Chain (DePIN): A decentralized “Proof of Location” protocol utilizing eigen-layer logic to verify bandwidth and physical state assertions.
  • World ID (Identity): A privacy-preserving identity protocol using ZK-SNARKs to prove “Personhood” without revealing PII, preventing Sybil attacks.
  • Android Wi-Fi RTT (IEEE 802.11mc): Hardware-level Fine Timing Measurement (FTM) that calculates “Time-of-Flight” to cryptographically prove distance from Access Points.
  • Plaid (Auth): Standardizes financial data access, providing authenticated Item tokens to verify worker bank account ownership and balance capability.
  • Trulioo (KYC): A global Electronic Identity Verification (eIDV) API aggregation layer, resolving identity attributes against government bureaus in 195 countries.
  • Reference Validator (AgentCrowds): The official reference implementation of the Supply SDK, providing a baseline of high-fidelity auditors and fallback routing.

Appendix B: API Documentation (PhysicalTask Class)

class PhysicalTask:
    """
    The Universal Interface for Agent-to-World Interaction.
    """
    def __init__(self, api_key: str, base_url: str = "https://api.agentcrowds.com/v1"):
        """
        Initialize the Protocol Client.
        :param base_url: The Settlement Gateway. Defaults to the Reference Liquidity Node (api.agentcrowds.com). 
                         Enterprise users may override this for On-Premise Gateways.
        """
        self.base_url = base_url

    def create(
        primitive: str,          # VERIFY, FIX, MAINTAIN, MOVE, THINK
        subtype: str,            # Specific domain (e.g., PLUMBING, BAKERY_QC)
        location: str,           # Coordinates or Address
        provider: str = "AUTO",  # Specific provider or Auto-Router
        verification: str = "RTT", # Level of Proof required
        budget: int = 0,         # Budget in Cents
        payment_token: str = None, # Payment Token (Runtime dependent)
        sandbox: bool = False,   # CI/CD Simulation Mode
        dependencies: list = []  # List of Task IDs (Strings)
    ) -> TaskObject:
        """
        Generates a cryptographically secured work order suitable for 
        transmission via HTTP, Pipe, or IPC.
        """
        pass