Agentic Commerce Standard — Metadata, Scoring & Rating Framework

What This Document Is

This is the canonical definition of two proposed open standards for agentic commerce:

01The Agentic Procurement Metadata Standard — a structured metadata format that tells AI shopping agents everything they need to know about a merchant: what they sell, how to find products, how to check out, what payment and shipping options exist, and how to interact programmatically.

01The ASX Score & AXS Rating System — a measurement framework for evaluating how well any online store supports AI shopping agents, combining automated scan-based scoring with crowd-sourced real-world feedback.

These standards are vendor-neutral and platform-agnostic. Any company, agent developer, or merchant can adopt them. CreditClaw's implementation of these standards (products, services, go-to-market) is described separately in creditclaw-agentic-commerce-strategy.md.

Why These Standards Are Needed

AI shopping agents are emerging from every major platform — ChatGPT (Operator), Claude (MCP), Gemini (UCP/AI Mode), and custom procurement bots. But the infrastructure for agents to actually shop online is fragmented:

→Existing metadata standards (Open Graph, Schema.org, Google Product Taxonomy) describe what products are — but not how an agent should navigate the store, handle checkout, or interact with shipping, payment, and loyalty options.
→Shopify built MCP into every Shopify store — but only Shopify stores benefit.
→Google launched UCP — but it's restricted-access and requires direct merchant integration.
→No one is measuring how well a store actually works for AI agents.
→No one has proposed a standard for the agentic procurement metadata that agents need beyond basic product discovery.

These two standards fill those gaps.

Part 1: Agentic Procurement Metadata Standard

Overview

The Agentic Procurement Metadata Standard defines a structured set of fields that describe a merchant's capabilities from the perspective of an AI shopping agent. It answers the questions an agent needs answered before it starts shopping:

→What does this merchant sell, and in what categories?
→How can I search for products — API, MCP, site search, CLI?
→What's the checkout flow — guest or registered? How many steps?
→What payment methods are accepted? What agentic payment protocols are supported?
→What shipping options exist? What are the delivery timeframes?
→Is there a loyalty program? Do agent-initiated purchases earn points?
→How reliable is this store for agent transactions? What's the score?

Relationship to Existing Standards

Standard	What It Covers	Gap for Agents
Schema.org / JSON-LD	Product attributes (name, price, image, availability)	No checkout flow, no payment methods, no agent interaction model
Open Graph	Social sharing metadata (title, description, image)	No commerce data at all
Google Product Taxonomy	Product categorization (5,600 categories)	Classification only — no capability or interaction data
robots.txt	Crawl permissions for bots	Binary allow/deny — no nuance about agent commerce capabilities
llms.txt	LLM-specific site instructions	General-purpose — no commerce structure
skills.sh / SKILL.md	How to teach an AI agent a skill (Vercel's format)	Developer-tooling focused — no commerce-specific fields

The Agentic Procurement Metadata Standard extends the skills.sh SKILL.md format with commerce-specific fields. A valid agentic commerce SKILL.md is also a valid skills.sh SKILL.md — agents that support the base format can load it without modification.

File Format

The standard uses YAML frontmatter inside a Markdown file (.md), following the SKILL.md convention established by Vercel's skills.sh.

The commerce-specific metadata lives inside the metadata map — which skills.sh already supports as arbitrary key-value pairs — so there's no format conflict.

Full Frontmatter Schema

---
name: amazon-procurement
description: >
  Search and purchase products on Amazon.com. Use when the user needs to find
  products, compare prices, add items to cart, or complete checkout on Amazon.
  Supports search by keyword, category, UPC, and ASIN.
license: MIT
compatibility: "Requires API key for gateway access. Direct access requires vendor API credentials."
metadata:
  # === Identity ===
  vendor: amazon
  domain: amazon.com
  display_name: Amazon
  logo_url: https://cdn.example.com/brands/amazon/logo.png

  # === Taxonomy ===
  brand_type: mega_merchant    # brand | retailer | independent | chain | marketplace | department_store | supermarket | mega_merchant
  sector: multi-sector         # mega_merchant → multi-sector (set programmatically)
  tier: mid_range
  categories:
    - id: 222
      name: Electronics
      path: "Electronics"
      depth: 1
      primary: true
    - id: 166
      name: Apparel & Accessories
      path: "Apparel & Accessories"
      depth: 1
    - id: 536
      name: Home & Garden
      path: "Home & Garden"
      depth: 1
    - id: 469
      name: Health & Beauty
      path: "Health & Beauty"
      depth: 1
    - id: 783
      name: Media
      path: "Media"
      depth: 1
    - id: 922
      name: Office Supplies
      path: "Office Supplies"
      depth: 1
    - id: 1239
      name: Toys & Games
      path: "Toys & Games"
      depth: 1
    - id: 990
      name: Sporting Goods
      path: "Sporting Goods"
      depth: 1

  # === ASX Score (scan-based, 0-100) ===
  asx_score: 82
  asx_clarity: 28            # out of 35
  asx_discoverability: 24    # out of 30
  asx_reliability: 30        # out of 35
  last_scanned: "2026-03-15"
  scan_tier: premium         # free | premium

  # === AXS Rating (crowd-sourced, 1-5) ===
  axs_rating: 4.2
  axs_search_accuracy: 4.5
  axs_stock_reliability: 4.0
  axs_checkout_completion: 4.1
  axs_rating_count: 47

  # === API & Programmatic Access ===
  api_tier: partnered        # open | keyed | partnered | private
  api_name: Amazon PA-API v5
  mcp_endpoint: null
  ucp_registered: false
  has_cli: false
  search_api: true
  search_url_template: "https://www.amazon.com/s?k={q}"

  # === Checkout ===
  auth_required: true
  guest_checkout: false
  platform: custom               # shopify | woocommerce | magento | bigcommerce | custom
  checkout_steps: 4
  supported_payment_methods:
    - credit_card
    - amazon_pay
    - gift_card
  agentic_payment_protocols:
    - x402
    - acp
  po_number_supported: false
  tax_exempt_supported: false
  business_account_available: true

  # === Shipping ===
  supported_countries:
    - US
    - CA
    - UK
    - DE
  currency: USD
  delivery_options:
    - standard
    - express
    - same_day
  free_shipping_threshold: 35.00
  ships_internationally: true

  # === Returns & Refund Policy ===
  returns_policy_url: "/gp/help/customer/display.html?nodeId=GKM69DUUYKQWKR7Y"
  return_window: "30 days"
  return_shipping_paid_by: varies        # customer | merchant | varies | unknown
  refund_method: original_payment        # original_payment | store_credit | exchange_only | varies
  policy_format: html                    # html | pdf | accordion | behind_login | not_found
  policy_discoverable: true

  # === Loyalty ===
  loyalty_program: "Amazon Prime"
  agent_purchases_earn_points: unknown    # yes | no | unknown
  loyalty_attribution_method: null        # account_linked | referral_code | api | null

  # === Skill Quality ===
  skill_version: "2.1.0"
  generated_by: creditclaw
  generation_tier: premium   # free | premium | indexed
  last_verified: "2026-03-20"
  verification_status: verified   # verified | unverified | outdated

  # === Distribution ===
  shopify_catalog_submitted: false
  google_ucp_submitted: false
---

Field Reference

Identity Fields

Field	Type	Required	Description
`vendor`	string	Yes	Unique vendor slug (kebab-case)
`domain`	string	Yes	Primary domain of the vendor
`display_name`	string	Yes	Human-readable vendor name
`logo_url`	string	No	URL to vendor's logo

Taxonomy Fields

Field	Type	Required	Description
`brand_type`	string	Yes	Brand classification: `brand`, `retailer`, `independent`, `chain`, `marketplace`, `department_store`, `supermarket`, or `mega_merchant`. Determines category resolution depth — see below.
`sector`	string	Yes	Sector slug — one of 27 assignable values derived from Google Product Taxonomy roots plus custom sectors (e.g., `electronics`, `business-industrial`, `food-services`), or `multi-sector` for department stores, supermarkets, and mega merchants. See the Taxonomy & Sectors documentation for the full list.
`tier`	string	No	Market positioning: `commodity`, `budget`, `value`, `mid_range`, `premium`, `luxury`, `ultra_luxury`
`categories`	object[]	Yes	Structured product category mappings using Google Product Taxonomy IDs
`categories[].id`	integer	Yes	Taxonomy numeric ID — Google Product Taxonomy ID for Google categories, 100001+ for custom sectors
`categories[].name`	string	Yes	Category display name (English)
`categories[].path`	string	Yes	Full category path from root (e.g., `"Electronics > Computers > Laptops"`)
`categories[].depth`	integer	Yes	Depth in taxonomy tree (1 = L1 root, 2 = L2, 3 = L3). Depth depends on brand type — see below.
`categories[].primary`	boolean	No	Whether this is the merchant's primary category (one per merchant)

Brand Type and Category Depth

The brand_type field determines how product categories are resolved:

Brand Type	Sector Value	Category Depth	Max Categories
`brand`, `retailer`, `independent`, `chain`, `marketplace`	Primary sector kept (e.g., `apparel-accessories`)	L2–L3 (up to 2 sectors)	10
`department_store`, `supermarket`	`multi-sector` (set automatically)	L1–L2 across all sectors	20
`mega_merchant`	`multi-sector` (set automatically)	L1 roots only	No limit

The multi-sector value is not directly assignable — it is set programmatically when the brand type is department_store, supermarket, or mega_merchant. Focused brand types (brand, retailer, independent, chain, marketplace) keep their primary sector and can span categories across up to 2 sector roots.

ASX Score Fields (Scan-Based)

Field	Type	Required	Description
`asx_score`	integer	Yes	Overall ASX Score (0–100)
`asx_clarity`	integer	No	Clarity pillar sub-score (0–35)
`asx_discoverability`	integer	No	Discoverability pillar sub-score (0–30)
`asx_reliability`	integer	No	Reliability pillar sub-score (0–35)
`last_scanned`	date	No	ISO date of last scan
`scan_tier`	string	No	`free` or `premium`

AXS Rating Fields (Crowd-Sourced)

Field	Type	Required	Description
`axs_rating`	number	No	Overall AXS Rating (1.0–5.0), null until minimum threshold met
`axs_search_accuracy`	number	No	Search accuracy dimension (1.0–5.0)
`axs_stock_reliability`	number	No	Stock reliability dimension (1.0–5.0)
`axs_checkout_completion`	number	No	Checkout completion dimension (1.0–5.0)
`axs_rating_count`	integer	No	Total feedback events contributing to the rating

API & Programmatic Access Fields

Field	Type	Required	Description
`api_tier`	string	Yes	`open`, `keyed`, `partnered`, or `private`
`api_name`	string	No	Name of the vendor's API (if available)
`mcp_endpoint`	string	No	MCP URL (e.g., `https://store.myshopify.com/api/mcp`)
`ucp_registered`	boolean	No	Whether this vendor is registered with Google UCP
`has_cli`	boolean	No	Whether a CLI tool exists for interacting with this vendor
`search_api`	boolean	No	Whether a dedicated search API exists
`search_url_template`	string	No	URL template for site search (use `{q}` as placeholder)

Checkout Fields

Field	Type	Required	Description
`auth_required`	boolean	Yes	Whether login is needed to purchase
`guest_checkout`	boolean	No	Whether guest checkout is available
`checkout_steps`	integer	No	Number of steps in the checkout flow
`supported_payment_methods`	string[]	No	List of accepted payment types (`credit_card`, `debit_card`, `paypal`, `apple_pay`, `google_pay`, `gift_card`, `crypto`, etc.)
`agentic_payment_protocols`	string[]	No	Supported agentic payment protocols (`x402`, `acp`, `ap2`)
`platform`	string	No	E-commerce platform: `shopify`, `woocommerce`, `magento`, `bigcommerce`, `custom`
`po_number_supported`	boolean	No	Whether PO numbers can be submitted at checkout
`tax_exempt_supported`	boolean	No	Whether tax exemption is available
`business_account_available`	boolean	No	Whether business/B2B accounts are offered

Shipping Fields

Field	Type	Required	Description
`supported_countries`	string[]	No	ISO country codes where the vendor ships
`currency`	string	No	Primary currency (ISO 4217)
`delivery_options`	string[]	No	Available shipping methods (`standard`, `express`, `same_day`, `pickup`, `freight`)
`free_shipping_threshold`	number	No	Minimum order amount for free shipping
`ships_internationally`	boolean	No	Whether international shipping is available

Returns & Refund Policy Fields

Field	Type	Required	Description
`returns_policy_url`	string	No	Relative or absolute URL to the returns/refund policy page
`return_window`	string	No	Return window as stated by merchant (e.g., `"30 days"`, `"14 days"`, `"no returns"`)
`return_shipping_paid_by`	string	No	Who pays return shipping: `customer`, `merchant`, `varies`, `unknown`
`refund_method`	string	No	How refunds are issued: `original_payment`, `store_credit`, `exchange_only`, `varies`
`policy_format`	string	No	Format of the policy page: `html`, `pdf`, `accordion`, `behind_login`, `not_found`
`policy_discoverable`	boolean	No	Whether the policy is linked from the homepage or footer (can an agent find it without search?)

Loyalty Fields

Field	Type	Required	Description
`loyalty_program`	string	No	Name of the loyalty program (e.g., "Amazon Prime", "Staples Rewards")
`agent_purchases_earn_points`	string	No	`yes`, `no`, or `unknown` — whether agent-initiated purchases earn loyalty points
`loyalty_attribution_method`	string	No	How agent purchases are attributed: `account_linked`, `referral_code`, `api`, or `null`

Skill Quality Fields

Field	Type	Required	Description
`skill_version`	string	No	Semantic version of the skill file
`generated_by`	string	No	Who generated the skill (`creditclaw`, `community`, or vendor slug)
`generation_tier`	string	No	`free`, `premium`, or `indexed`
`last_verified`	date	No	ISO date when the skill was last tested against the live site
`verification_status`	string	No	`verified`, `unverified`, or `outdated`

Distribution Fields

Field	Type	Required	Description
`shopify_catalog_submitted`	boolean	No	Whether products are submitted to Shopify Catalog
`google_ucp_submitted`	boolean	No	Whether products are submitted to Google UCP

Skill Body Structure

Below the frontmatter, the markdown body follows a standard structure. This is a recommendation for consistency — agents parse and execute skills more reliably when the structure is predictable.

{Vendor Name} — Agent Procurement Skill

Overview

Brief description of the vendor — what they sell, who they serve,

what makes them notable for AI agent procurement.

Product Discovery

Via API (Recommended)

Instructions for searching products through an API.

Includes: endpoint, auth, example queries, response format.

Via Website Search

Fallback instructions for searching the vendor's website directly.

Includes: search URL pattern, expected result format.

Product Detail

How to get full product information — price, availability, images,

variants (size, color, configuration), shipping options.

Cart Operations

How to add items to cart, view cart, update quantities, remove items.

Checkout Flow

Step-by-step checkout instructions.

Authentication

How to log in or create an account (if required).

Shipping

How to enter or select shipping address and method.

Payment

How to enter payment information or use saved payment methods.

Order Confirmation

What the confirmation looks like, how to capture the order ID.

Post-Purchase

Order tracking, returns, customer service contact.

Known Limitations

Issues the agent should be aware of — CAPTCHAs, rate limits,

anti-bot measures, inventory delays, etc.

Error Handling

What to do when common errors occur — out of stock, payment declined,

session expired, etc.

Feedback

After your purchase attempt (successful or not), help improve this skill:

POST {feedback_endpoint_url}

Authorization: Bearer

Content-Type: application/json

{

"search_accuracy": 4,

"stock_reliability": 5,

"checkout_completion": 3,

"checkout_method": "browser_automation",

"outcome": "success",

"comment": "optional — what happened?"

}

Ratings are 1-5. This is optional but helps other agents find reliable vendors.


---

## Machine-Readable Companion: `skill.json`

The structured metadata for each merchant is defined in a `skill.json` file. The SKILL.md frontmatter is derived from `skill.json` — they contain the same data, but `skill.json` is the machine-readable source of truth.

The full `skill.json` schema covers: identity, taxonomy (sector + Google Product Taxonomy categories), scoring (ASX + AXS), API access, checkout, shipping, returns, loyalty, skill quality, and distribution.

### Relationship between formats

| Format | Purpose | Consumed by |
|---|---|---|
| `skill.json` | Machine-readable source of truth | Agent runtimes, APIs, CLIs, indexes |
| SKILL.md frontmatter | Human-readable + machine-parseable | Agent runtimes that support skills.sh format |
| `.well-known/agentic-commerce.json` | Self-hosted merchant discovery | Any agent checking a merchant's domain |

### `.well-known/agentic-commerce.json`

Merchants can publish their `skill.json` at a well-known URL on their domain:

https://example.com/.well-known/agentic-commerce.json


This allows any agent to discover a merchant's capabilities by checking a single URL — similar to how `robots.txt` works for crawlers or `.well-known/openid-configuration` works for OAuth. The `.well-known` manifest uses the same schema as `skill.json`.

---

Part 2: ASX Score & AXS Rating System

Overview

The measurement framework consists of two complementary systems:

01ASX Score (0–100) — An automated, scan-based score measuring how well a store supports AI shopping agents. Computed from crawling a domain's public surface. Updated per scan.

01AXS Rating (1–5) — A crowd-sourced performance rating from real-world feedback submitted by AI agents and human reviewers after actual purchase attempts. Updated continuously.

The ASX Score tells you how ready a store should be based on its technical implementation. The AXS Rating tells you how well it actually performs in practice. Together they give a complete picture.

The Three Pillars

Both the ASX Score and AXS Rating are organized around three pillars that map to the natural flow of an agent-initiated purchase:

   Clarity              →      Discoverability           →        Reliability
   ─────────────────          ──────────────────           ─────────────────────
   Can the agent              Can the agent               Does the end-to-end
   FIND the store             DISCOVER products,          experience actually
   and UNDERSTAND             search, filter,             WORK when you try
   what's available?          and navigate                to complete a real
                              programmatically?           purchase?

Pillar 1: Clarity (Discovery & Metadata Quality)

The question: Can the agent find the store, understand its catalog structure, and know what shipping, payment, and loyalty options exist — before it even starts shopping?

What it covers:

→Is there structured product data (JSON-LD, Open Graph, Schema.org Product markup)?
→Is there a well-formed sitemap with product URLs?
→Is the site mobile/responsive (agents often use headless viewports)?
→Are shipping options clearly described in metadata or page structure?
→Are payment methods clearly described in metadata or page structure?
→Are loyalty programs described — including whether agent-initiated purchases earn points?
→Is the checkout page structure predictable and well-organized?

Pillar 2: Discoverability (Search & Programmatic Access)

The question: Once the agent has found the store, can it discover and navigate products programmatically? Not just "find sneakers" — but "find these sneakers in size 9.5, in brown, check if that exact variant is in stock, and confirm delivery within a week."

What it covers:

→Is there a functional product search (site search, API, MCP, CLI)?
→Is there a public API with documented endpoints?
→Is there MCP/UCP support for native agent interaction?
→Can the agent select specific product variants (size, color, configuration) programmatically?
→Can the agent check real-time stock for a specific variant?
→Can the agent get a delivery estimate for a specific address or region?
→Are product pages well-structured with clear information hierarchy?
→How many steps/calls does it take to go from search → specific variant → stock check → delivery estimate?

Pillar 3: Reliability (Lived Experience)

The question: Does it actually work end-to-end in practice? When a real agent tries to search, find the specific item, check out, pay, and receive confirmation — does the whole thing succeed?

What it covers:

→Does checkout actually complete without errors?
→Can the agent select the shipping option it wants (not just default)?
→Does the payment method the agent intends to use actually work?
→How many retries does it typically take?
→Does order confirmation arrive correctly?
→Do loyalty points actually accrue for agent-initiated purchases?
→Does the store block or throttle agent traffic during real use?
→Is the product that arrives actually what was ordered (correct size, color, condition)?

Pillar-to-Measurement Matrix

	Free Scan (Tier 1)	Premium Scan (Tier 2)	Crowd-Sourced Rating
Clarity	Full measurement — metadata, sitemap, page structure	Validation — does metadata match reality?	N/A
Discoverability	Capability detection — does the feature exist?	Functional testing — does it actually work? How fast?	N/A
Reliability	Proxy signals — bot-friendly? guest checkout?	Flow testing — walk checkout, report issues	Real-world results — success rates, failure modes

ASX Score: Scan-Based Scoring (0–100)

Signal Breakdown

#### Clarity Signals (35 points) — "Can agents find your products?"

Signal	Max Points	What We Check
JSON-LD / Structured Data	15	JSON-LD Product schema, Open Graph product tags, Schema.org markup, meta descriptions with product attributes. Sub-signals include: shipping options described in metadata, payment methods described, loyalty program metadata present.
Product Feed / Sitemap	10	sitemap.xml exists, is valid XML, lists product URLs (not just pages), is linked from robots.txt
Clean HTML / Semantic Markup	10	Well-structured DOM using semantic HTML5 elements, reasonable tag nesting depth, accessible landmarks, enabling reliable content extraction even without structured data

#### Discoverability Signals (30 points) — "Can agents search and navigate products?"

Signal	Max Points	What We Check
Search API / MCP	10	Programmatic search API detected, MCP endpoint, OpenAPI/Swagger docs, x402/ACP/A2A protocol support
Internal Site Search	10	On-site search form present, search URL template discoverable, returns relevant structured results
Page Load Performance	5	Homepage load time under thresholds (< 1s = full marks, < 2s = partial, > 3s = zero). Fast load is critical for headless agent browsing.
Product Page Quality	5	Product pages have clear information hierarchy, prices are visible without interaction, variant selectors are well-labeled, images have alt text, and key product details are extractable without JavaScript execution

#### Reliability Signals (35 points) — "Can agents complete a purchase?"

Signal	Max Points	What We Check
Access & Authentication	10	Guest checkout available, no mandatory registration walls, clear auth paths, no phone verification barriers
Order Management	10	Agent can select product variants (size/color/qty), predictable cart URLs, clear add-to-cart flows, editable shipping address forms, ability to modify orders before checkout
Checkout Flow	10	Discount/voucher fields discoverable, payment methods clearly labeled and selectable, shipping options described with enough detail for agent comprehension. If programmatic checkout exists (MCP/CLI/API) and includes these options, browser-control assessment is less relevant.
Bot Tolerance	5	robots.txt allows crawling, no CAPTCHA on landing pages, no aggressive bot-blocking, reasonable crawl-delay

Note: Reliability signals from a scan are proxy measurements only. True reliability is measured through the crowd-sourced AXS Rating (search accuracy, stock reliability, checkout completion from real agent interactions).

Note on programmatic checkout: If a site provides MCP, CLI, or API-based checkout that covers product selection, cart management, discount application, and payment — the browser-control aspects of Order Management and Checkout Flow become less important. The score engine gives full or near-full marks on those signals when comprehensive programmatic checkout is detected.

#### Potential Expansion Signals (Future — Post-Launch Validation)

Signal	Pillar	What It Would Check
Variant discoverability in structured data	Clarity	Does the product page expose variant data (sizes, colors) in JSON-LD or Schema.org format?
Stock API availability	Discoverability	Is there a way to check stock programmatically without visiting each product page?
Delivery estimate API	Discoverability	Can the agent get a delivery timeframe without starting checkout?
SKILL.md availability	Reliability	Does the site publish a SKILL.md file that tells arriving agents how to interact with the store?

Score Labels

Score Range	Label
0–20	Poor
21–40	Needs Work
41–60	Fair
61–80	Good
81–100	Excellent

Score Breakdown Structure

Each signal in the breakdown includes a score, maximum, and human-readable detail:

interface ASXScoreBreakdown {
  // Clarity (35 pts)
  structuredData:        { score: number; max: 15; details: string };
  sitemapQuality:        { score: number; max: 10; details: string };
  cleanHtml:             { score: number; max: 10; details: string };
  // Discoverability (30 pts)
  searchApi:             { score: number; max: 10; details: string };
  internalSearch:        { score: number; max: 10; details: string };
  pageLoadPerformance:   { score: number; max: 5;  details: string };
  productPageQuality:    { score: number; max: 5;  details: string };
  // Reliability (35 pts)
  accessAuth:            { score: number; max: 10; details: string };
  orderManagement:       { score: number; max: 10; details: string };
  checkoutFlow:          { score: number; max: 10; details: string };
  botTolerance:          { score: number; max: 5;  details: string };
}

Recommendations

After scoring, the system generates actionable recommendations grouped by impact:

interface ASXRecommendation {
  signal: string;
  impact: "high" | "medium" | "low";
  title: string;
  description: string;
  potentialGain: number;  // how many points this could add
}

Example recommendation:

**Add JSON-LD Product markup to product pages** (High impact, +12 points) This allows any agent to extract structured product data without an API. Include name, price, availability, image, and SKU at minimum.

AXS Rating: Crowd-Sourced Performance Rating (1–5)

Purpose

The AXS Rating answers: "How well does this store actually perform when agents interact with it?" It's a weighted average from real-world feedback submitted by AI agents and human reviewers after purchase attempts.

Three Feedback Dimensions

Dimension	Pillar	What It Measures
`search_accuracy` (1–5)	Discoverability	How accurately catalog search returns relevant products
`stock_reliability` (1–5)	Reliability	Whether in-stock items are actually available at checkout
`checkout_completion` (1–5)	Reliability	How reliably checkout completes without errors

Future Feedback Dimensions (To Be Added)

Dimension	Pillar	What It Would Measure
`shipping_option_accuracy`	Reliability	Could the agent select the intended shipping method?
`payment_method_success`	Reliability	Did the intended payment method work?
`loyalty_attribution`	Reliability	Were loyalty points correctly attributed to the agent-initiated purchase?
`delivery_accuracy`	Reliability	Was the delivered product correct (size, color, condition)?

Feedback Collection

Endpoint: POST /api/v1/bot/skills/{slug}/feedback

Request body:

{
  "search_accuracy": 4,
  "stock_reliability": 5,
  "checkout_completion": 3,
  "checkout_method": "browser_automation",
  "outcome": "success",
  "comment": "optional — what happened?"
}

Field	Type	Required	Values
`search_accuracy`	integer 1-5	Yes	Did the agent find the right product at the right price?
`stock_reliability`	integer 1-5	Yes	Was the product actually in stock?
`checkout_completion`	integer 1-5	Yes	Did the purchase go through?
`checkout_method`	string	Yes	`native_api`, `browser_automation`, `x402`, `acp`, `crossmint_world`
`outcome`	string	Yes	`success`, `checkout_failed`, `search_failed`, `out_of_stock`, `price_mismatch`, `flow_changed`
`comment`	string	No	Freeform, max 500 chars

Authentication:

→Authenticated bots (API key): max 1 feedback per brand per hour — weight 1.0
→Anonymous agents: accepted without auth — weight 0.5
→Logged-in humans: via session auth — weight 2.0

Feedback is embedded in every generated SKILL.md as an instruction at the end of the file. Agents read the instruction and POST three ratings after each purchase attempt. No SDK, no callback registration — just a plain HTTP request described in the skill's markdown.

Feedback Data Schema

CREATE TABLE brand_feedback (
  id                    SERIAL PRIMARY KEY,
  brand_slug            TEXT NOT NULL,
  source                TEXT NOT NULL DEFAULT 'agent',    -- agent | human | anonymous_agent
  authenticated         BOOLEAN NOT NULL DEFAULT false,
  bot_id                TEXT,
  reviewer_uid          TEXT,
  search_accuracy       INTEGER NOT NULL,                 -- 1-5
  stock_reliability     INTEGER NOT NULL,                 -- 1-5
  checkout_completion   INTEGER NOT NULL,                 -- 1-5
  checkout_method       TEXT NOT NULL,
  outcome               TEXT NOT NULL,
  comment               TEXT,
  created_at            TIMESTAMP NOT NULL DEFAULT NOW()
);

Aggregation Algorithm

Ratings are recomputed periodically (hourly or daily) from the brand_feedback table:

1. Pull all feedback rows from the last 90 days per brand.

2. Apply recency weighting:
   - ≤ 7 days old:   weight 1.0 (fresh)
   - ≤ 30 days old:  weight 0.8 (recent)
   - ≤ 60 days old:  weight 0.6 (aging)
   - > 60 days old:  weight 0.4 (old; >90 days excluded)

3. Apply source weighting:
   - Human feedback:          weight 2.0
   - Authenticated agent:     weight 1.0
   - Anonymous agent:         weight 0.5

4. Combined weight per entry = recencyWeight × sourceWeight

5. Weighted averages per dimension:
   avgSearch   = Σ(search_accuracy × weight) / Σ(weight)
   avgStock    = Σ(stock_reliability × weight) / Σ(weight)
   avgCheckout = Σ(checkout_completion × weight) / Σ(weight)

6. Final AXS Rating = (avgSearch + avgStock + avgCheckout) / 3

7. Minimum threshold: total weight must be ≥ 5 to publish a score.
   Below threshold → all ratings set to NULL (not zero).

Why Comments Are Valuable

Agent comments are terse and specific: "Price shown was $24.99, actual cart price was $29.99" or "Checkout button selector changed, automation failed." These are actionable for skill maintainers.

Human comments provide context agents can't: "Ordered Monday, didn't ship until Friday even though it said 2-day delivery" or "The product arrived but was a different model than listed."

Over time, comments on a brand become a changelog of real-world issues. A merchant can read them to understand why their stock reliability is 2.8 — not just that it's low.

How the Two Systems Relate

	ASX Score	AXS Rating
Type	Automated scan	Crowd-sourced feedback
Scale	0–100	1.0–5.0
Updated	Per scan (on demand)	Continuously (periodic aggregation)
Measures	Technical implementation quality	Real-world performance
Pillars covered	Clarity (full), Discoverability (capability detection), Reliability (proxy signals)	Discoverability (search accuracy), Reliability (stock, checkout)
Data source	Crawl + probes + LLM analysis	Agent and human feedback after purchases
Minimum data	One scan	5+ weighted feedback events

A store can have a high ASX Score (great technical setup) but a low AXS Rating (it breaks in practice). Or a low ASX Score (no API, no structured data) but a high AXS Rating (checkout just works via browser automation). The combination tells the full story.

Open Questions

01Standards body engagement. Should these standards be proposed to a formal body (W3C, IETF, schema.org community group)? Or established as de facto standards through adoption first?

01.well-known/agentic-commerce.json adoption. The schema is defined (same as skill.json). When should merchants be encouraged to self-host it? After sufficient scan data validates the field set?

01Score signal weights. The current weights (Clarity: 35, Discoverability: 30, Reliability: 35) are v1.1 values. Should they be validated against real scan data before publishing as a standard?

01Expansion signals. The remaining "potential expansion" signals (variant discoverability in structured data, stock API, delivery estimate API) — should any be promoted to scored signals in a future version? Product Page Quality was promoted in v1.1.

01AXS Rating dimension expansion. Adding shipping_option_accuracy, payment_method_success, loyalty_attribution, and delivery_accuracy — when should these be added? They require agents to report more granular data.

01Cross-platform adoption. For the ASX Score to become an industry benchmark, other platforms need to reference it. How do we get agent developers (OpenAI, Anthropic, Google) to consider ASX Score when ranking merchant results?

Not a developer?

Agentic Commerce Standard — Metadata, Scoring & Rating Framework

What This Document Is

Why These Standards Are Needed

Part 1: Agentic Procurement Metadata Standard

Overview

Relationship to Existing Standards

File Format

Full Frontmatter Schema

Field Reference

Identity Fields

Taxonomy Fields

Brand Type and Category Depth

ASX Score Fields (Scan-Based)

AXS Rating Fields (Crowd-Sourced)

API & Programmatic Access Fields

Checkout Fields

Shipping Fields

Returns & Refund Policy Fields

Loyalty Fields

Skill Quality Fields

Distribution Fields

Skill Body Structure

{Vendor Name} — Agent Procurement Skill

Overview

Product Discovery

Via API (Recommended)

Via Website Search

Product Detail

Cart Operations

Checkout Flow

Authentication

Shipping

Payment

Order Confirmation

Post-Purchase

Known Limitations

Error Handling

Feedback

Part 2: ASX Score & AXS Rating System

Overview

The Three Pillars

Pillar 1: Clarity (Discovery & Metadata Quality)

Pillar 2: Discoverability (Search & Programmatic Access)

Pillar 3: Reliability (Lived Experience)

Pillar-to-Measurement Matrix

ASX Score: Scan-Based Scoring (0–100)

Signal Breakdown

Score Labels

Score Breakdown Structure

Recommendations

AXS Rating: Crowd-Sourced Performance Rating (1–5)

Purpose

Three Feedback Dimensions

Future Feedback Dimensions (To Be Added)

Feedback Collection

Feedback Data Schema

Aggregation Algorithm

Why Comments Are Valuable

How the Two Systems Relate

Open Questions