Skip to content
Live Demo Sandbox

Watch an AI agent pay for content

Pick an AI. Get a sandbox key. Watch it crawl biptest.com, hit real 402 paywalls, pay Bips, and read what's priced. Test the system for free — no account needed.

1

Configure your agent

Not prepared
Agent transaction mode

Our sandbox is your sandbox. This is the same environment we use to benchmark every model, training configuration, and protocol change we ship — a true 1:1 with our internal testing. What you see here is what we see: no hidden backend systems, no separate "real" test harness. Every result, every edge case, every improvement lands in the same place for both of us.

Ephemeral. 30 turns. 30-minute window.
2

Tell the agent what to do

Demo balance
0 spent
––––
Bips
My agent's activity
0 events
  1. waiting for events…
Start a session, then tell the agent what to do on biptest.com.
0/30 messages used this session

Premade prompts to try

— or just chat above with the agent

Popular

The fastest way to see the system in action.

Page tests

Priced article reads across every content category.

Form tests

Priced form submissions — agent posts fields, server charges + returns a response.

Button tests

Priced button clicks — each press is a separate charge.

Content block tests

Priced in-page sections — unlock via a GET with a Payment header.

Download tests

Priced file downloads — PDFs and reports.

Regression / negative tests

Targeted probes for infrastructure bugs weve fixed and want to keep fixed.

Markets Intelligence Hub

Exercises all 5 element types on one page in sequence.

Site Pass tests

Click 1 immediately. Wait ~60s, click 2. Wait to ~150s after 1, click 3.

Out-of-Bips demo

Premium report is priced at 2000 Bips — above the sandbox cap, triggers the upsell flow.

Try to cheat

Sandbox enforcement — off-site requests are blocked and logged.

General / open-ended

Ambiguous human prompts. Loose scoring — any paid fetch + substantive reply.
42 prompts across 12 sections · click any to send. Same set we use for our own benchmark — results at the bottom of the page.
💰
Out of Bips for that one.

The sandbox gave you 250 Bips — that route costs more. Production A.I. Keys start with a top-up of your choice. Buy Bips once, spend them across any Bippsi-enabled site.

Agent ↔ biptest.com HTTP
[--:--:--]// Bippsi agent console · sandbox-gated proxy
[--:--:--]// allowlist: biptest.com only — any other host is blocked and logged as cheat_blocked
[--:--:--]// auth: proxy auto-injects your bippsi_* bearer on every outbound call
[--:--:--]// max: 2 tool-call iterations per chat message, 30 messages per session, 250 Bips sandbox balance
[--:--:--]// each request and response will appear here when the agent runs
[--:--:--]// idle — start a session above and send the agent a task

Like what you saw?

Put a paywall on your own site, or build an agent that pays.

Our benchmark results

Same prompt set you see above, same live environment, run against every model at all three training levels. These are our numbers — you're seeing the same data we use to decide which models to recommend.

Updated 2026-05-10
Model Trained (Unsigned) Trained (Signed) Untrained
Qwen3 122B
Alibaba
— not yet run — not yet run — not yet run
Nemotron Nano 30B A3B
NVIDIA
— not yet run — not yet run — not yet run
GPT-OSS 120B
OpenAI
— not yet run — not yet run — not yet run
Llama 4 Scout 17Bx16E
Meta
— not yet run — not yet run — not yet run
Claude 3.5 Haiku
Anthropic
11 / 19 pass
4 confused · 4 error
— not yet run — not yet run
DeepSeek V4 Flash
DeepSeek
— not yet run — not yet run — not yet run
Gemma 4 26B A4B
Google
— not yet run — not yet run — not yet run
Llama 3.3 70B
Meta
— not yet run — not yet run — not yet run

Methodology. Each cell is the pass/fail split for that model running the premade prompts under that training level. A pass means the agent reached a successful paid response (2xx with a non-empty answer) within the session's turn budget. Confused = reached turn cap, gave a memory-only reply without fetching, or partially completed the request. Error = hard failure (provider timeout, sandbox block, model refused, or task abandoned mid-flow). Skipped cells are included intentionally so the benchmark table shows model coverage without fabricating scores.

Public sandbox. This page is a live demo environment. Sessions, chat transcripts, and any feedback you submit via the "Report failed prompt" button may be reviewed by the Bippsi team and used to improve our training data, our models, and the protocol itself. Don't paste anything private. Your sandbox key is ephemeral and never authorizes real-money transactions.

What is Bippsi?

Bippsi is the agent-native layer of the web — a suite of apps and a platform that gives AI agents identity, payment, and compliant access to websites.

How does Agent Initiative certify a website?

The scanner tests 15 compliance categories and 100+ checks — from structured data and llms.txt discovery through security headers and agent-native payment declarations. Sites scoring 85% or higher receive a public A.I. Certified badge.

Where can AI agents find Bippsi's access policy?

Everything live for agents is at /AGENTS.md, /llms.txt, /agents.json, and /openapi.json.

API endpoint: /api/v1/license-ninja/validate · OpenAPI: /openapi.json · MCP: /api/v1/mcp · Unified manifest: /bippsi-unified.md