biptest is a live demo where visitors give an AI agent a 250-Bip sandbox balance and watch it shop for priced content on biptest.com in real time. No account required.

Is this using real money?

No. The 250 Bips are synthetic and issued to the demo session only. You cannot withdraw them, convert them to USD, or carry them across sessions. biptest.com earnings accrue to a payout-ineligible system account, so nothing leaves the sandbox.

Can the AI access sites other than biptest.com?

No. The agent is server-sandboxed to biptest.com. Any attempt to call other hosts is blocked by our proxy and logged as a jailbreak attempt. This is a safety feature, not a bug.
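
As a rough illustration of the allowlist behavior described above, the sketch below checks a request URL's host against a single allowed host and records a blocked attempt. The names `ALLOWED_HOSTS`, `is_allowed`, and `check_request`, the log record shape, and the exact-host matching rule are assumptions for illustration; the page only states that hosts other than biptest.com are blocked and logged.

```python
from urllib.parse import urlsplit

# Assumption: only the exact host is allowed. Whether subdomains such as
# www.biptest.com are accepted is not stated on the page.
ALLOWED_HOSTS = {"biptest.com"}


def is_allowed(url: str) -> bool:
    """Return True if the URL's host is on the sandbox allowlist."""
    host = urlsplit(url).hostname  # lowercased by urlsplit; None if missing
    return host is not None and host in ALLOWED_HOSTS


def check_request(url: str, log: list) -> bool:
    """Hypothetical proxy hook: allow on-site requests, block and log the rest."""
    if is_allowed(url):
        return True
    log.append({"event": "cheat_blocked", "url": url})
    return False
```

Matching on the parsed hostname rather than on a string prefix means a look-alike such as `biptest.com.evil.example` is rejected.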

Live Demo Sandbox

Watch an AI agent pay for content

Pick an AI. Get a sandbox key. Watch it crawl biptest.com, hit real 402 paywalls, pay Bips, and read what's priced. Test the system for free — no account needed.

Configure your agent

Pick your AI

Agent transaction mode

Trained — Unsigned (default)

Agent has a Bippsi key and pays automatically when it hits paywalled content. Fast, simple, fits most use cases.

Trained — Signed

Agent has a Bippsi key plus a per-transaction confirmation step. Use this for high-value transactions where a clear audit trail matters.

Untrained (no key)

Agent has no Bippsi key — what most AIs look like the first time they meet Bippsi. They should tell you what happened and where to get a key.
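
The three modes above differ only in what the agent does when it reaches priced content. The sketch below is one way to express that decision, not the demo's actual logic: the `Mode` enum, the `on_paywall` function, its return strings, and the `confirm` callback (standing in for the per-transaction confirmation step) are all hypothetical.

```python
from enum import Enum


class Mode(Enum):
    TRAINED_UNSIGNED = "trained_unsigned"
    TRAINED_SIGNED = "trained_signed"
    UNTRAINED = "untrained"


def on_paywall(mode: Mode, price_bips: int, confirm=None) -> str:
    """Decide what the agent does when it hits a priced resource.

    `confirm` is a hypothetical callback that takes the price and returns
    True or False; it is only consulted in Signed mode.
    """
    if mode is Mode.UNTRAINED:
        # No key: report what happened and where to get a key.
        return "explain"
    if mode is Mode.TRAINED_SIGNED:
        approved = confirm is not None and confirm(price_bips)
        return "pay" if approved else "decline"
    # Trained, Unsigned: pay automatically.
    return "pay"
```

For example, `on_paywall(Mode.TRAINED_SIGNED, 50, confirm=lambda p: p <= 10)` declines, while the same call in Unsigned mode pays without asking.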

Our sandbox is your sandbox. This is the same environment we use to benchmark every model, training configuration, and protocol change we ship — a true 1:1 with our internal testing. What you see here is what we see: no hidden backend systems, no separate "real" test harness. Every result, every edge case, every improvement lands in the same place for both of us.

Ephemeral. 30 turns. 30-minute window.

Tell the agent what to do

Start a session, then tell the agent what to do on biptest.com.

Premade prompts to try

— or just chat above with the agent

Page tests

Priced article reads across every content category.

Form tests

Priced form submissions — agent posts fields, server charges + returns a response.

Button tests

Priced button clicks — each press is a separate charge.

Content block tests

Priced in-page sections — unlock via a GET with a Payment header.

Download tests

Priced file downloads — PDFs and reports.

Regression / negative tests

Targeted probes for infrastructure bugs we've fixed and want to keep fixed.

Markets Intelligence Hub

Exercises all 5 element types on one page in sequence.

Site Pass tests

Click 1 immediately. Wait ~60s, then click 2. Wait until ~150s after click 1, then click 3.

Out-of-Bips demo

Premium report is priced at 2000 Bips — above the sandbox cap, triggers the upsell flow.

Try to cheat

Sandbox enforcement — off-site requests are blocked and logged.

General / open-ended

Ambiguous human prompts. Loose scoring — any paid fetch + substantive reply.

42 prompts across 12 sections · click any to send. Same set we use for our own benchmark — results at the bottom of the page.
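
The priced tests above all follow the same pattern: a request meets a 402, the agent pays, and the retry succeeds. The sketch below walks through that loop against a stand-in server so it runs without network access. Only the 402 status and the existence of a `Payment` header come from this page; the header value format (`bips=N`), the `price=N` challenge body, the path, the price, and the names `fake_server` and `fetch_priced` are assumptions.

```python
def fake_server(path: str, headers: dict) -> tuple:
    """Stand-in for biptest.com: returns (status, body). Purely illustrative."""
    price = 3  # hypothetical price in Bips
    if headers.get("Payment") == f"bips={price}":
        return 200, "unlocked section text"
    return 402, f"price={price}"


def fetch_priced(path: str, balance: int, send=fake_server) -> tuple:
    """GET a priced resource, paying on a 402 if the balance allows.

    Returns (body_or_None, remaining_balance).
    """
    status, body = send(path, {})
    if status != 402:
        return body, balance
    price = int(body.split("=", 1)[1])  # assumed "price=N" challenge body
    if price > balance:
        return None, balance  # cannot afford; nothing is charged
    status, body = send(path, {"Payment": f"bips={price}"})
    if status == 200:
        return body, balance - price
    return None, balance
```

With a 250-Bip balance, `fetch_priced("/blocks/1", 250)` returns the unlocked text and a remaining balance of 247. With a balance below the price, it returns `None` and leaves the balance untouched, which is the situation the Out-of-Bips demo exercises.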

Out of Bips for that one.

The sandbox gave you 250 Bips — that route costs more. Production A.I. Keys start with a top-up of your choice. Buy Bips once, spend them across any Bippsi-enabled site.

Buy Bips · Learn about A.I. Keys

Agent ↔ biptest.com HTTP

[--:--:--]// Bippsi agent console · sandbox-gated proxy
[--:--:--]// allowlist: biptest.com only — any other host is blocked and logged as cheat_blocked
[--:--:--]// auth: proxy auto-injects your bippsi_* bearer on every outbound call
[--:--:--]// max: 2 tool-call iterations per chat message, 30 messages per session, 250 Bips sandbox balance
[--:--:--]// each request and response will appear here when the agent runs
[--:--:--]// idle — start a session above and send the agent a task
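
The limits listed in the console header can be pictured as simple counters. The sketch below is a minimal model under stated assumptions: the `Session` class and `run_message` function are hypothetical, "2 tool-call iterations" is read as at most two tool calls per chat message, and the 30-minute window is left out.

```python
from dataclasses import dataclass

MAX_TOOL_ITERATIONS = 2   # per chat message
MAX_MESSAGES = 30         # per session
STARTING_BIPS = 250       # sandbox balance


@dataclass
class Session:
    messages_used: int = 0
    bips_spent: int = 0

    def start_message(self) -> bool:
        """Count one chat message; refuse once the session cap is reached."""
        if self.messages_used >= MAX_MESSAGES:
            return False
        self.messages_used += 1
        return True

    def charge(self, price_bips: int) -> bool:
        """Deduct a price if the remaining balance covers it."""
        if price_bips > STARTING_BIPS - self.bips_spent:
            return False
        self.bips_spent += price_bips
        return True


def run_message(session: Session, tool_calls: list) -> list:
    """Run at most MAX_TOOL_ITERATIONS tool calls for one chat message."""
    if not session.start_message():
        return []
    return [call() for call in tool_calls[:MAX_TOOL_ITERATIONS]]
```

Under this reading, a message that requests three tool calls gets results for the first two only, and a charge that exceeds the remaining balance is refused without changing `bips_spent`.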

Like what you saw?

Put a paywall on your own site, or build an agent that pays.

Set up pricing on my site · Get an A.I. Key for my agent · Read the API docs

Our benchmark results

Same prompt set you see above, same live environment, run against every model at all three training levels. These are our numbers — you're seeing the same data we use to decide which models to recommend.

Updated 2026-05-10

Model	Vendor	Trained (Unsigned)	Trained (Signed)	Untrained
Qwen3 122B	Alibaba	not yet run	not yet run	not yet run
Nemotron Nano 30B A3B	NVIDIA	not yet run	not yet run	not yet run
GPT-OSS 120B	OpenAI	not yet run	not yet run	not yet run
Llama 4 Scout 17Bx16E	Meta	not yet run	not yet run	not yet run
Claude 3.5 Haiku	Anthropic	11 / 19 pass 4 confused · 4 error	not yet run	not yet run
DeepSeek V4 Flash	DeepSeek	not yet run	not yet run	not yet run
Gemma 4 26B A4B	Google	not yet run	not yet run	not yet run
Llama 3.3 70B	Meta	not yet run	not yet run	not yet run

Methodology. Each cell is the pass/fail split for that model running the premade prompts under that training level. A pass means the agent reached a successful paid response (2xx with a non-empty answer) within the session's turn budget. Confused = reached turn cap, gave a memory-only reply without fetching, or partially completed the request. Error = hard failure (provider timeout, sandbox block, model refused, or task abandoned mid-flow). Skipped cells, shown as "not yet run", are included intentionally so the benchmark table shows model coverage without fabricating scores.
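
The scoring rules above can be read as a small classifier. The sketch below is one possible reading, not the benchmark's actual code: the record keys (`hard_failure`, `hit_turn_cap`, `fetched`, `partial`, `status`, `answer`), the function names, and the choice to treat any remaining case as confused are assumptions.

```python
from collections import Counter


def classify(run: dict) -> str:
    """Map one prompt run to 'pass', 'confused', or 'error'.

    `run` uses hypothetical keys: hard_failure (bool), hit_turn_cap (bool),
    fetched (bool), partial (bool), status (int or None), answer (str).
    """
    if run.get("hard_failure"):
        return "error"  # provider timeout, sandbox block, refusal, abandonment
    if run.get("hit_turn_cap") or not run.get("fetched") or run.get("partial"):
        return "confused"
    status = run.get("status")
    if status is not None and 200 <= status < 300 and run.get("answer", "").strip():
        return "pass"  # 2xx paid response with a non-empty answer
    return "confused"  # assumption: anything else counts as confused


def summarize(runs: list) -> str:
    """Format a table cell in the same shape as the results table."""
    counts = Counter(classify(r) for r in runs)
    return (f"{counts['pass']} / {len(runs)} pass "
            f"{counts['confused']} confused · {counts['error']} error")
```

Checking hard failures first keeps a timed-out run from being counted as confused just because it also never fetched anything.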

Public sandbox. This page is a live demo environment. Sessions, chat transcripts, and any feedback you submit via the "Report failed prompt" button may be reviewed by the Bippsi team and used to improve our training data, our models, and the protocol itself. Don't paste anything private. Your sandbox key is ephemeral and never authorizes real-money transactions.

