Portable web search via the opper:web_search server-side tool. One tool entry that works on every compat endpoint, with native server-side search where the routed model supports it and Opper-managed fallback where it does not.
Server-side web search comes in two flavors at Opper.Provider-native shapes forward verbatim to the routed provider: web_search_20250305 on Anthropic, {type:"web_search"} on OpenAI Responses, {googleSearch:{}} on Google.The opper:web_search tool documented here is the portable cross-provider shape. It gives you one tool entry, identical response artifacts, and no per-provider branching in your code.Pick the canonical shape when you want a single tool entry that works regardless of which model the request lands on. Pick a native shape when you want the model’s provider-specific search behavior and are happy to author per-provider request bodies.
The engine field controls how Opper routes the search.
Value
Behavior
auto (default)
Use the model’s native server tool if the routed model+endpoint supports one. Fall back to Opper’s engine otherwise.
native
Require native. Returns 400 if the routed model has no native web search on the called endpoint.
opper
Always use Opper’s engine, regardless of native availability. Opper picks the backend. Useful when you want a uniform response shape across every model.
jina
Pin Opper’s engine to the Jina backend.
exa
Pin Opper’s engine to the Exa backend. Exa supports a true publish-date freshness window.
jina and exa are backend pins within Opper’s engine: like opper they always run the server-side search (never native), but force a specific provider. Availability depends on server-side configuration. Pinning a backend that isn’t configured returns a tool-call error.Today, native web search is available on:
Anthropic Claude on /v3/compat/v1/messages (both direct anthropic/ and Vertex gcp/claude-* routes)
OpenAI on /v3/compat/responses for Responses-API models (gpt-5* family, gpt-5-search-api, etc.)
Google Gemini on /v3/compat/v1beta/models/{model}:generateContent
engine: "auto" will route to native on those combinations and to Opper’s engine for everything else (Mistral, DeepSeek, Alibaba Qwen, Groq, etc.).
The response shape matches whatever endpoint you called, so your code reads citations from the same place regardless of engine.
Anthropic-shape: content[] contains server_tool_use, web_search_tool_result (URLs + snippets), and text blocks whose citations[] (web_search_result_location) reference the sources. See server-side tools for the field-by-field shape.
OpenAI-shape (Responses): output[] contains web_search_call items with the search queries and a message whose content[].output_text.annotations[] carry url_citation entries.
OpenAI-shape (Chat Completions): choices[0].message.annotations[] carry the url_citation entries, mirroring the Responses shape.
Google-shape: candidates[0].groundingMetadata carries webSearchQueries, groundingChunks[].web (URI + title), and groundingSupports.
On the native route (engine: "auto" landing on a model with a native server tool) citations are anchored per statement: each one points at the exact span of prose the model grounded on, exactly as the provider emits them.On Opper’s engine route (engine: "opper", "jina", or "exa") citations are block-level: every source the search returned is attached to the answer text. You get the same fields in the same place, but without per-sentence character offsets, and the Google searchEntryPoint (provider-hosted “Search Suggestions” HTML) is not emitted.
count is the number of searches the model actually ran, unit is the per-search cost, cost is the total. engine is also surfaced when the request routed through Opper’s engine.
Opper’s engine is billed per search. Native searches (engine: "auto" landing on a model with a native tool, or engine: "native") are billed at the routed provider’s rate and passed through — the exact cost still shows up under usage.opper.cost.tools.web_search.
provider rate, ≈ $14 / 1,000 queries (Gemini 3) — see Google
1 The underlying index for the native providers is as publicly reported, not officially confirmed by the provider. Native provider prices change independently of Opper; the linked pages are authoritative. More Opper engine backends will be added over time.2 Data residency reflects where the search runs. Native searches execute on the provider’s own infrastructure — Opper doesn’t control their region, and only the Jina backend offers EU residency. This is separate from result localization: use the country parameter (above) to bias results toward a geography, which does not change where the search runs.
Each server-side search appears in the trace view as a server_side_tool step nested under the turn that requested it, showing the query, the results it returned, and the per-search cost — so the search is never a black box.
The quick start keeps the tool entry minimal. This example sets every parameter on one request so you can see them in one place. Only type is required; everything else is optional and falls back to the engine or handler default.