Wiser is bankrupt — the consolidation window is open

You guarantee 99%+ product matching.
The runtime should match that reliability bar.

DataWeave ingests pricing, assortment, content, and digital-shelf signals from hundreds of retailer sites every day — powering Costco, Home Depot, QVC, Zappos, Meijer, Whirlpool, Pernod Ricard, and dozens more. Cloudflare's developer platform is the natural runtime for the scraping perimeter, the AI matching layer, and the per-brand tenancy that comes next.

Stack today: Apache + AWS EKS + us-east-1 · Anthropic verified on the apex TXT · dataweave.com not yet on Cloudflare. This sketch is what a no-rip-and-replace adoption path looks like.

"Considering switching from Wiser due to their bankruptcy? Learn more."
— Live banner on dataweave.com/us, June 2026. Wiser, the market's largest competitive-price-intelligence vendor, collapsed earlier this year. Every Wiser customer is in active vendor evaluation.
99%+
Product matching accuracy
15K+
SKUs / customer (Blain's case study)
51→76%
Content health lift (Bush Brothers)
4
Regions: US, UK, India, International
Trusted by the biggest names in retail & consumer brands
Costco· Home Depot· QVC· Zappos· Meijer· Whirlpool· Pernod Ricard· Lactalis· Bush Brothers· TATA 1mg· Metro

The hardest part of digital commerce analytics is the scraping perimeter.

DataWeave's job, underneath everything, is to read other people's websites at scale — reliably, accurately, and politely — and turn HTML into structured product, price, and content signal. Retailers fight back with anti-bot systems, geolocation gates, dynamic JS, and rate limits. Cloudflare runs both sides of that fight every day, which is exactly why the perimeter belongs at the edge.

DataWeave's scrape pipeline, sketched on Cloudflare primitives

From "give me Walmart.com's price for SKU X across 50 zip codes" to a normalized row in Veracite — the whole flow can move closer to the source.
SOURCES
Retailer sites & marketplaces
Walmart, Amazon, Target, Kroger...
PERIMETER
Workers + Browser Rendering API
geo-distributed, headless Chrome at the edge
EXTRACT + MATCH
Workers AI + Vectorize
attribute extraction, similarity matching
Why the edge wins this fight: Browser Rendering on Workers gives you globally distributed headless Chrome with no fleet to manage. Workers AI puts the LLM that extracts “price” from a 200KB product page directly next to the page itself. Vectorize sits on the same wire and answers “is this Walmart SKU the same product as that Target SKU?” in single-digit milliseconds. The scrape, the parse, the match, and the storage all live on the same plane — not three hops apart.

Four product lines, four developer-platform centers of gravity.

Pricing Intelligence, Digital Shelf Analytics, Assortment Analytics, and Content Optimization each have a different shape. Each one maps cleanly to a different Cloudflare primitive as its natural runtime — without ripping out anything you've already built on AWS.

PRODUCT 01

Pricing Intelligence → Workers + Hyperlocal Edge

"AI-driven pricing insights" requires reading competitor prices across locations, channels, and currencies. Hyperlocal analytics means scraping the same page from 50 zip codes to capture geo-priced SKUs — Workers run from 330+ POPs, so you can scrape "from inside" any market natively.

The wedge: Workers' POPs cover every fuel-pricing zone, every grocery delivery hub, every regional channel you need — without renting proxy fleets or coordinating regional EC2 instances. Hyperlocal scraping becomes a deployment target, not a procurement problem.
Workers Hyperlocal Browser Rendering Fuel pricing
PRODUCT 02

Digital Shelf Analytics → Vectorize + Durable Objects

Share of Search, Share of Media, content audit, ratings & reviews across marketplaces. Bush Brothers improved content health from 51%→76% using this product. Underneath, that's vector similarity ("how close is this brand's content to the category-best?") and per-brand alert state.

The wedge: Vectorize for content embeddings + DOs for per-brand monitoring state. Search rankings, content scores, and review sentiment all move into a single observable layer instead of three separate microservices.
Vectorize Durable Objects Workers AI Sentiment
PRODUCT 03

Assortment Analytics → D1 + Vectorize

Benchmark assortments, detect gaps, compare against competitors. Underneath, that's a relational join across millions of competitor SKUs combined with semantic similarity to spot "the same product but a slightly different SKU." D1 handles the join, Vectorize handles the similarity.

The wedge: D1 is small enough that you can ship per-tenant catalogs as their own databases — isolating Costco's assortment universe from Meijer's by construction, instead of by row-level security in a shared Postgres.
D1 Vectorize Workers Per-tenant
PRODUCT 04

Content Optimization → Workers AI + R2

Audit and improve product titles, images, and descriptions with AI-driven scoring. Each audit = an LLM pass over the title, a vision-model pass over the images, and a stored "before/after" recommendation. Repeat across millions of SKUs across hundreds of retailers.

The wedge: Workers AI runs the LLM + vision models directly at the edge, against R2-stored product images (zero egress). AI Gateway sits in front of Anthropic (already verified on your apex) for the bigger jobs — with semantic cache catching duplicate content scoring across SKUs.
Workers AI R2 AI Gateway Vision

Your AI stack, mapped to Cloudflare's matching primitives.

DataWeave's published AI architecture is already remarkably aligned with Cloudflare's developer platform. Below: every layer of your AI innovation page, mapped to the Cloudflare primitive that runs it most cheaply and observably.

DataWeave AI → Cloudflare primitive mapping

Pulled verbatim from dataweave.com/us/ai-innovation. Right column = the Cloudflare primitive built for exactly that workload.
DATAWEAVE LAYER
LLM Modeling
LLM-based attribute extraction, aiding high-precision product matching
CLOUDFLARE PRIMITIVE
Workers AI + AI Gateway
Open-weight models at the edge for cheap extraction; AI Gateway in front of Anthropic for the precision jobs; per-tenant cost attribution by default
DATAWEAVE LAYER
Advanced Computer Vision
Fine-grained object detection for precise classification using text and image embeddings
CLOUDFLARE PRIMITIVE
Workers AI + Vectorize + R2
Vision models run on Workers AI against images stored zero-egress in R2; embeddings indexed in Vectorize for sub-50ms similarity lookup
DATAWEAVE LAYER
GPT-based Business Analytics Layer
Natural language query-based reporting and visualization
CLOUDFLARE PRIMITIVE
AI Gateway + Workers + D1
NL->SQL through AI Gateway with caching; queries hit D1 per-tenant for isolation; Workers serve the dashboard with edge-low latency
DATAWEAVE LAYER
AI-aided Human-in-the-Loop (Veracite)
Continuous feedback loop to improve AI models over time
CLOUDFLARE PRIMITIVE
Workflows + Queues + R2
Durable, replayable feedback pipelines without Temporal; corrections stored in R2 for re-training; observable end-to-end inside the same control plane

The economics of 99%+ matching at scale.

AI Gateway pays for itself fastest in product-matching pipelines. Every "is this Costco SKU the same product as that Home Depot SKU?" query is a candidate for semantic caching — categories repeat, brands repeat, attributes repeat. The cache hit rate on commerce-domain queries is well above generic LLM workloads.

A back-of-the-envelope, not a quote
Modeled across attribute extraction + matching + content scoring at $5 / M blended tokens (mix of Workers AI + Anthropic)
DAILY AI CALLS ACROSS PIPELINES
~800K–2M
Attribute extraction per scraped page, similarity matching, content scoring, sentiment, plus NL-query reporting from the dashboard layer.
SEMANTIC CACHE HIT RATE
50–70%
Commerce queries cluster heavily: same SKUs reappear across retailers, same attribute schemas across categories, same content templates across brands.
ANNUAL INFERENCE SAVINGS
$0.6M–$1.8M
Plus per-customer + per-product-line attribution — the unlock for confident enterprise tier pricing as Wiser customers come over.
The real win isn't the savings, it's the attribution. When Costco, Home Depot, QVC, Zappos, and every Pernod Ricard region each have their own AI cost line — broken out by product line and by scrape volume — you can defensibly price the next wave of Wiser refugees at the right tier. Today that data lives across CloudWatch + manual reconciliation. With AI Gateway it lives in a dashboard, by default.

Four target markets, four tenants. Workers for Platforms is the boundary.

Digital & Omnichannel Retailers, Consumer Brands, Food & Grocery Delivery, and Travel customers all want the same DataWeave platform — but with different ingest cadences, different match weights, different alert thresholds, and increasingly different data-residency requirements (US / UK / India / International).

Per-target-market tenancy, sketched

Each segment gets its own Worker namespace inside Workers for Platforms. Same edge, same observability, isolated AI budget, isolated alert routing, isolated data residency.
🏪
Digital + Omnichannel Retailers
🛒
Consumer Brands
🍲
Food + Grocery Delivery
✈️
Travel
Shared control plane — Workers for Platforms + AI Gateway + Vectorize
one runtime · one observability surface · data residency enforced by region binding, not by RBAC

Current stack, with Cloudflare overlaid.

Every row below is sourced from public DNS records and HTTP response headers on dataweave.com. The Cloudflare column is additive — nothing here requires ripping out the AWS estate.

What's running today, and where Cloudflare slots in

Pure overlay model. AWS keeps doing what it does well; Cloudflare picks up what it does better.
LAYER
DATAWEAVE RUNS TODAY
CLOUDFLARE FIT
PUBLIC EDGE
Apache on AWS (44.x / 3.x IPs, us-east-1)
+ Cloudflare in front: CDN, WAF, Bot Mgmt, DDoS
APP + API PLANE
AWS EKS (eks-prod-common-green-ext.dweave.net ELB)
No change — CF Tunnel exposes EKS to the edge without DMZ surface
DNS
AWS Route 53 (ns-1356.awsdns-41.org, etc.)
+ Cloudflare DNS — unified with WAF + analytics in one pane
SCRAPE PERIMETER
Likely EC2 + headless Chrome fleet, regional proxies
+ Workers + Browser Rendering API — 330+ POPs, no fleet
AI MATCHING
Anthropic (verified on apex TXT) + in-house models
+ AI Gateway in front: cache, attribution, rate-limit, budget cap
VECTOR SEARCH
Likely Elasticsearch / OpenSearch / pgvector on EKS
+ Vectorize for hot-path similarity (sub-50ms from any POP)
DOCUMENT + IMAGE STORE
Likely S3 (us-east-1) for product images & reports
+ R2 as zero-egress mirror — cheap edge-served images
PER-CUSTOMER STATE
Watch lists, dashboards, custom rules in shared Postgres
+ Durable Objects — per-customer state at the edge, no DB tier
PER-MARKET ISOLATION
Four target markets sharing a multi-tenant app
+ Workers for Platforms — per-segment namespace by construction
EMAIL
Google Workspace, HubSpot, Rocketseed UK
+ Cloudflare Email Security as defense-in-depth (optional)
RUNTIME VERSION
Apache + PHP 7.0.33 (end-of-life since Jan 2019)
+ Workers as a modern runtime for new surfaces — without forcing a PHP migration
EMPLOYEE ACCESS
Likely VPN to AWS VPCs
+ Zero Trust Access — identity-aware proxy, no VPN

Why this is the right quarter to start the conversation

Wiser is bankrupt. Your homepage banner makes it clear: every Wiser customer is in active vendor evaluation right now, and DataWeave is positioned as the natural alternative. The companies you'll onboard in the next six months are the ones that will define your scale-up arc into 2027 — and the architecture you'll need to serve them at 99%+ accuracy is bigger than the one serving you today.

Anthropic is already a vendor for you. The TXT record on dataweave.com confirms it. AI Gateway in front of Anthropic is the lowest-friction observability + cost-control upgrade available — no model migration, no prompt rewrite, just a header change. You see attribution, you cache duplicates, you set per-customer budget caps from day one.

The scrape perimeter is the most strategic surface you own. It's also the surface that benefits most from running on 330+ POPs instead of a regional EC2 fleet. Browser Rendering on Workers is the developer-platform primitive built for the job — and the geo-distribution that hyperlocal analytics actually requires is a deployment configuration, not a procurement event.

Worth a 30-minute conversation with the team?

The interesting conversation is which of these primitives is closest to your current sprint: AI Gateway in front of Anthropic for matching, Browser Rendering for the hyperlocal scrape, or Vectorize for the digital-shelf similarity layer. I'd rather hear what's actually on your roadmap than guess.

Matt Holscher Calendar  → Reply by email