Engagemii AEO Data Marketplace
Engagemii grades every brand on the web from 0 to 10 for how well AI engines β ChatGPT, Claude, Perplexity, Gemini β can find, parse, and cite them. The score breaks down into six sub-scores covering structured data, content structure, entity clarity, E-E-A-T signals, technical AEO, and AI discoverability. We pair every record with the brand's industry, audience type, tech stack, geography, social URLs, and how many times AI bots have actually crawled them.
Agencies use it to find clients who need fixing. SaaS companies embed the scores into their products. Researchers track which industries are falling behind. Then pull on-demand Fix-It Kits for any brand and walk into sales meetings with the exact patches to apply.
Every column in the CSV
38 columns per brand.
The AEO score
aeoScore
Overall AEO score from 0 to 10 β how well AI engines like ChatGPT, Claude, and Perplexity can find, parse, and cite this brand.
structuredData
Sub-score 0-10 for JSON-LD / schema.org markup quality.
contentStructure
Sub-score 0-10 for headings, sections, and answerable copy.
entityClarity
Sub-score 0-10 for how clearly the brand identifies itself as an entity.
eeaT
Sub-score 0-10 for Experience, Expertise, Authority, and Trust signals.
technicalAeo
Sub-score 0-10 for robots.txt, llms.txt, sitemaps, and crawler accessibility.
aiDiscoverability
Sub-score 0-10 for AI-engine-specific signals (allowlist, ai.txt, etc.).
AI engine crawl activity
aiBotHitsTotal
How many times AI bots (GPTBot, ClaudeBot, PerplexityBot, Applebot, AmazonBot, Meta AI) have crawled this brand. Real demand signal.
aiBotsSeen
Which AI bots have actually visited (semicolon-separated).
Identity & classification
companyName
Brand name as it appears on their site.
domain
Root domain (e.g. shopify.com).
websiteUrl
Full URL we crawled.
industry
Granular industry classification (100+ buckets like 'marketing and advertising', 'aviation & aerospace', 'civic & social organization').
employeeSize
Employee count band (e.g. 1-10, 11-50, 51-200, 201-500, 501-1000, 1001-5000, 5001-10000, 10000+).
yearFounded
Year the company was founded (when known).
audienceType
B2B, B2C, or Mixed β classified from on-page signals.
audienceConfidence
Classifier confidence 0-1.
detectedPlatform
What runs the site: shopify, wordpress, wix, webflow, woocommerce, squarespace, ghost, hubspot, etc.
Geography
city
City (when known).
state
US state, lowercased.
country
Country.
Social URLs (direct links)
LinkedIn URL found on the site.
Instagram URL.
Facebook URL.
X (Twitter) URL.
tiktok
TikTok URL.
youtube
YouTube channel URL.
Timestamps
addedToDb
When we first added this brand.
lastUpdated
Last time any field changed.
scoredAt
Last time we re-scored this brand.
Sign up free, browse the data and build your filter inside the portal, then pick a tier and pay only when you're ready to download.
No subscriptions. No auto-renew.
A la carte
From $500
minimum order
10K record minimum
Build any mix you want. Pay per record, pay per phone.
β’ $0.05 per record, $0.50 per delivered phone
β’ Build a per-industry allocation in the portal
β’ Pick how many SaaS, e-commerce, etc.
β’ Emails included free where available
Starter
$1,500
one-time
100,000 records
Save 70%
One vertical or geography in volume.
β’ 100,000 records, your allocation
β’ Volume discount baked in
β’ All 38 columns per row
β’ Fix-It Kits not included at this tier
Agency
$10,000
one-time
1,000,000 records
Save 80%
For agencies pitching clients and lead-gen teams.
β’ 1,000,000 records, your allocation
β’ Fix-It Kit access unlocked
β’ 50 Fix-It Kits included (worth $4,950)
β’ $5 per additional kit
Enterprise
$20,000
one-time
Full US dataset
Save 84%
Everything we have. Product integrations and broad plays.
β’ Every scored US brand in the database (2M+)
β’ Fix-It Kit access unlocked
β’ 200 Fix-It Kits included
β’ $3 per additional kit
β’ API access for programmatic use
Custom
Contact
sales
Talk to us
Live feed and integration help. Data platforms, AI labs, enterprise.
β’ Full dataset plus ongoing feed updates
β’ Custom integration (API, S3, Snowflake, etc.)
β’ Volume-based pricing
β’ Dedicated support
How we built this
We crawl and score every brand on the web continuously. The scorer pulls each site's homepage, parses on-page content, JSON-LD structured data, and meta tags, then runs the AEO scoring engine that powers our paid Fix-It Kit product at engagemii.com/aeo. Each brand gets one overall 0-10 score plus six sub-scores. Scores are refreshed when sites change.
We track every AI bot hit (GPTBot, ClaudeBot, PerplexityBot, Applebot, AmazonBot, Meta AI) and tie those hits to the brand pages they visited. The aiBotHitsTotal and aiBotsSeen columns let you see which brands are already being ingested by AI engines β real demand signal you can't get elsewhere.
Brands are classified into 50+ granular industries and a B2B/B2C/Mixed audience type. The detectedPlatform column tells you the stack (Shopify, WordPress, Webflow, etc.) so you can filter to clients on the platforms you know.