Most SEOs implement schema markup to chase visual features in search results. That’s the wrong frame. In 2026, the primary value of well-implemented structured data is something less visible and far more important: it tells AI systems — Google’s AI Overviews, ChatGPT, Perplexity, Gemini — exactly who you are, what you’re authoritative about, and whether you can be trusted as a citation source.
This is a deeper look at what schema actually does, what the research really says (including some inconvenient data the industry has been slow to absorb), and what a proper JSON-LD strategy looks like for 2026 and beyond.
What the numbers actually show
There’s a lot of schema markup statistics floating around the internet. Some are accurate. Some are optimistic to the point of being misleading. Let’s separate signal from noise.
Those numbers make a compelling case. But here’s the research that complicates the picture, and that you need to understand to implement schema strategically.
Ahrefs tracked 1,885 pages that added JSON-LD schema between August 2025 and March 2026, matched against 4,000 control pages. Adding schema produced no major uplift in AI citations on Google AI Overviews, AI Mode, or ChatGPT. A separate searchVIU experiment confirmed that when AI systems fetch a page in real time, none of them read the JSON-LD — they extract only visible HTML content.
This is genuinely important data. It punctures the “add schema and get cited by AI” narrative that has been circulating in SEO circles. But it doesn’t mean schema is unimportant. It means schema’s value operates at a different layer than most practitioners assume.
The two jobs schema actually does
Understanding this distinction is the difference between a schema strategy that delivers real results and one that’s theatrical effort for its own sake.
Job 1: Trigger rich results in traditional search
This is the visible, measurable, immediate value. Product schema unlocks price and availability display in Shopping results. Review schema surfaces star ratings. Recipe schema generates calorie and prep-time panels. Event schema displays dates and locations. These features demonstrably improve CTR — the 82% figure above is robust across multiple studies.
This job is governed by a clear rulebook: Google’s Rich Results guidelines. Eligibility matters, content alignment matters, and as of 2026, the list of supported schema types has narrowed.
Job 2: Entity disambiguation for the knowledge graph
This is the less visible, less immediate, and arguably more important job. Every major search engine and AI system maintains a knowledge graph — a structured map of entities (people, organizations, places, products, concepts) and the relationships between them.
When your schema says “this website belongs to Alneeko Technologies, which is the same entity as this LinkedIn profile, this Crunchbase entry, and this Wikidata node, and which is authoritative about Technical SEO, GEO, and Shopify optimization” — you’re not triggering a rich result. You’re seeding the knowledge graph with a clean, verifiable entity record.
“An AI is more comfortable citing an entity it can verify against its own knowledge graph. A page with clean Organization markup, verified sameAs links, and clear mainEntity declarations is simply a more citable source than an unstructured page saying the exact same thing.”
— RebelMouse structured data guide, 2026
The Ahrefs study didn’t find uplift from arbitrary schema addition because it was measuring the wrong outcome. The entity disambiguation job doesn’t produce an immediate citation bump — it builds the underlying trust infrastructure that makes AI citation possible in the first place.
What changed in 2026: the deprecations you need to know
Google made two significant schema-related announcements in the six-month window between November 2025 and May 2026. Misreading either one will either leave you doing unnecessary cleanup work or missing a real change in how schema strategy should be prioritised.
November 2025: Seven schema types removed from rich result eligibility
| Schema Type | Status | Why |
|---|---|---|
| Practice Problem | No rich result | Low adoption, educational use case too narrow |
| Dataset | Restricted | Now only serves Dataset Search, not general results |
| Sitelinks Search Box | Deprecated | Integrated into core search infrastructure |
| Special Announcement | Deprecated | COVID-specific markup no longer needed |
| Q&A | No rich result | Limited adoption, overlap with other types |
| HowTo | Restricted | Demoted on supplementary content; primary content still eligible |
| FAQPage (rich result) | Dead as of May 7, 2026 | Google ended the feature formally; schema remains valid JSON-LD |
FAQPage and HowTo as JSON-LD schema types are not dead. The rich result display feature was removed. The underlying structured data is still parsed by Bingbot, PerplexityBot, voice-assistant indexers, and RAG crawlers. Keep FAQPage schema on authoritative, answer-rich pages. Remove it from thin or supplementary content where it was added opportunistically.
Critically, 31 schema types retain full rich result support as of March 2026. The types with the strongest continuing performance are tied to specific user intent: product availability, event timing, recipe details, local business information. If your schema strategy was built on one of the deprecated types, the cleanup is simple. If it was built on Product, Article, LocalBusiness, Review, or Event — nothing material has changed.
The @graph pattern: why it matters more than any single schema type
Most schema tutorials teach you to add isolated schema blocks — one FAQPage block on your FAQ, one Product block on your product pages, one Article block on your blog. This is schema as a collection of disconnected signals. It works for triggering individual rich results. It does almost nothing for entity disambiguation.
The @graph pattern is different. It declares multiple entities in a single JSON-LD block and, critically, establishes the relationships between them using stable @id identifiers. Your organization, your website, your team members, your service pages — all linked in a machine-readable graph that mirrors how a knowledge graph models the real world.
Here’s what a minimal but properly structured @graph block looks like for a digital services company:
/* alneeko.com — global @graph schema, site-wide in <head> */ { "@context": "https://schema.org", "@graph": [ { "@type": "Organization", "@id": "https://alneeko.com/#organization", "name": "Alneeko Technologies", "url": "https://alneeko.com", "logo": { "@type": "ImageObject", "url": "https://alneeko.com/logo.png" }, "foundingLocation": { "@type": "Place", "name": "Frankfurt, Germany" }, "knowsAbout": [ "Technical SEO", "Generative Engine Optimization", "Schema Markup", "Shopify SEO", "Klaviyo Email Automation" ], "sameAs": [ "https://www.linkedin.com/company/alneeko", "https://www.crunchbase.com/organization/alneeko", "https://www.upwork.com/ag/alneeko" ] }, { "@type": "Person", "@id": "https://alneeko.com/#waqar-ahmed", "name": "Waqar Ahmed", "jobTitle": "Founder", "worksFor": { "@id": "https://alneeko.com/#organization" }, "knowsAbout": [ "JSON-LD Schema Markup", "AI Answer Engine Optimization", "Shopify Technical SEO" ], "sameAs": [ "https://www.linkedin.com/in/waqar-ahmed-alneeko" ] }, { "@type": "WebSite", "@id": "https://alneeko.com/#website", "url": "https://alneeko.com", "publisher": { "@id": "https://alneeko.com/#organization" } } ] }
Notice what this block does that isolated schema doesn’t: it creates a network. The Person entity references the Organization via @id. The WebSite entity references the Organization. External platforms are linked via sameAs. The knowsAbout fields explicitly claim topical authority. If Google, Perplexity, or any AI crawler reads this once and indexes it, they know with high confidence what this entity is and what it’s authoritative about.
The sameAs property: your shortest path to entity verification
Of all the properties in the Organization and Person schema types, sameAs has the highest leverage relative to the effort required to implement it. It takes five minutes to add. Its impact on knowledge graph entity resolution is disproportionate.
The logic is simple. A search engine’s knowledge graph already has nodes for LinkedIn, Crunchbase, Wikidata, and similar platforms. Those platforms have their own entity data. When your schema declares a sameAs link to your LinkedIn company page, you’re not just providing a URL — you’re telling the graph “the entity at this node and the entity at that LinkedIn node are the same thing.” The graph can then merge the attributes from both sources, creating a richer, more confident entity record.
The highest-confidence sameAs targets, roughly in order of trust signal strength:
- Wikidata (if applicable — establishes public record existence)
- LinkedIn (near-universal trust signal for business entities)
- Crunchbase (strong for technology and services companies)
- GRID (for academic and research organizations)
- Official government or regulatory registrations
- Industry directories with editorial curation
Don’t add sameAs to low-quality directories or platforms where your profile is thin. The goal is corroboration by trusted external sources — not volume.
Schema for AI visibility: what actually works
Given that direct schema parsing by AI systems during real-time retrieval appears to be minimal, the question becomes: how does schema contribute to AI visibility at all?
The answer is indirect but real. It operates through three mechanisms.
1. Knowledge graph entity trust
As discussed above, well-implemented entity schema builds your organization’s node in the knowledge graph. AI systems trained on or augmented by knowledge graph data are more likely to include a clearly-defined, verifiable entity as a citation source. This is a pre-training and indexing-time benefit, not a real-time retrieval benefit.
2. Crawl confidence and indexation quality
The Search Engine Land controlled experiment referenced earlier found that the page with well-implemented JSON-LD not only appeared in an AI Overview — the page with no schema wasn’t even indexed. Schema markup is part of the signal set that tells crawlers “this content is well-organized, accurate, and worth indexing deeply.” Better indexation equals better AI inclusion eligibility.
3. Structured extraction compatibility
While AI systems don’t appear to read JSON-LD during real-time retrieval, they do extract visible content with high fidelity. Schema markup trained on clean, schema-rich content creates better extraction patterns. The speakable property — a WebPage block with a SpeakableSpecification pointing at your core answer elements — explicitly tells Google AI which parts of your page are best suited for voice and AI extraction. This is one of the genuinely forward-looking schema implementations for 2026.
A phased implementation approach
Rather than treating schema as a one-time technical task, think of it as a layered build. Each phase delivers discrete value and creates the foundation for the next.
-
1Global entity layer (deploy once, site-wide) Organization + Person + WebSite in a single @graph block. Add sameAs links. Implement knowsAbout. This is the foundation for everything else and takes a few hours to do correctly.
-
2Content schema per template Article on blog posts, Service on service pages, Product on product pages. These are template-level implementations — set once in your CMS or theme, populate dynamically. Validate each type against the Rich Results Test.
-
3High-intent schema for conversion pages For e-commerce: full Product markup including availability, condition, price, and Review aggregates. For local: LocalBusiness with opening hours, address, geoCoordinates. For events: complete Event markup with performer and organizer references. These types have the clearest rich result ROI.
-
4AI-forward extensions Add Speakable markup on authoritative answer pages. Consider DefinedTermSet on glossary pages. Review BreadcrumbList on navigational pages. None of these trigger classic rich results — they’re signals for AI extraction and knowledge graph coherence.
-
5Audit and governance Schema markup degrades over time — pages change, content migrates, CMS updates break JSON-LD injection. Monthly validation against Google Search Console’s rich results report and Schema.org’s validator (which validates more than GSC alone) is how you protect the investment.
The plugin collision problem
One underappreciated schema markup failure mode, particularly on WordPress and Shopify sites, is plugin collision. When two plugins each attempt to inject Organization schema for the same page, Google receives conflicting entity declarations and may suppress both from rich results. This is not a theoretical edge case — it’s common on any site with three or more active SEO or e-commerce plugins.
The correct approach is a single source of truth for each entity type. Audit what schema is currently being injected using a raw source inspection or the Schema.org validator. If you find duplicate @type: "Organization" declarations, decide which source should own that schema and disable or configure the others accordingly.
The measurement gap
Google Search Console blends AI Overview impressions with traditional search result impressions when the same URL appears in both. If a page ranks in position 4 and also appears in an AI Overview, GSC counts it once. This means standard CTR analysis will undercount the actual exposure your pages are receiving through AI surfaces.
The practical implication: don’t benchmark schema success exclusively on click volume. Track rich result impressions separately in GSC’s “Search Appearance” filter. Track Knowledge Panel accuracy for your organization. Monitor whether AI systems describe your organization correctly in response to branded queries — this is a leading indicator of entity resolution quality.
What to stop doing
A useful exercise for any schema audit is to look for implementations that were added for reasons that no longer hold:
- FAQPage schema on thin content — the rich result is gone; if the content itself isn’t genuinely authoritative and answer-rich, remove it.
- HowTo schema on supplementary content — Google’s March 2026 update explicitly demoted this usage pattern.
- Schema types that don’t match page content — still the most common implementation error and still the most likely cause of manual actions and schema suppression.
- Quantity over quality — 50 schema types poorly implemented is worse than 6 types correctly implemented. Google’s documentation is explicit: accurate, well-maintained structured data outperforms broad but inconsistent coverage.
The bigger picture
Schema markup has always been a bet on machine-readability. The early bet was on search engines needing help to classify content. The current bet is on a more complex ecosystem of AI systems — answer engines, voice assistants, agentic tools — that need to rapidly assess whether a source is trustworthy enough to surface to a user.
The underlying principle hasn’t changed. What has changed is the stakes: a URL that can’t be confidently attributed to a known, verified entity is increasingly invisible to systems that make citation decisions at millisecond speed.
Getting your entity schema right isn’t a technical formality. It’s how you make yourself legible to the infrastructure that now mediates most of the web’s information flow.

Leave a Reply