AI Summary
TLDR: JSON-LD is the correct default for AI search structured data in 2026. It is the format Google explicitly recommends, the format ChatGPT and Perplexity parse most reliably, and the format that survives template changes without breaking. Microdata still has narrow uses (e-commerce platforms with legacy templates, accessibility tooling), but those uses are shrinking. This guide covers the decision framework, the migration path, the four schema types that drive 80% of AI citations, and the validation workflow.
Why JSON-LD won the structured data race
When Google added structured data parsing in 2009, it supported three formats: Microdata, RDFa, and JSON-LD. JSON-LD shipped last (2014) but Google publicly endorsed it as the preferred format by 2017. The reason is architectural – JSON-LD lives in a single script tag, decoupled from the visible HTML, so it survives template refactors, theme changes, and CMS migrations that routinely break inline Microdata attributes.
AI search amplified that advantage. Retrieval pipelines parse JSON-LD as a clean structured object without having to walk the DOM. Microdata requires the parser to assemble fragments from itemprop attributes scattered across nested HTML. When the HTML changes, the parser silently produces incomplete or wrong objects. JSON-LD is parsed once, validated once, and either works or does not.
The four schema types that drive 80% of AI citations
Across client audits and the public Schema.org adoption data from Web Almanac 2025, four schema types punch above their weight in AI search citations:
- Article / BlogPosting with full author Person reference, datePublished, dateModified, and image. Drives editorial citations across AI Overviews and Perplexity.
- FAQPage with at least 3 atomic question-answer pairs. Disproportionately cited in ChatGPT and Copilot answers because the structure maps directly to RAG chunking.
- HowTo for procedural content. Drives the ‘steps’ formatted answers in AI Overviews.
- Product with offers, aggregateRating, and brand. Required for any e-commerce surface in AI shopping experiences (now live on ChatGPT, Perplexity, and Google).
Implement these four cleanly before adding any niche schemas (Course, Recipe, Event). The marginal lift on niche schemas is small unless they directly match your business.
The decision framework: JSON-LD or Microdata
Use this checklist to pick the format for any new schema implementation:
- Does Google’s Structured Data documentation list both formats? Default JSON-LD.
- Are you on Shopify, WooCommerce, or Magento with a theme that injects Microdata? Audit it, replace with JSON-LD where possible.
- Do you need to validate accessibility metadata at the element level? Microdata only.
- Are you building a fresh site in 2026? JSON-LD always.
- Does your CMS have a JSON-LD plugin (Yoast, RankMath, Schema Pro)? Use it instead of writing custom Microdata.
How to migrate Microdata to JSON-LD without breaking anything
Migration is safer than most teams expect because the two formats can co-exist temporarily. Run them in parallel for 30 to 60 days, validate both, then strip Microdata once JSON-LD is confirmed live and indexed.
- Crawl the site with Screaming Frog and export all current schema types and counts.
- Map each schema type to its JSON-LD equivalent in your CMS or template layer.
- Add JSON-LD blocks to staging and validate with Google’s Rich Results Test plus Schema.org validator.
- Push to production. Submit updated sitemap to GSC.
- Wait for recrawl confirmation in GSC’s Enhancement reports.
- Remove Microdata attributes from templates only after JSON-LD is confirmed indexed.
Common mistake: stripping Microdata before JSON-LD is recrawled. You end up with a window where neither format is parseable and rich result eligibility drops.
Validation tooling that actually works
Google’s Rich Results Test is the source of truth for eligibility, but it does not catch every schema-related bug. Use a layered validation stack:
- Google Rich Results Test – eligibility for specific rich result types.
- Schema.org Validator (validator.schema.org) – generic schema correctness, broader than Google.
- Schema App or Schema.dev (paid) – bulk validation across thousands of URLs.
- Lighthouse SEO audit – flags missing required properties on common schemas.
Run the full stack quarterly. Schema validation drift is one of the most common silent bugs in long-running content sites because theme updates, plugin updates, and CMS migrations all touch the structured data layer.
Common JSON-LD mistakes that suppress AI citations
Even teams with JSON-LD implemented often have these bugs that quietly prevent AI engines from parsing the data correctly:
- String dates instead of ISO 8601: ‘January 2026’ should be ‘2026-01-15’.
- Missing @context: Every JSON-LD block must start with ‘@context’: ‘https://schema.org’.
- Author as a string instead of Person object: Person objects with sameAs links to LinkedIn and Twitter triple author entity recognition.
- Image referenced by relative URL: Schema.org requires absolute URLs.
- FAQPage with a single question: Google requires 3+ FAQ items for rich result eligibility.
Audit your top 20 templates for these five mistakes. Most sites fix them in under 4 hours of dev time and see structured data coverage in GSC double within 30 days.
Frequently Asked Questions
Should I use both JSON-LD and Microdata?
Does Schema markup directly improve AI citation rates?
Do I need schema on every page?
How do I check if AI engines see my schema?
Will RDFa make a comeback?
Want this implemented for your brand?
I help growth-stage companies own their category in AI search. Audit your schema implementation.