The Site-wide Audit Tool scans every page of a domain and produces a structured data health report for the entire site. Instead of checking pages one by one, you enter a domain, start the audit, and get back a per-URL breakdown showing what schema is present, what’s missing, where errors exist, and which pages represent the biggest opportunities for structured data improvement.Documentation Index
Fetch the complete documentation index at: https://docs.schemagen.io/llms.txt
Use this file to discover all available pages before exploring further.
How to run a site audit
Open the Audit Tool
From the dashboard, click Audit in the left sidebar. The audit interface loads with a URL input and a mode selector.
Enter your sitemap URL
Paste the sitemap URL for the domain you want to audit—for example,
https://example.com/sitemap.xml. SchemaGen parses the sitemap XML, extracts all page URLs, and queues them for analysis. The audit covers up to 20 URLs from the sitemap per run.Start the audit
Click Run Audit. SchemaGen begins crawling and analyzing each URL in the queue. The audit runs page by page; you can see progress update in real time as each URL completes.
Review the per-URL report
When the audit finishes, the results appear as a table with one row per audited URL. Each row shows the page URL, its structured data health score, detected schema types, issues found, and AI-generated recommendations.
Drill into individual pages
Click any URL in the results table to open its detailed page report. Here you can see every schema issue on that page, the specific fields causing problems, and suggested fixes. You can also launch the Schema Builder from this view, pre-populated with the corrected schema for that page.
What the audit report covers
Each page in the audit results includes the following information:Schema health score
A score for each page reflecting how complete and error-free its structured data is. Pages with no schema score lowest; pages with valid, Google-eligible schema score highest.
Detected schema types
Every
@type found on the page, including schemas already deployed through SchemaGen and any existing JSON-LD in the page source.Syntax errors
Structural problems with existing schema markup that prevent search engines from reading it correctly—missing required fields, invalid property types, malformed JSON.
Missing schema opportunities
Pages where structured data could be added but isn’t. The audit identifies the schema type that would be most appropriate for each page based on its content.
AI deep audit
For each page, the audit also runs an AI deep audit that generates a plain-language recommendation specific to that page’s content and existing schema state. This goes beyond generic validation rules to provide context-aware guidance—for example, identifying that a product page has aProduct schema but is missing aggregateRating despite having visible review content, or that a FAQ section is present in the page body but has no corresponding FAQPage schema.
These recommendations are written in language your clients can understand, making them useful to include directly in the exported PDF report.
PDF export and client deliverables
The PDF report is designed to be shared with clients or stakeholders who don’t have access to the SchemaGen dashboard. It includes:- An executive summary with overall domain schema health
- A page-by-page breakdown with scores and top issues
- AI-generated recommendations per page
- Your agency branding (with white-label settings configured in Account → White Label)
Audit limitations and error states
- Site blocked: If the domain blocks automated crawlers, the audit cannot retrieve page content. The affected URLs appear as failed in the results table with a
site_unsupportederror. For sites you manage, check yourrobots.txtto ensure SchemaGen’s crawler is not disallowed. - Timeout: Pages that are very slow to load may time out during crawling. These appear as failed results. Try re-running the audit during off-peak hours if a significant number of pages are timing out.
- 20-page limit per run: Each audit run processes up to 20 URLs from the sitemap. For large sites, prioritize by submitting a focused sitemap (for example, just your product pages or blog posts) for targeted audits.