Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.schemagen.io/llms.txt

Use this file to discover all available pages before exploring further.

The Site-wide Audit Tool scans every page of a domain and produces a structured data health report for the entire site. Instead of checking pages one by one, you enter a domain, start the audit, and get back a per-URL breakdown showing what schema is present, what’s missing, where errors exist, and which pages represent the biggest opportunities for structured data improvement.
The Site-wide Audit Tool is an Agency plan feature ($99/mo). Single-page auditing is available to all users as a free tool from the dashboard. To unlock full site audits with PDF export, upgrade to the Agency plan.

How to run a site audit

1

Open the Audit Tool

From the dashboard, click Audit in the left sidebar. The audit interface loads with a URL input and a mode selector.
2

Enter your sitemap URL

Paste the sitemap URL for the domain you want to audit—for example, https://example.com/sitemap.xml. SchemaGen parses the sitemap XML, extracts all page URLs, and queues them for analysis. The audit covers up to 20 URLs from the sitemap per run.
Use your root sitemap if your site has multiple sitemaps. If your site uses a sitemap index file, link to the main index and SchemaGen will follow the references. For the most complete picture, make sure your sitemap is up to date before running an audit.
3

Start the audit

Click Run Audit. SchemaGen begins crawling and analyzing each URL in the queue. The audit runs page by page; you can see progress update in real time as each URL completes.
4

Review the per-URL report

When the audit finishes, the results appear as a table with one row per audited URL. Each row shows the page URL, its structured data health score, detected schema types, issues found, and AI-generated recommendations.
5

Drill into individual pages

Click any URL in the results table to open its detailed page report. Here you can see every schema issue on that page, the specific fields causing problems, and suggested fixes. You can also launch the Schema Builder from this view, pre-populated with the corrected schema for that page.
6

Export the PDF report

Click Export PDF to generate a shareable report covering the entire audit. The PDF includes the per-URL health scores, issue summaries, and AI recommendations formatted for client presentation.

What the audit report covers

Each page in the audit results includes the following information:

Schema health score

A score for each page reflecting how complete and error-free its structured data is. Pages with no schema score lowest; pages with valid, Google-eligible schema score highest.

Detected schema types

Every @type found on the page, including schemas already deployed through SchemaGen and any existing JSON-LD in the page source.

Syntax errors

Structural problems with existing schema markup that prevent search engines from reading it correctly—missing required fields, invalid property types, malformed JSON.

Missing schema opportunities

Pages where structured data could be added but isn’t. The audit identifies the schema type that would be most appropriate for each page based on its content.

AI deep audit

For each page, the audit also runs an AI deep audit that generates a plain-language recommendation specific to that page’s content and existing schema state. This goes beyond generic validation rules to provide context-aware guidance—for example, identifying that a product page has a Product schema but is missing aggregateRating despite having visible review content, or that a FAQ section is present in the page body but has no corresponding FAQPage schema. These recommendations are written in language your clients can understand, making them useful to include directly in the exported PDF report.

PDF export and client deliverables

The PDF report is designed to be shared with clients or stakeholders who don’t have access to the SchemaGen dashboard. It includes:
  • An executive summary with overall domain schema health
  • A page-by-page breakdown with scores and top issues
  • AI-generated recommendations per page
  • Your agency branding (with white-label settings configured in Account → White Label)
To export, click Export PDF from the audit results view. The file generates and downloads automatically. Agency plans include white-label deliverables, meaning you can replace SchemaGen’s branding with your own agency name and logo in the report.

Audit limitations and error states

  • Site blocked: If the domain blocks automated crawlers, the audit cannot retrieve page content. The affected URLs appear as failed in the results table with a site_unsupported error. For sites you manage, check your robots.txt to ensure SchemaGen’s crawler is not disallowed.
  • Timeout: Pages that are very slow to load may time out during crawling. These appear as failed results. Try re-running the audit during off-peak hours if a significant number of pages are timing out.
  • 20-page limit per run: Each audit run processes up to 20 URLs from the sitemap. For large sites, prioritize by submitting a focused sitemap (for example, just your product pages or blog posts) for targeted audits.