Wyrote Academy

Links and Sitemap

Sitemap setup, page crawling, and internal linking

Internal linking is a critical SEO signal. Wyrote uses your sitemap to automatically add relevant internal links to generated content. This guide covers setting up your sitemap, syncing pages, and managing your link infrastructure.

Click Links in the sidebar navigation.

Why Internal Linking Matters

Internal links:

  • Help search engines discover and index your pages
  • Distribute page authority (link juice) across your site
  • Guide readers to related content
  • Strengthen the pillar-cluster content model
  • Improve user engagement and time on site

Wyrote's sitemap is the sole source for internal link URLs. Only pages in your synced sitemap can be linked to in generated content.

Setting Up Your Sitemap

Manual Entry

  1. Enter your sitemap URL in the input field (e.g., https://yoursite.com/sitemap.xml).
  2. Click Save.

Auto-Detect

  1. Click the Auto-Detect button.
  2. Wyrote checks common sitemap locations on your domain:
    • /sitemap.xml
    • /sitemap_index.xml
    • /sitemap/
    • And other standard paths
  3. If found, the URL is suggested — click to accept it.

Tip: Most websites have their sitemap at /sitemap.xml. If yours is in a non-standard location, enter it manually.

Syncing Your Sitemap

After setting the sitemap URL:

  1. Click Sync Sitemap.
  2. Wyrote imports all URLs from the sitemap into your database.
  3. The page count and last sync timestamp are displayed.

You should re-sync whenever you add new pages to your website to keep Wyrote's link pool up to date.

Crawling Pages

Syncing imports URLs, but crawling extracts detailed information from each page:

  1. Click Start Crawl.
  2. Wyrote uses Firecrawl to visit each page and extract:
    • Title — The page's title tag
    • Description — The meta description
    • H1 — The main heading
    • Page Type — Blog, service, homepage, etc.
    • Status Code — 200, 301, 404, etc.
    • Last Modified — When the page was last updated

Crawl Progress

  • A real-time progress indicator shows how many pages have been crawled.
  • Any errors during crawling are reported.
  • Crawling runs in the background — you can navigate away and come back.

Note: Crawling gives Wyrote context about each page, enabling smarter internal linking decisions. The AI can match link targets to content topics more accurately with crawl data.

Pages Table

After syncing and crawling, the pages table shows all your sitemap URLs:

ColumnDescription
URLThe page URL
TitleExtracted page title
H1Main heading on the page
TypeClassified page type (blog, service, homepage, etc.)
Status CodeHTTP response code
Last ModifiedWhen the page was last updated
Last SyncedWhen Wyrote last synced this page
Crawl StatusWhether the page has been crawled

Filtering and Searching

  • Search — Find pages by URL
  • Type Filter — Filter by page type (blog, service, homepage, etc.)
  • Type Stats — Breakdown showing how many pages of each type exist

Pagination shows 50 pages at a time with Previous/Next navigation.

Once your sitemap is synced and pages are crawled:

  • Generated articles automatically include internal links to relevant pages from your sitemap.
  • The number of internal links per article is configurable in Style Settings (1–5 links).
  • Links are placed naturally within the content, not forced or clustered.
  • The AI matches link targets based on topical relevance.

Dashboard Warnings

If your sitemap isn't set up, the Dashboard will display alerts:

  • "Sitemap not configured" — You haven't added a sitemap URL. Internal linking is disabled.
  • "Sitemap pages not crawled" — Your sitemap is set and synced, but pages haven't been crawled yet.

Best Practices

  1. Sync regularly — Re-sync your sitemap whenever you publish new pages on your website.
  2. Crawl after syncing — New pages need to be crawled for the AI to link to them intelligently.
  3. Check for broken pages — Review the Status Code column for any 404 or error pages.
  4. Keep your sitemap updated — Ensure your website's sitemap includes all important pages.

Next Steps

On this page