An XML sitemap is the fastest way to make Google index your new pages, if you build it right. Here's what to include, what to leave out, and how to generate one free in your browser.
An XML sitemap is how you stop hoping Google finds your pages and start telling it where they are. Done right, it's the difference between a new article appearing in search results within hours versus a few weeks. Done wrong (outdated URLs, blocked pages listed, the canonical version missing), it actively hurts your indexing.
This guide explains what a sitemap actually does (and doesn't), the four files Google looks at to crawl your site, how to generate a clean sitemap free, and exactly where to submit it so it gets used.
A sitemap is a list of URLs on your site, written in XML, that tells search engines which pages exist, when each was last modified (<lastmod>), how often it changes (<changefreq>, Google now mostly ignores this), and how important it is relative to your other pages (<priority>, also mostly ignored).
The two pieces that genuinely matter to Google in 2026 are the URL list itself and the lastmod date. The rest is legacy. Google confirmed in 2023 that it uses <lastmod> as a recrawl signal, provided the dates are consistently accurate, but largely ignores <priority> and <changefreq>.
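To make the format concrete, here is a minimal Python sketch that writes a sitemap containing only the two fields that matter, the URL and its lastmod date. The function name build_sitemap and the sample pages are made up for illustration; this is not how any particular generator works.

```python
import xml.etree.ElementTree as ET
from datetime import date

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(pages):
    """Build sitemap XML from (url, last_modified_date) pairs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url, last_modified in pages:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
        # <lastmod> takes a W3C date such as YYYY-MM-DD
        ET.SubElement(entry, "lastmod").text = last_modified.isoformat()
    # <changefreq> and <priority> are left out on purpose
    raw = ET.tostring(urlset, encoding="utf-8", xml_declaration=True)
    return raw.decode("utf-8")


# Hypothetical pages, for illustration only
pages = [
    ("https://yoursite.com/", date(2026, 1, 15)),
    ("https://yoursite.com/blog/first-post", date(2026, 1, 10)),
]
sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```

The lastmod value should come from when the content really changed, not from when the file was generated, otherwise the date stops being a useful signal.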
A sitemap is not a guarantee that a page gets indexed.
What it is: a discovery shortcut. It is particularly valuable for new sites, large sites, sites with weak internal linking, and sites that publish content frequently.
If your site has fewer than ~50 pages and good internal linking, you technically don't need a sitemap; Google will find everything. In practice, every site benefits from one because it speeds up indexing of new content and lets you see in Search Console which URLs Google knows about.
You absolutely need one if your site is new, large, weakly interlinked, or publishes content frequently.
A complete crawl setup has four pieces: robots.txt, the sitemap itself, optional image and video sitemaps, and per-page tags. Each does something different and they work together.
robots.txt (at /robots.txt)
Tells crawlers which paths to crawl and which to skip. Also points to your sitemap. Without this line in robots.txt, Google may not discover your sitemap on its own:
Sitemap: https://yoursite.com/sitemap.xml
Don't have one yet? Our robots.txt generator builds one in a minute.
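If you want to confirm that a robots.txt file really declares a sitemap, Python's standard library can read the directive. This is a small sketch: declared_sitemaps is a made-up helper name and the robots.txt content is a sample, not your real file.

```python
from urllib.robotparser import RobotFileParser


def declared_sitemaps(robots_txt):
    """Return the sitemap URLs listed in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() (Python 3.8+) returns None when no Sitemap line exists
    return parser.site_maps() or []


# Sample robots.txt content, for illustration only
robots_txt = """\
User-agent: *
Disallow: /wp-admin/
Sitemap: https://yoursite.com/sitemap.xml
"""
print(declared_sitemaps(robots_txt))
```

An empty list means the Sitemap line is missing and should be added.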
sitemap.xml (or sitemap_index.xml)
The actual list of URLs. If your site has more than 50,000 URLs or the sitemap is over 50 MB uncompressed, you need to split it across multiple sitemaps and point to them with a sitemap index file. Most small and mid-size sites are nowhere near these limits.
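For sites that do hit the URL limit, the split can be sketched in a few lines of Python. This assumes a naming scheme of sitemap-1.xml, sitemap-2.xml and so on, which is only an illustration, and it checks the 50,000-URL limit but not the 50 MB size limit.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_SITEMAP = 50_000


def chunk_urls(urls, size=MAX_URLS_PER_SITEMAP):
    """Split a URL list into groups no larger than the per-file limit."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap_index(sitemap_urls):
    """Build the index file that points at each child sitemap."""
    index = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for sitemap_url in sitemap_urls:
        entry = ET.SubElement(index, "sitemap")
        ET.SubElement(entry, "loc").text = sitemap_url
    raw = ET.tostring(index, encoding="utf-8", xml_declaration=True)
    return raw.decode("utf-8")


# Hypothetical large site with 120,000 product URLs
urls = [f"https://yoursite.com/product/{n}" for n in range(120_000)]
chunks = chunk_urls(urls)
# Assumed file naming, for illustration only
child_sitemaps = [
    f"https://yoursite.com/sitemap-{i + 1}.xml" for i in range(len(chunks))
]
index_xml = build_sitemap_index(child_sitemaps)
print([len(chunk) for chunk in chunks])
```

Each chunk would then be written out as its own sitemap file, and only the index is referenced from robots.txt and submitted.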
Image and video sitemaps
Separate XML files that list image and video URLs. Most sites don't need these (Google discovers images by crawling pages), but they help if you have a media-heavy site (stock photo, video tutorial library, etc.).
Page-level tags (canonical tag, noindex)
Every page should specify its canonical URL and whether it should be indexed. The sitemap should only ever list canonical, indexable URLs; listing duplicates or noindex'd pages confuses Google.
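Include in the sitemap only canonical, indexable URLs that return 200 OK, each with an accurate last-modified date (<lastmod>).
Exclude pages marked <meta name="robots" content="noindex">, URLs that 404 or redirect, URLs blocked by robots.txt, and paginated archive pages (/page/2, /page/3; debated, mostly skip).
A common mistake: shipping a sitemap with 5,000 URLs of which only 2,000 are actually indexable. The mismatch shows up in Search Console, and Google trusts your sitemap less going forward.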
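The include and exclude rules can be expressed as a small filter. The sketch below assumes you already have each page's status code and HTML; the names IndexabilityParser and belongs_in_sitemap are invented for this example, and a page without a canonical tag is treated as its own canonical, which is a simplification.

```python
from html.parser import HTMLParser


class IndexabilityParser(HTMLParser):
    """Collect the robots meta tag and canonical link from a page."""

    def __init__(self):
        super().__init__()
        self.noindex = False
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            if "noindex" in (attrs.get("content") or "").lower():
                self.noindex = True
        elif tag == "link" and (attrs.get("rel") or "").lower() == "canonical":
            self.canonical = attrs.get("href")


def belongs_in_sitemap(url, status_code, html):
    """True only for 200-OK, indexable pages that are their own canonical."""
    if status_code != 200:
        return False
    parser = IndexabilityParser()
    parser.feed(html)
    if parser.noindex:
        return False
    # Assumption: no canonical tag means the page is its own canonical.
    # The comparison is an exact string match (no URL normalization).
    return parser.canonical in (None, url)


page = '<html><head><link rel="canonical" href="https://yoursite.com/a"></head></html>'
print(belongs_in_sitemap("https://yoursite.com/a", 200, page))
print(belongs_in_sitemap("https://yoursite.com/a?ref=x", 200, page))
```

The second call is rejected because the page declares a different canonical URL, which is exactly the kind of duplicate that should stay out of the sitemap.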
For small sites and most blogs, you don't need to install anything. Using our free sitemap generator:
1. Enter your site's URL. The tool crawls your site starting from the URL you give it, following internal links and discovering pages.
2. Set any exclusion patterns (for example /wp-admin/, /cart/, /?s=).
3. Run the crawl. Crawling a 200-page site takes about a minute. The tool finds pages, gets their last-modified date (from headers or Open Graph), and builds the URL list.
4. Review what it found. If it discovered URLs you don't want indexed, exclude their pattern. If it missed pages, link to them better from your existing pages; missed pages are a sign of weak internal linking, which is fixable.
5. Download the result: a standards-compliant XML file. Save it.
6. Upload it. Put the file at https://yoursite.com/sitemap.xml. The location must be the root or a directory you specify in robots.txt.
7. Reference it in robots.txt:
Sitemap: https://yoursite.com/sitemap.xml
8. Submit it to Google. Go to Search Console → Sitemaps → enter sitemap.xml → Submit. Google will crawl it within hours and report any errors.
9. Repeat step 8 in Bing Webmaster Tools, which covers Bing, Yahoo, DuckDuckGo, and a chunk of AI search.
Google rechecks sitemaps regularly, typically daily for active sites, so it picks up updates without you having to resubmit.
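The discovery part of the steps above, following internal links while skipping excluded patterns, can be sketched roughly as follows. This is not the generator's actual code: LinkCollector and internal_links are illustrative names, the exclusion is a plain substring match, and the sketch handles a single page rather than a full crawl.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit

# Exclusion patterns taken from the example in the steps above
EXCLUDED_PATTERNS = ["/wp-admin/", "/cart/", "/?s="]


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def internal_links(page_url, html, excluded=EXCLUDED_PATTERNS):
    """Return same-host links found on one page, minus excluded patterns."""
    collector = LinkCollector()
    collector.feed(html)
    site_host = urlsplit(page_url).netloc
    found = []
    for href in collector.hrefs:
        # Resolve relative links and drop #fragments
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        if urlsplit(absolute).netloc != site_host:
            continue  # external link (or mailto:, tel:, etc.)
        if any(pattern in absolute for pattern in excluded):
            continue
        if absolute not in found:
            found.append(absolute)
    return found


html = (
    '<a href="/blog/">Blog</a><a href="/cart/">Cart</a>'
    '<a href="https://other.com/x">Elsewhere</a>'
    '<a href="/blog/#top">Top</a><a href="/?s=shoes">Search</a>'
)
print(internal_links("https://yoursite.com/", html))
```

A page that no other page links to never shows up in this kind of crawl, which is why missed pages point to weak internal linking.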
A sitemap entry is a request, not a command. Google decides what to index based on perceived quality, crawl budget, duplicates, and a few other signals. Check Search Console's "Page indexing" report for the specific reason on each non-indexed URL. The most common are "Discovered - currently not indexed" (low perceived value) and "Crawled - currently not indexed" (content not unique enough).
Use sitemap.xml if you have under 50,000 URLs and the file stays under 50 MB uncompressed. Use sitemap_index.xml (which references multiple sub-sitemaps) only when you exceed either limit.
A separate mobile sitemap has not been needed since 2019. Mobile-first indexing means Google treats your responsive site as one. Listing each URL once is correct.
For multilingual sites, add <xhtml:link rel="alternate" hreflang="..."> tags inside each <url> entry. This signals to Google which URL serves which language or region. Get this wrong and you'll have duplicate-content issues across language variants.
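As a rough illustration of the hreflang structure, the Python sketch below writes one <url> entry per language variant, and each entry lists every variant including itself. The function name build_hreflang_sitemap and the example URLs are assumptions made for this example.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
XHTML_NS = "http://www.w3.org/1999/xhtml"

# Note: these registrations are global to ElementTree in this process
ET.register_namespace("", SITEMAP_NS)
ET.register_namespace("xhtml", XHTML_NS)


def build_hreflang_sitemap(variant_groups):
    """variant_groups: list of dicts mapping hreflang code -> URL."""
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for variants in variant_groups:
        for url in variants.values():
            entry = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
            ET.SubElement(entry, f"{{{SITEMAP_NS}}}loc").text = url
            # Every variant lists all alternates, itself included
            for lang, alt_url in variants.items():
                ET.SubElement(
                    entry,
                    f"{{{XHTML_NS}}}link",
                    rel="alternate",
                    hreflang=lang,
                    href=alt_url,
                )
    raw = ET.tostring(urlset, encoding="utf-8", xml_declaration=True)
    return raw.decode("utf-8")


# Hypothetical English and German versions of one page
groups = [{
    "en": "https://yoursite.com/en/pricing",
    "de": "https://yoursite.com/de/pricing",
}]
hreflang_xml = build_hreflang_sitemap(groups)
print(hreflang_xml)
```

The links have to be reciprocal: if the English entry points at the German page, the German entry has to point back.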
A sitemap ping was the old way of telling Google "I updated my sitemap, come look." Google retired sitemap pings in 2023. Now Google just rechecks active sitemaps regularly. You don't need to ping; you just need accurate <lastmod> dates.
The most common sitemap error in Search Console: the sitemap references URLs that 404, 301-redirect, or are blocked by robots.txt. Fix the source (restore or unblock the page, or remove the dead URL from the sitemap) and resubmit. Don't ignore the warnings; they erode trust in your sitemap over time.
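You can catch these problems before Google does with a small audit script. The sketch below is one possible approach: audit_sitemap is a made-up name, status_of is a placeholder for whatever you use to fetch HTTP status codes, and Python's robots.txt parser does simple prefix matching, so it may not agree with Googlebot on wildcard rules.

```python
import xml.etree.ElementTree as ET
from urllib.robotparser import RobotFileParser

NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def audit_sitemap(sitemap_xml, robots_txt, status_of):
    """Return {url: problem} for sitemap URLs that should not be listed.

    status_of is a callable mapping a URL to its HTTP status code
    (a hypothetical hook; plug in your own HTTP client).
    """
    robots = RobotFileParser()
    robots.parse(robots_txt.splitlines())
    problems = {}
    for loc in ET.fromstring(sitemap_xml).findall("sm:url/sm:loc", NS):
        url = loc.text.strip()
        if not robots.can_fetch("Googlebot", url):
            problems[url] = "blocked by robots.txt"
            continue
        status = status_of(url)
        if status != 200:
            problems[url] = f"returns HTTP {status}"
    return problems


# Sample data, for illustration only
sitemap_xml = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://yoursite.com/</loc></url>
  <url><loc>https://yoursite.com/old-page</loc></url>
  <url><loc>https://yoursite.com/cart/checkout</loc></url>
</urlset>"""
robots_txt = "User-agent: *\nDisallow: /cart/\n"
fake_statuses = {
    "https://yoursite.com/": 200,
    "https://yoursite.com/old-page": 404,
    "https://yoursite.com/cart/checkout": 200,
}
report = audit_sitemap(sitemap_xml, robots_txt, fake_statuses.get)
print(report)
```

Anything in the report is a URL to fix or drop before resubmitting.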
Generate a clean sitemap (only canonical, indexable, 200-OK URLs), reference it from robots.txt, and submit it once to Search Console. Keep it accurate when pages change. That's the entire workflow, and it's the single fastest way to make sure Google sees what you publish.