Bug #89617

closed

XML sitemaps should not be indexed by search engines

Added by Chris Müller over 4 years ago. Updated about 1 year ago.

Status: Closed
Priority: Should have
Assignee: -
Category: SEO
Start date: 2019-11-08
Due date:
% Done: 100%
Estimated time:
TYPO3 Version: 9
PHP Version:
Tags:
Complexity:
Is Regression:
Sprint Focus:

Description

The content of an XML sitemap should not be listed on search engine result
pages. This is confirmed by John Mueller (Webmaster Trends Analyst at Google):

https://twitter.com/JohnMu/status/1192434849216434177

Actions #1

Updated by Gerrit Code Review over 4 years ago

  • Status changed from New to Under Review

Patch set 1 for branch master of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/62253

Actions #2

Updated by Gerrit Code Review over 4 years ago

Patch set 2 for branch master of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/62253

Actions #3

Updated by Gerrit Code Review over 4 years ago

Patch set 1 for branch 9.5 of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/62219

Actions #4

Updated by Chris Müller over 4 years ago

  • Status changed from Under Review to Resolved
  • % Done changed from 0 to 100

Actions #5

Updated by Gerrit Code Review over 4 years ago

  • Status changed from Resolved to Under Review

Patch set 2 for branch 9.5 of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/62219

Actions #6

Updated by Chris Müller over 4 years ago

  • Status changed from Under Review to Resolved

Actions #7

Updated by Benni Mack over 4 years ago

  • Status changed from Resolved to Closed

Actions #8

Updated by Mordamir almost 2 years ago

What's the reason behind this decision?

With the X-Robots-Tag set to noindex, Google Search Console does not accept the sitemap. And an XML sitemap is mostly for crawlers. So why should I exclude the crawlers from using the sitemap?

Maybe this is a recent change from Google?

Actions #9

Updated by Chris Müller over 1 year ago

"With the X-Robots-Tag set to noindex, Google Search Console does not accept the sitemap."

I can't reproduce that. Some time ago, with this change already in place, I added several sitemaps to GSC without problems.

You have to distinguish between crawling and indexing. Crawling means that the crawler is able and allowed to retrieve the content and follow the links. This is possible for XML sitemaps with the "X-Robots-Tag: noindex" header. The same applies to a "robots" meta tag with "noindex" on a page: the page is crawled (and its links are followed), but the page isn't indexed (= displayed in the search results).

But: The XML sitemap should not be visible in the results of a search engine. Thus the "X-Robots-Tag: noindex". See also the link in the description of this issue.

And: If you want to allow the indexing of the XML sitemap (which means displaying in the search results) for whatever reason, you can easily achieve this by adding the following line in the TypoScript setup of your site package:

seo_sitemap.config.additionalHeaders.20 >
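
To illustrate the distinction between crawling and indexing described above, here is a minimal Python sketch of how a client might interpret the header. The function names (`parse_x_robots_tag`, `may_index`, `may_follow_links`) and the sample header dictionary are made up for this illustration. The parsing is deliberately simplified: it assumes a plain comma-separated directive list and does not handle bot-specific prefixes such as "googlebot: noindex".

```python
def parse_x_robots_tag(value: str) -> set[str]:
    """Split an X-Robots-Tag header value into lowercase directives."""
    return {part.strip().lower() for part in value.split(",") if part.strip()}


def _directives(headers: dict[str, str]) -> set[str]:
    # HTTP header names are case-insensitive, so normalise them first.
    normalised = {name.lower(): value for name, value in headers.items()}
    return parse_x_robots_tag(normalised.get("x-robots-tag", ""))


def may_index(headers: dict[str, str]) -> bool:
    """Return False if the headers forbid showing the URL in search results."""
    # "none" is shorthand for "noindex, nofollow".
    return not ({"noindex", "none"} & _directives(headers))


def may_follow_links(headers: dict[str, str]) -> bool:
    """Return False if the headers forbid following the links in the response."""
    return not ({"nofollow", "none"} & _directives(headers))


# Hypothetical response headers for a sitemap URL (assumed values).
sitemap_headers = {
    "Content-Type": "application/xml;charset=utf-8",
    "X-Robots-Tag": "noindex",
}

print(may_index(sitemap_headers))         # False: keep it out of the results
print(may_follow_links(sitemap_headers))  # True: the listed URLs may be followed
```

With "noindex" alone, only indexing of the sitemap file itself is refused; nothing in the header tells a crawler to stop reading the sitemap or following the URLs it lists.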

Actions #10

Updated by Florian Seirer about 1 year ago

Had the same issue as @Mordamir: according to the report in the Google Search Console, our XML sitemaps were not crawled by Google for weeks because of the noindex header.

But simply resubmitting the sitemaps fixed the problem. All URLs from the sitemaps are now recognized.

Maybe this was just a temporary issue on Google's side?
