Feature #85679

Epic #83559: SEO enhancements in Core

Add robot.txt functionality

Added by Richard Haeser over 1 year ago. Updated 7 months ago.

Status:
Closed
Priority:
Should have
Assignee:
-
Category:
SEO
Target version:
-
Start date:
2018-07-29
Due date:
% Done:

0%

PHP Version:
Tags:
Complexity:
Sprint Focus:

Description

TYPO3 should deliver an out-of-the-box robots.txt to prevent certain directories (like eg /typo3) to be indexed. Besides that, it should be possible by integrators to block certain parts of the website to be crawled to prevent using crawl budget while it is not needed to be in the index.

History

#1 Updated by Richard Haeser over 1 year ago

  • Category set to SEO

#2 Updated by Richard Haeser over 1 year ago

  • Status changed from New to Closed

It is possible to add your robots.txt by a static route. See https://docs.typo3.org/typo3cms/CoreApiReference/ApiOverview/SiteHandling/StaticRoutes.html for more information.

#3 Updated by Richard Haeser over 1 year ago

  • Status changed from Closed to New

#4 Updated by Richard Haeser over 1 year ago

It still would be nice if TYPO3 can render a (configurable) robots.txt

#5 Updated by Andreas Fernandez over 1 year ago

What the robots.txt could do:

  • Automatically add the sitemap.xml, if configured
  • Add Disallow: foo if page should not get indexed (discussed with Richard, potentially dangerous as Google doesn't "forget" wrong configs so fast)
  • Add typo3temp/var/ if in document root
  • Exclude typo3/
  • Exclude typo3conf/ext/*/Resources/Private/

#6 Updated by Benni Mack 7 months ago

  • Status changed from New to Closed

#7 Updated by Richard Haeser 7 months ago

A little note why we closed it:
It is best practise to keep your robots.txt as minimal as possible. So that is why we will not include anything in core currently.

For more information see https://yoast.com/wordpress-robots-txt-example as well.

Also available in: Atom PDF