Bug #14758
closedIndexer re-indexes external pages to often
0%
Description
In the new indexed search, coming with 3.8, the new feature to index external pages is re-indexing the page if the link appears twice (or more!!) on the website. This way the indexer could be requesting the same external URL hundreds or even thousands of times of the link is on the bottom of each page and you have a large website.
Simple put in an external URL in your website and enable the feature. Watch the logs of the external URL and see that the indexer is a frequent visitor - and yet maybe to frequent for the opposite server admin ;)
The indexer should go up the pagetree and see of the exact same URL has been visited since the last TTL period, and by that this should tell it not to follow the URL. That way it will make only one hit, even though the link is on the website many times.
(issue imported from #M1112)