Feature #85127

Epic #85006: Reduce falsely reported broken links

Add possibility to exclude specific external URLs / domains or patterns

Added by Sybille Peters over 1 year ago. Updated about 2 months ago.

Status:
New
Priority:
Should have
Category:
Linkvalidator
Target version:
-
Start date:
2018-05-31
Due date:
% Done:

0%

PHP Version:
Tags:
Complexity:
Sprint Focus:

Description

Sometimes there may be problems with external sites that linkvalidator / Guzzle doesn't handle correctly or which can't be handled correctly at all.

To make linkvalidator still usable, it would be good to be able to exclude certain domains / urls using regular expressions. This should be supplied in a way that an admin user can edit it.

Prerequisite: Provide general configuration for extension. This does not need to be made configurable per page tree.


Related issues

Related to TYPO3 Core - Bug #86918: Linkvalidator stops working on specific links (external URLs) Resolved 2018-11-13
Related to TYPO3 Core - Feature #89457: Add possibility to mark as error specific external URLs / domains or patterns New 2019-10-18

History

#1 Updated by Sybille Peters about 2 months ago

  • Subject changed from Add configuration option: file with regex patterns to exclude for external link checking to Add possibility to exclude specific external URLs / domains or patterns

Problem

In some rare cases, the checking of external URLs fails via linkvalidator even if they are not broken. See also #86918. This leads to false negatives (URLs reported as broken which are not).

One of the most annoying things for editors when working through the list of broken links are false negatives which keep coming up, clutter up the list, cannot be removed and make it really tedious to actually work through the list and fix broken links.

Proposed Solution

It would be ideal to optimize the crawl process so it always correctly reports broken links but this may not be entirely possible.

So as alternative we could add a mechanism to exclude specific URLs or URL patterns, e.g.

  • exact URL
  • URL starting with ... (or domain)
  • regular expression

This could be done by just adding optional files or could be done in the GUI with an "ignore" button.

#2 Updated by Sybille Peters about 2 months ago

  • Related to Bug #86918: Linkvalidator stops working on specific links (external URLs) added

#3 Updated by Lina Ourima about 1 month ago

  • Related to Feature #89457: Add possibility to mark as error specific external URLs / domains or patterns added

Also available in: Atom PDF