Feature #85127

Epic #85006: Reduce falsely reported broken links

linkvalidator: Add possibility to exclude specific external URLs / domains or patterns

Added by Sybille Peters about 3 years ago. Updated 4 months ago.

Status:
New
Priority:
Should have
Assignee:
-
Category:
Linkvalidator
Target version:
-
Start date:
2018-05-31
Due date:
% Done:

0%

Estimated time:
PHP Version:
Tags:
Complexity:
Sprint Focus:

Description

Sometimes there may be problems with external sites that linkvalidator / Guzzle doesn't handle correctly or which can't be handled correctly at all.

To make linkvalidator still usable, it would be good to be able to exclude certain domains / urls using regular expressions. This should be supplied in a way that an admin user can edit it.

Prerequisite: Provide general configuration for extension. This does not need to be made configurable per page tree.

Examples

URLs that work in browser, but check will fail:


Related issues

Related to TYPO3 Core - Bug #86918: Linkvalidator stops working on specific links (external URLs)Closed2018-11-13

Actions
Related to TYPO3 Core - Feature #89457: Add possibility to mark as error specific external URLs / domains or patternsNeeds Feedback2019-10-18

Actions
#1

Updated by Sybille Peters almost 2 years ago

  • Subject changed from Add configuration option: file with regex patterns to exclude for external link checking to Add possibility to exclude specific external URLs / domains or patterns

Problem

In some rare cases, the checking of external URLs fails via linkvalidator even if they are not broken. See also #86918. This leads to false negatives (URLs reported as broken which are not).

One of the most annoying things for editors when working through the list of broken links are false negatives which keep coming up, clutter up the list, cannot be removed and make it really tedious to actually work through the list and fix broken links.

Proposed Solution

It would be ideal to optimize the crawl process so it always correctly reports broken links but this may not be entirely possible.

So as alternative we could add a mechanism to exclude specific URLs or URL patterns, e.g.

  • exact URL
  • URL starting with ... (or domain)
  • regular expression

This could be done by just adding optional files or could be done in the GUI with an "ignore" button.

#2

Updated by Sybille Peters almost 2 years ago

  • Related to Bug #86918: Linkvalidator stops working on specific links (external URLs) added
#3

Updated by Lina Wolf almost 2 years ago

  • Related to Feature #89457: Add possibility to mark as error specific external URLs / domains or patterns added
#4

Updated by Sybille Peters over 1 year ago

  • Subject changed from Add possibility to exclude specific external URLs / domains or patterns to linkvalidator: Add possibility to exclude specific external URLs / domains or patterns
#5

Updated by Sybille Peters 11 months ago

  • Description updated (diff)
#6

Updated by Sybille Peters 4 months ago

  • Assignee deleted (Sybille Peters)

Also available in: Atom PDF