Bug #99909
openFalse positive broken links by parsing URLs not inside <a> tags
0%
Description
Linkvalidator always reports an error due to the appearance of "http://" or "https://" in the body text (just text, no href etc.).
Linkvalidator should only check real links not the apparente of link like texts.
In addition it would be grate to have an icon in the "Listing of broken links" next to the pencil that can be clicked to ignore these errors (this link) in future crawlings.
Files
Updated by Rémy DANIEL over 1 year ago
- Related to Epic #85006: Reduce falsely reported broken links added
Updated by Sybille Peters over 1 year ago
- Status changed from New to Needs Feedback
Linkvalidator always reports an error due to the appearance of "http://" or "https://" in the body text (just text, no href etc.).
Can you give an example here and steps to reproduce (preferably reproducible in TYPO3 v13)?
Do you mean URLs directly in the body text, not enclosed in an <a> tag, such as
https://example.org/abc
as opposed to
<a href="https://example.org/abc">link</a>
If URLs are parsed as links also depends on how you configured the softref field in TCA.
For example
$TCA['tt_content']['columns']['bodytext']['config']['softref'] = typolink_tag,email[subst],url;
If you remove the "url" here, URLs in bodytext will no longer be parsed as links.
Compare also how these URLs are rendered in the Frontend by TYPO3.
So, depending on your configuration this may be intended and correct behavior.
I am very well aware that there are still some problems with parsing, but this issue needs a better description.
See also:
Updated by Sybille Peters over 1 year ago
- Related to Bug #97937: Linkvalidator: Links and in tt_content.bodytext cause problems in UrlSoftReferenceParser added
Updated by Sybille Peters over 1 year ago
- Related to Bug #95878: In linkvalidator, soft reference parser extracts 2 links from rich text with URL as anchor text added
Updated by Sybille Peters over 1 year ago
In addition it would be grate to have an icon in the "Listing of broken links" next to the pencil that can be clicked to ignore these errors (this link) in future crawlings.
For excluding specific external URLs there is already an issue: https://forge.typo3.org/issues/85127
You can also look at my extension "Broken Link Fixer" (brofix) which is a fork of linkvalidator and implements this, see https://extensions.typo3.org/extension/brofix
Updated by Sybille Peters over 1 year ago
- Related to Feature #85127: linkvalidator: Add possibility to exclude specific external URLs / domains or patterns added
Updated by Sybille Peters over 1 year ago
- Subject changed from False positive broken links to False positive broken links by parsing URLs not inside <a> tags
If this issue relates specifically to parsing, I would change the title to differentiate from: https://forge.typo3.org/issues/101670
Updated by Sybille Peters over 1 year ago
- Related to Bug #101670: Linkvalidator reports some external URLs as "false positives" added
Updated by Christian Ludwig about 1 year ago
In the described case it is simply text, no anchor href. Here the example code from the rendered page as it is shown to the visitor.
<p>
... Eine verschlüsselte Verbindung erkennen Sie daran, dass die Adresszeile des Browsers
von „http://“ auf „https://“ wechselt und an dem Schloss-Symbol in Ihrer Browserzeile.
</p>