Project

General

Profile

Actions

Bug #97937

open

Linkvalidator: Links and   in tt_content.bodytext cause problems in UrlSoftReferenceParser

Added by Kai Strecker almost 2 years ago. Updated 4 months ago.

Status:
Under Review
Priority:
Should have
Assignee:
-
Category:
Linkvalidator
Target version:
-
Start date:
2022-07-14
Due date:
% Done:

0%

Estimated time:
TYPO3 Version:
11
PHP Version:
8.1
Tags:
Complexity:
Is Regression:
Sprint Focus:

Description

In our test case, we have a content element with CType = textmedia and the following data in the bodytext field:

<p>lorem ipsum https://weber.digital&nbsp; &nbsp; dolor sit amet</p>
<p><strong>Used Fonts and Iconfonts</strong><br /> Museo Sans Rounded Family OTF (<a class="external" href="https://www.myfonts.com">https://www.myfonts.com</a>)<br /> Droid Serif (<a class="external" href="https://www.fontsquirrel.com/fonts/droid-serif">https://www.fontsquirrel.com/fonts/droid-serif</a>)<br /> Icons Mind (<a class="external" href="https://iconsmind.com">https://iconsmind.com</a>)<br /> Linear Icons Free (<a class="external" href="https://linearicons.com">https://linearicons.com</a>) (CC BY-SA 4.0) by Perxis (<a class="external" href="https://perxis.com">https://perxis.com</a>)</p>
When the linkvalidator executes the UrlSoftReferenceParser, it finds two broken links:

So according to this test, the UrlSoftReferenceParser does not handle &nbsp; correctly and also has a bug in the big regex, which does all the parsing.


Related issues 4 (3 open1 closed)

Related to TYPO3 Core - Bug #98120: Link parsing problem in linkvalidator Closed2022-08-10

Actions
Related to TYPO3 Core - Bug #98328: Exception "Data too long for column url" when checking links New2022-09-11

Actions
Related to TYPO3 Core - Bug #99909: False positive broken links by parsing URLs not inside <a> tagsNeeds Feedback2023-02-09

Actions
Related to TYPO3 Core - Epic #85006: Reduce falsely reported broken linksNew2018-02-11

Actions
Actions

Also available in: Atom PDF