Project

General

Profile

Actions

Bug #91586

closed

indexer misses to urldecode parsed links when checking for files locally

Added by Henri Nathanson over 4 years ago. Updated about 4 years ago.

Status:
Closed
Priority:
Should have
Assignee:
-
Category:
Indexed Search
Target version:
-
Start date:
2020-06-04
Due date:
% Done:

100%

Estimated time:
TYPO3 Version:
9
PHP Version:
7.2
Tags:
indexed_search,search,indexer,htmlparser
Complexity:
Is Regression:
Sprint Focus:

Description

The indexer extracts links from a page with HtmlParser and gets urlencoded() uris, which then are not urldecoded() to work in local storage.

https://github.com/TYPO3-CMS/indexed_search/blob/master/Classes/Indexer.php -> extractHyperLinks()

Solution:

'localPath' => $this->createLocalPath(urldecode($tagAttributes[0]['href']))

Possible additional sources:
https://github.com/AOEpeople/crawler/issues/578

Actions

Also available in: Atom PDF