Project

General

Profile

Actions

Bug #94130

closed

Indexing throws error because of special characters like ß

Added by Manuel Winkelhofer almost 3 years ago. Updated over 2 years ago.

Status:
Closed
Priority:
Should have
Assignee:
-
Category:
Indexed Search
Target version:
-
Start date:
2021-05-12
Due date:
% Done:

100%

Estimated time:
TYPO3 Version:
10
PHP Version:
Tags:
Complexity:
Is Regression:
Sprint Focus:

Description

here is the error:
An exception occurred while executing 'INSERT INTO `index_words` (`wid`, `baseword`, `metaphone`) VALUES (?, ?, ?)' with params [246923395, "\x70\x72\x65\x76\x69\x65\x77\x5f\x75\x6b\x76\x76\x61\x30\x31\x5f\x5f\x6b\x6f\x72\x72\x65\x6b\x74\x75\x72\x5f\x5f\x76\x6f\x72\x62\x65\x75\x67\x75\x6e\x67\x73\x5f\x75\x6e\x64\x5f\x76\x65\x72\x62\x65\x73\x73\x65\x72\x75\x6e\x67\x73\x6d\x61\xc3", "268043616"]: Incorrect string value: '\xC3' for column 'baseword' at row 1

the indexing had an error because in this line of code here:
[[https://github.com/TYPO3/TYPO3.CMS/blob/9827a72a8bf71d32e25261638f65190f59d071bc/typo3/sysext/indexed_search/Classes/Indexer.php#L1078]]

$val = substr($val, 0, 60);

the string had a special character at exactly the position 60 and was broken by the cut
after replacing the line with teh following it worked for me again
$val = mb_substr($val, 0, 60);


Related issues 1 (1 open0 closed)

Related to TYPO3 Core - Bug #94240: Respect multibyte characters in indexed_search search wordsNew2021-05-31

Actions
Actions #1

Updated by Andreas Kienast almost 3 years ago

  • Status changed from New to In Progress
Actions #2

Updated by Gerrit Code Review almost 3 years ago

  • Status changed from In Progress to Under Review

Patch set 1 for branch master of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/69151

Actions #3

Updated by Manuel Winkelhofer almost 3 years ago

  • Related to Bug #94240: Respect multibyte characters in indexed_search search words added
Actions #4

Updated by Gerrit Code Review almost 3 years ago

Patch set 1 for branch 10.4 of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/69375

Actions #5

Updated by Andreas Fernandez almost 3 years ago

  • Status changed from Under Review to Resolved
  • % Done changed from 0 to 100
Actions #6

Updated by Benni Mack over 2 years ago

  • Status changed from Resolved to Closed
Actions

Also available in: Atom PDF