Project

General

Profile

Actions

Bug #20665

closed

tx_indexedsearch description text with utf-8 encoding issues

Added by Thomas over 15 years ago. Updated about 11 years ago.

Status:
Closed
Priority:
Should have
Assignee:
-
Category:
Indexed Search
Target version:
-
Start date:
2009-06-24
Due date:
% Done:

0%

Estimated time:
TYPO3 Version:
4.3
PHP Version:
Tags:
Complexity:
Is Regression:
Sprint Focus:

Description

Under certain conditions the preg_replace() function in the second line of markupSWpartsOfString() produces wrong output. This preg_replace() is resonsible for reducing multiple blank spaces to a single one.

$str = preg_replace('/\s\s+/',' ',$str);

In the Frontend, the blank space character produced by this function is parsed wrong, most likely by the htmlspecialchars() later in the same function.

Hint: The problem might be related to the iconv implementation "libiconv" version 1.11, as the error didn't appear on another system with exactly the same PHP / TYPO3 / MySQL setup.

It seems like the preg_replace() function produces a false encoded blank space, which is later encoded to a "broken" utf-8 character (square with question mark).

(issue imported from #M11405)


Related issues 1 (0 open1 closed)

Is duplicate of TYPO3 Core - Bug #19726: UTF8 produces bad character outputClosed2008-12-19

Actions
Actions #1

Updated by Dmitry Dulepov over 14 years ago

That can be solved by using "u" modifier to preg_replace but there are two problems:
- PCRE can be compiled without Unicode
- encoding can be other than UTF-8

No patch yet :(

Actions #2

Updated by Alexander Opitz over 11 years ago

  • Status changed from New to Needs Feedback
  • Target version deleted (0)
  • TYPO3 Version set to 4.3

The issue is very old, does this issue exists in newer versions of TYPO3 CMS (4.5 or 6.1)?

Actions #3

Updated by Alexander Opitz about 11 years ago

  • Status changed from Needs Feedback to Closed

No feedback for over 90 days.

Actions

Also available in: Atom PDF