Project

General

Profile

Actions

Bug #80085

closed

Extraction of metadata in PDF-documents does not recognize unicode characters

Added by Gerhard Rupp over 7 years ago. Updated almost 2 years ago.

Status:
Closed
Priority:
Must have
Assignee:
-
Category:
Indexed Search
Target version:
-
Start date:
2017-03-01
Due date:
% Done:

100%

Estimated time:
TYPO3 Version:
8
PHP Version:
7.0
Tags:
Complexity:
Is Regression:
No
Sprint Focus:

Description

If metadata in a PDF-document does f. e. contain German umlauts field value is cut-off.

Therefore in "FileContentParser.php" in function "splitPdfInfo" line 796 (TYPO3 7.6)

$res[strtolower(trim($parts[0]))] = trim($parts[1]);

has to be replaced by

$res[strtolower(trim($parts[0]))] = utf8_encode(trim($parts[1]));

Related issues 1 (0 open1 closed)

Related to TYPO3 Core - Bug #99352: PDF Metadata double-encoded by index-search indexer with poppler-utils pdfinfoClosed2022-12-13

Actions
Actions #1

Updated by Gerhard Rupp over 7 years ago

  • % Done changed from 0 to 100
Actions #2

Updated by Gerrit Code Review over 7 years ago

  • Status changed from New to Under Review

Patch set 3 for branch master of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/51938

Actions #3

Updated by Anja Leichsenring over 6 years ago

  • Status changed from Under Review to Needs Feedback

patch is not available.

Does the issue still exist?

Actions #4

Updated by Gerhard Rupp over 6 years ago

Yes, issue still exists.

Actions #5

Updated by Riccardo De Contardi over 6 years ago

  • Status changed from Needs Feedback to New
Actions #6

Updated by Gerhard Rupp over 6 years ago

  • % Done changed from 100 to 0

Can anyone tell me what´s the problem about this trivial patch? It´s so frustrating that it takes ages even for such obvious und already proven fixes to be recognized by the core team. This makes updating TYPO3 unneccessary time consuming.

Actions #7

Updated by Gerhard Rupp almost 6 years ago

  • TYPO3 Version changed from 7 to 8

After nearly two years this issue hasn´t been fixed even for recent versions (8.7, 9.5). Although a (obviously) working patch set was published long time ago. Frustrating ...

Actions #8

Updated by Gerrit Code Review almost 2 years ago

  • Status changed from New to Under Review

Patch set 1 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76861

Actions #9

Updated by Gerrit Code Review almost 2 years ago

Patch set 2 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76861

Actions #10

Updated by Gerrit Code Review almost 2 years ago

Patch set 3 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76861

Actions #11

Updated by Gerrit Code Review almost 2 years ago

Patch set 4 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76861

Actions #12

Updated by Gerrit Code Review almost 2 years ago

Patch set 5 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76861

Actions #13

Updated by Gerrit Code Review almost 2 years ago

Patch set 1 for branch 11.5 of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76892

Actions #14

Updated by Gerrit Code Review almost 2 years ago

Patch set 1 for branch 10.4 of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/76893

Actions #15

Updated by Tomas Norre Mikkelsen almost 2 years ago

  • Status changed from Under Review to Resolved
  • % Done changed from 0 to 100
Actions #16

Updated by Benni Mack almost 2 years ago

  • Status changed from Resolved to Closed
Actions #17

Updated by Benjamin Franzke almost 2 years ago

  • Related to Bug #99352: PDF Metadata double-encoded by index-search indexer with poppler-utils pdfinfo added
Actions

Also available in: Atom PDF