Bug #99527
Epic #65815 (closed): Improve Indexed search indexer
indexed_search does not properly index XLSX and PPTX files
100%
Description
While working on a client's installation, I found that indexed_search cannot properly index XLSX and PPTX files.
- Problem for XLSX: the wrong "unzipped" file is indexed. That file contains metadata that is essentially just pointers (integers), so the extracted content is wrong and useless.
- Problem for PPTX: only the content of slide 1 is extracted; the content of all other slides is missed.
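
To illustrate the two problems, here is a minimal Python sketch of where the text lives inside these archives. It is not the attached patch and not how indexed_search is implemented; the function names are hypothetical. It assumes the usual Office Open XML layout: XLSX cell strings are stored in `xl/sharedStrings.xml` (worksheets mostly reference them by integer index), and PPTX text is spread over `ppt/slides/slide1.xml`, `ppt/slides/slide2.xml`, and so on.

```python
import io
import re
import zipfile
import xml.etree.ElementTree as ET


def _text_of(xml_bytes):
    """Join the text of every namespaced <t> element in an XML part."""
    root = ET.fromstring(xml_bytes)
    # Both SpreadsheetML and DrawingML store text runs in a namespaced "t" tag.
    parts = [el.text for el in root.iter() if el.tag.endswith("}t") and el.text]
    return " ".join(parts)


def extract_xlsx_text(data):
    """Return the shared strings of an XLSX file (hypothetical helper).

    Assumption: cell text sits in xl/sharedStrings.xml. Inline strings and
    numeric values stored in the worksheets are ignored by this sketch.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        if "xl/sharedStrings.xml" not in zf.namelist():
            return ""
        return _text_of(zf.read("xl/sharedStrings.xml"))


def extract_pptx_text(data):
    """Return the text of all slides of a PPTX file (hypothetical helper).

    Slides are visited in numeric file-name order, which is not guaranteed
    to match the presentation order.
    """
    slide_re = re.compile(r"^ppt/slides/slide(\d+)\.xml$")
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        slides = []
        for name in zf.namelist():
            match = slide_re.match(name)
            if match:
                slides.append((int(match.group(1)), name))
        slides.sort()
        return " ".join(_text_of(zf.read(name)) for _, name in slides)
```

Reading only a worksheet part of an XLSX file yields the integer indexes described above, and reading only `slide1.xml` of a PPTX file drops every other slide, which matches the two symptoms reported.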
Files
Updated by Xavier Perseguers about 2 years ago
- File G9-content-in-pptx-xlsx.patch added
- Status changed from New to In Progress
- % Done changed from 0 to 50
Suggested patch (applied locally via composer patch)
Updated by Gerrit Code Review about 1 month ago
- Status changed from In Progress to Under Review
Patch set 1 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/87441
Updated by Gerrit Code Review about 1 month ago
Patch set 2 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/87441
Updated by Gerrit Code Review about 1 month ago
Patch set 3 for branch main of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/87441
Updated by Gerrit Code Review 30 days ago
Patch set 1 for branch 13.4 of project Packages/TYPO3.CMS has been pushed to the review server.
It is available at https://review.typo3.org/c/Packages/TYPO3.CMS/+/87571
Updated by Benni Mack 30 days ago
- Status changed from Under Review to Resolved
- % Done changed from 50 to 100
Applied in changeset bb443592142c28a1cf756f490147dc6a5b7a8e4e.