
Bug #39866

HTMLContentExtractor does not remove internal style sheet information

Added by Dominik Steinborn 9 months ago. Updated 6 months ago.

Status: Closed
Start date: 2012-08-16
Priority: Should have
Due date:
Assignee: Ingo Renner
% Done: 100%
Category: Indexing
Target version: 2.8
TYPO3 Version:
Has patch:
PHP Version:
Tags:
Votes: 0

Description

When the content contains <style> tags, the CSS between the opening and closing tags gets indexed.

Example:

<style type="text/css">
p {color: #000;}
</style>

will be indexed as p {color: #000;}.

The patch attached to this report extends the method cleanContent to remove internal style sheet information.

class.tx_solr_htmlcontentextractor.patch (1.1 kB) Dominik Steinborn, 2012-08-16 16:30
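
For illustration only, the idea behind the fix can be sketched as follows. This is not the code from the attached patch (the extension itself is PHP, and the patch is not reproduced here); it is a minimal Python sketch that assumes the cleanup is regex-based. The function name clean_content and both patterns are hypothetical stand-ins. It removes each <style> element together with its contents before the remaining tags are stripped, so the CSS never reaches the indexed text.

```python
import re

# Hypothetical patterns, assumed for this sketch; not taken from the patch.
# Matches a whole <style ...>...</style> element, including its contents.
_STYLE_BLOCK = re.compile(r"<style\b[^>]*>.*?</style\s*>", re.IGNORECASE | re.DOTALL)
# Matches any remaining tag so that only text content is left.
_TAG = re.compile(r"<[^>]+>")


def clean_content(html: str) -> str:
    """Drop <style> elements with their CSS, strip other tags, collapse whitespace."""
    without_css = _STYLE_BLOCK.sub(" ", html)
    text = _TAG.sub(" ", without_css)
    return " ".join(text.split())


if __name__ == "__main__":
    sample = '<style type="text/css">\np {color: #000;}\n</style>\n<p>Hello</p>'
    print(clean_content(sample))
```

With the example from the description followed by a paragraph, only the paragraph text is left. Stripping tags alone, without the first substitution, would keep "p {color: #000;}" in the output, which is the behaviour reported here.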

Associated revisions

Revision ff942b76
Added by Dominik Steinborn 9 months ago

[BUGFIX] HTMLContentExtractor does not remove internal CSS

Resolves: #39866

History

Updated by Ingo Renner 9 months ago

  • Priority changed from Must have to Should have
  • Target version set to 2.8

Updated by Ingo Renner 9 months ago

  • Category changed from Solr TYPO3 Plugin to Indexing
  • Status changed from New to Resolved
  • Assignee set to Ingo Renner
  • % Done changed from 0 to 100

Fixed in ff942b76a66d4dda5c38a3fe68cec378797d141b

Thanks!

Updated by Ingo Renner 6 months ago

  • Status changed from Resolved to Closed
