Bug #17799
closedExternal files indexing wrong with crawler when inserted in access-restricted tt_content elements
0%
Description
When you place a tt_content element with access restriction (FE groups) within a page, and this tt_content contains links to external files (like in a "Filelinks" one), the indexing is wrong.
Suppose a good crawler configuration for FE groups in place. The crawler may index each page once for every combination of FE Groups suitable of accessing the page. This is, if you have 4 possible combinations, one single content element may be indexed 4 times, even if its indexing information is the same all the times.
This runs smoothly using tt_content elements with no external files. But when files come into place they are only indexed once. That is, they should have been indexed up to 4 times but only 1 combination is stored.
This causes some FE groups being unable to get results from those files, and causes a big problem.
This does not happen when the access restrictions are page-wide. In that case, everything runs fine.
The sample configuration for the crawler is (Page TSConfig):
tx_crawler.crawlerCfg.paramSets.mypages =
tx_crawler.crawlerCfg.paramSets.mypages {
cHash = 1
procInstrFilter = tx_indexedsearch_reindex, tx_indexedsearch_crawler
baseUrl = http://mydomain/
}
tx_crawler.crawlerCfg.paramSets.grupo5 < tx_crawler.crawlerCfg.paramSets.mypages
tx_crawler.crawlerCfg.paramSets.grupo5 {
userGroups = 0,-2,5
}
tx_crawler.crawlerCfg.paramSets.grupo6 < tx_crawler.crawlerCfg.paramSets.mypages
tx_crawler.crawlerCfg.paramSets.grupo6 {
userGroups = 0,-2,6
}
tx_crawler.crawlerCfg.paramSets.grupo5y6 < tx_crawler.crawlerCfg.paramSets.mypages
tx_crawler.crawlerCfg.paramSets.grupo5y6 {
userGroups = 0,-2,5,6
}
(issue imported from #M6736)