Crawling embedded documents

Question

I have documents that contain embedded documents and I would like to crawl them and index as separate documents.

Answer

By default the LucidWorks Search crawler will flatten the embedded documents. In order to change this behavior you can follow these steps:

1. With a text editor open the defaults.yml file under [LWE_HOME]/conf/lwe-core/

2.Change:
datasource.tika.parsers.flatten.compound: true
to
datasource.tika.parsers.flatten.compound: false

3. Save the defaults.yml file

4. Restart the LucidWorks Processes

Have more questions? Submit a request

0 Comments

Please sign in to leave a comment.
Powered by Zendesk