Having valid XML is imperative if you want your LucidWorks connector to get all your documents. A little known problem is that if you have things like special characters in your XML <- ie: is invalid XML, the connector can't handle this problem. It has to do with the underlying java code which essentially is going to throw an exception and not read the file anymore. Because of this, you may be missing a lot of documents. Check your connectors.xxx.log and it should tell you the Solr document and line number in the XML document that is problematic. But you may want to do a validation of your XML before you send it to the connector. Missing documents can be a squirrelly problem to figure out.
Have more questions? Submit a request