[daisy] Depth of Office document search
Bruno Dumon
bruno at outerthought.org
Thu Sep 6 03:22:48 CDT 2007
On Tue, 2007-09-04 at 14:41 -0400, Thomas.Connally at dds.net wrote:
>
> Is there a limit to the depth of a search on an Office document? That
> is, if I have a thousand page Office document, will the search
> encompass the entire document, or does it only search the first X
> number of pages/characters?
Strange question; have you encountered such a limitation?
Unless there are bugs in the text extractors, they should return the
full content of the documents.
There is, however, a configurable limit on the maximum size of documents
on which Daisy will perform text extraction. By default it is 20 MB. It
can be changed using this element in the configuration of the
fullTextIndexUpdater:
<!-- value in bytes -->
<dataMaxSize>20000000</dataMaxSize>
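As a sketch, raising the limit to about 50 MB would mean setting the same
element to a larger byte count. The 50000000 value is only an illustration,
and the surrounding fullTextIndexUpdater configuration is left out here
because its exact layout depends on your installation:
<!-- value in bytes; 50000000 is an example value, not a recommendation -->
<dataMaxSize>50000000</dataMaxSize>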
--
Bruno Dumon http://outerthought.org/
Outerthought - Open Source, Java & XML Competence Support Center
bruno at outerthought.org bruno at apache.org