[daisy] Depth of Office document search

Bruno Dumon bruno at outerthought.org
Thu Sep 6 03:22:48 CDT 2007


On Tue, 2007-09-04 at 14:41 -0400, Thomas.Connally at dds.net wrote:
> 
> Is there a limit to the depth of a search on an Office document? That
> is, if I have a thousand page Office document, will the search
> encompass the entire document, or does it only search the first X
> number of pages/characters? 

Strange question; have you actually run into such a limitation?

Unless there are bugs in the text extractors, they should return the
full content of the documents.

There is, however, a configurable limit on the maximum size of documents
on which Daisy will perform text extraction. By default it is 20 MB. It
can be changed using this element in the configuration of the
fullTextIndexUpdater:

<!-- value in bytes -->
<dataMaxSize>20000000</dataMaxSize>
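
To check in advance whether a given file would fall under that limit, a
minimal sketch along these lines may help. It is not part of Daisy: the
helper names are hypothetical, and it assumes the limit is compared against
the raw byte size of the file and that a size exactly equal to the limit is
still accepted. Note that the default of 20000000 bytes is a decimal 20 MB,
slightly less than 20 MiB (20971520 bytes).

```python
import os

# Default from the configuration shown above, in bytes.
DEFAULT_DATA_MAX_SIZE = 20_000_000


def within_extraction_limit(size_bytes, data_max_size=DEFAULT_DATA_MAX_SIZE):
    """Hypothetical helper: True if a document of this size is within the limit.

    Assumption: a size exactly equal to the limit is still accepted.
    """
    return size_bytes <= data_max_size


def file_within_extraction_limit(path, data_max_size=DEFAULT_DATA_MAX_SIZE):
    """Hypothetical helper: apply the same check to a file on disk."""
    return within_extraction_limit(os.path.getsize(path), data_max_size)
```

For example, file_within_extraction_limit("report.doc") would tell you
whether a local file named report.doc (an illustrative name) is small enough
under the default setting.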

-- 
Bruno Dumon                             http://outerthought.org/
Outerthought - Open Source, Java & XML Competence Support Center
bruno at outerthought.org                          bruno at apache.org


