[daisy] Importing existing static HTML sites into daisy

Paul Focke paul at outerthought.org
Fri Jan 4 02:22:29 CST 2008


Happy New Year

If I could give you one word of advice, sunscreen would be it. Well, not
really sunscreen, but do remember to run the htmlcleaner on the HTML
before dumping it into daisy. Do a test run first: have the cleaner go
over all the HTML files and report any errors, since you can sometimes
find some really weird HTML out there.
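
To give an idea of what such a test run could look like, here is a rough
sketch in Python. It does not call daisy's htmlcleaner; it uses the
standard html.parser module as a stand-in to flag unbalanced tags, and
the names TagBalanceChecker, check_html and dry_run are made up for this
example. It also reports optional end tags (an unclosed <p> or <li>) as
problems, so expect some noise.

```python
from html.parser import HTMLParser
from pathlib import Path

# Elements that never take a closing tag in HTML.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "param", "source", "track", "wbr"}


class TagBalanceChecker(HTMLParser):
    """Stand-in pre-check (not daisy's htmlcleaner): tracks unbalanced tags."""

    def __init__(self):
        super().__init__()
        self.open_tags = []  # stack of (tag, line number where it opened)
        self.problems = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.open_tags.append((tag, self.getpos()[0]))

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if not any(open_tag == tag for open_tag, _ in self.open_tags):
            self.problems.append(f"line {self.getpos()[0]}: stray </{tag}>")
            return
        # Pop back to the matching open tag, reporting everything skipped.
        while self.open_tags:
            open_tag, line = self.open_tags.pop()
            if open_tag == tag:
                break
            self.problems.append(f"line {line}: <{open_tag}> never closed")


def check_html(text):
    """Return a list of problem descriptions for one HTML string."""
    checker = TagBalanceChecker()
    checker.feed(text)
    checker.close()
    for open_tag, line in checker.open_tags:
        checker.problems.append(f"line {line}: <{open_tag}> never closed")
    return checker.problems


def dry_run(root):
    """Check every .html/.htm file under root; return {path: [problems]}."""
    report = {}
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in {".html", ".htm"}:
            text = path.read_text(encoding="utf-8", errors="replace")
            problems = check_html(text)
            if problems:
                report[str(path)] = problems
    return report
```

Calling dry_run on the directory of a scraped site gives you a list of
files to look at by hand before anything gets imported.
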
If you feel comfortable doing this in JavaScript, it is quite easy. I
remember doing something vaguely similar for a project we worked on
here, but it didn't rewrite links (<a href=""> and <img src="">) like you
would need to (I'm assuming there are links between the pages). I guess
that might be the tricky part of the exercise.
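
For the link rewriting, something along these lines might work. It is a
sketch in Python rather than JavaScript, and it is not the code from
that project. It assumes you have already built a mapping from each old
site-relative path to its new link target; the mapping, the names
LinkRewriter and rewrite_links, and the "daisy:101" style targets in the
usage note are placeholders for illustration. External links and pure
#fragment links are left alone.

```python
import html
import posixpath
from html.parser import HTMLParser
from urllib.parse import urlsplit

# Which attribute carries the link for each tag we care about.
LINK_ATTRS = {"a": "href", "img": "src"}


class LinkRewriter(HTMLParser):
    """Re-emits the HTML, swapping mapped <a href> and <img src> values."""

    def __init__(self, page_path, mapping):
        super().__init__(convert_charrefs=False)  # keep &amp; etc. as written
        self.page_path = page_path  # site-relative path of the page itself
        self.mapping = mapping      # old site-relative path -> new target
        self.out = []

    def _rewrite(self, url):
        parts = urlsplit(url)
        if parts.scheme or parts.netloc or not parts.path:
            return url  # external link or pure fragment: leave untouched
        base = posixpath.dirname(self.page_path)
        target = posixpath.normpath(posixpath.join(base, parts.path))
        new = self.mapping.get(target)
        if new is None:
            return url  # not part of the import: leave untouched
        return new + ("#" + parts.fragment if parts.fragment else "")

    def _emit_start(self, tag, attrs, closer):
        name = LINK_ATTRS.get(tag)
        rewritten = [(k, self._rewrite(v) if k == name and v else v)
                     for k, v in attrs]
        if rewritten == attrs:
            self.out.append(self.get_starttag_text())  # unchanged: keep raw
            return
        parts = [tag]
        for k, v in rewritten:
            parts.append(k if v is None else f'{k}="{html.escape(v, quote=True)}"')
        self.out.append("<" + " ".join(parts) + closer + ">")

    def handle_starttag(self, tag, attrs):
        self._emit_start(tag, attrs, "")

    def handle_startendtag(self, tag, attrs):
        self._emit_start(tag, attrs, " /")

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")

    def handle_pi(self, data):
        self.out.append(f"<?{data}>")


def rewrite_links(page_path, text, mapping):
    """Return text with mapped links replaced, resolved relative to page_path."""
    rewriter = LinkRewriter(page_path, mapping)
    rewriter.feed(text)
    rewriter.close()
    return "".join(rewriter.out)
```

For example, with mapping = {"docs/intro.html": "daisy:101"}, calling
rewrite_links("docs/index.html", '<a href="intro.html#top">Intro</a>',
mapping) turns the href into "daisy:101#top". This means the import has
to happen in two passes: create all the documents first so the mapping
is complete, then rewrite and update each page.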

Paul

On Thu, 2008-01-03 at 14:26 -0800, Kealy, John wrote:
> Hello all,
> 
> I'm beginning the phase 2 portion of rolling out daisy at UCSF. We are
> looking at rolling a lot (~100) of small static HTML sites into daisy
> in the next six months. Most of the sites have fewer than 100 documents
> and images. Has anyone come up with a method to "scrape" a site from a
> URL (or file system) and dump it into daisy? I think it could be done,
> but I haven't even begun to seriously address this problem...
> 
> Any advice?
> 
> Happy 2008 to all!
> 
> Best regards,
> 
> John Kealy

