[p2p-hackers] Decentralized search engines

Antoine Pitrou solipsis at pitrou.net
Wed Dec 7 17:17:49 UTC 2005


Hi Gwendal :)

Le mercredi 07 décembre 2005 à 17:36 +0100, SIMON Gwendal RD-MAPS-ISS a
écrit :
> By the way, one first challenge is the implementation of a nice
> crawler for owned documents : an indexer. This indexer should be able
> to scan and retrieve words from various documents
> (.html, .doc, .pdf, ...). It should be light and run in idle time and,
> if possible, be cross-platform. If you know a good open-source
> indexer, please let us know.

You can look at the techniques used by Beagle :
http://beaglewiki.org/
or Kat :
http://kat.mandriva.com/
or the Gnome Deskbar applet :
http://live.gnome.org/DeskbarApplet
http://raphael.slinckx.net/deskbar/

Regards

Antoine.





More information about the P2p-hackers mailing list