[p2p-hackers] Decentralized search engines
John Casey
john.casey at gmail.com
Mon Dec 19 10:45:52 UTC 2005
Hi Simon, have you thought of using the Apache group's Lucene search
engine and crawler? http://lucene.apache.org/java/docs/index.html
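
To make the indexer request in the quoted message concrete, here is a minimal sketch in Python of what such an indexer does: pull the words out of a document and record which documents contain them (an inverted index). It is only an illustration under stated assumptions. It handles HTML only, uses nothing beyond the standard library, and is neither Lucene's nor Maay's API. The names TextExtractor, extract_words, build_index and search are hypothetical.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def extract_words(html):
    """Return the lowercased words found in the visible text of an HTML page."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return re.findall(r"\w+", " ".join(parser.chunks).lower())


def build_index(documents):
    """Build an inverted index (word -> set of document ids).

    `documents` maps a document id (hypothetical: a path or URL) to HTML text.
    """
    index = defaultdict(set)
    for doc_id, html in documents.items():
        for word in extract_words(html):
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every word of the query."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results
```

A node would call build_index on its published documents (for example, the pages in its browser cache) and answer incoming queries with search. Other formats such as .doc or .pdf would need their own text extractors in place of TextExtractor.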
On 12/8/05, SIMON Gwendal RD-MAPS-ISS <gwendal.simon at francetelecom.com> wrote:
> Unlike traditional file-sharing approaches, a decentralized search for the web should take into account the words inside the documents.
>
> As previously said, we are working on a system named Maay, which aims to perform decentralized and personalized search over a distributed set of text documents.
>
> http://maay.netofpeers.net
>
> Each node (that is, each computer) can publish a set of documents. This information space does not initially contain the web. Our idea is that the cache (or history) of the web browser should be included, by default, in the published set of documents. So every page that has been visited by at least one person in the last x days will be available in the network. Obviously, the more popular a page is, the more available it is.
>
> By the way, one of the first challenges is implementing a good crawler for a node's own documents: an indexer. This indexer should be able to scan and extract words from various document types (.html, .doc, .pdf, ...). It should be lightweight, run during idle time and, if possible, be cross-platform. If you know of a good open-source indexer, please let us know.
>
>
> -- Gwendal
>
> > -----Original Message-----
> > From: p2p-hackers-bounces at zgp.org
> > [mailto:p2p-hackers-bounces at zgp.org] On behalf of Ludovic Courtès
> > Sent: Wednesday, December 7, 2005 17:19
> > To: strib at MIT.EDU
> > Cc: Peer-to-peer development.; zooko at zooko.com
> > Subject: [p2p-hackers] Decentralized search engines
> >
> > Hi,
> >
> > Jeremy Stribling <strib at amsterdam.lcs.mit.edu> writes:
> >
> > > Working on it. Should have something public within a few months:
> > >
> > > http://pdos.csail.mit.edu/papers/overcite:iptps05/index.html
> >
> > Indeed, that seems very promising!
> >
> > Similarly, are there people working on decentralized web indexing and
> > search engines? To paraphrase Zooko, it would be nice to decentralize
> > Google before it is too late...
> >
> > Thanks,
> > Ludovic.
> > _______________________________________________
> > p2p-hackers mailing list
> > p2p-hackers at zgp.org
> > http://zgp.org/mailman/listinfo/p2p-hackers
> > _______________________________________________
> > Here is a web page listing P2P Conferences:
> > http://www.neurogrid.net/twiki/bin/view/Main/PeerToPeerConferences
> >