[p2p-hackers] Oblivious information dispersal?
Michael J. Freedman
mfreed at cs.nyu.edu
Sun Aug 28 21:19:44 UTC 2005
On Sun, 28 Aug 2005, Paul Campbell wrote:
> PIR also adds significant computational overhead and I just haven't seen a
> similar head-on comparison which doesn't get into really ridiculous amounts of
> bandwidth, although potentially it could be competitive.
The main goal of PIR schemes is o(n) bandwidth overhead, where n is the
size of the database. (Otherwise, a linear-bandwidth PIR solution is
trivial: send the entire database.) In '99, Cachin, Micali, and Stadler
presented the first polylog PIR scheme.
http://citeseer.ist.psu.edu/cachin99computationally.html
Much more recently (ICALP 2005), Gentry and Ramzan presented a PIR scheme
with constant bandwidth.
Still, a *costly* part of PIR schemes is the computation overhead (which
is generally linear in the size of the database), not the communication
overhead which protocols have mostly sought to minimize.
> In my mind, this could potentially do two things. First, it would be yet
> another route to do PIR...users could directly make queries but since the
> nature of the query is spread out over a dozen or more nodes, and the bits
> by themselves are fractional, it's still not possible to tell computationally
> what the goal of the query is. Second, it makes plausible deniability very
> easy to achieve...the database itself can't really tell what's in it's own
All of these protocols above, and actually what I think this thread has
centered about, are about computationally-secure single-server PIR.
In fact, most work in PIR has focused on info-theoretically-secure
multi-server PIR, in which databases are replicated across multiple
servers, and the protocol is effectively taking XORs of various shares of
the servers to recombine into the original data. Still, the best results
to date are not yet really practical.
http://www.cs.technion.ac.il/~yuvali/pubs/BIKR02.ps
What I think you are proposing is a more ad-hoc technique that these
protocols: Given a file, break it up into shares (by a secret sharing
scheme, for instance, or just XORing random bits) and spread those shares
across servers. Or, to add more "plausible deniability" by making those
shares not linked to any specific file, mix the shares from various files
together, so that Britney and the Declaration of Independence can both be
regenerated from some share s_i. Some examples of such systems include:
Private message service
http://www.inf.fu-berlin.de/~berthold/publ/BCKP_02.pdf
Tangler
http://www.cs.nyu.edu/~waldman/tangler.ps
See the related work in these papers for more citations.
However, the general difference between these proposals and PIR---and
similarly between MIX/proxy-based schemes and PIR as well---is that the
latter's security is formally provable via traditional reduction-based
techniques, while the former's security is ad-hoc and ill-defined. Of
course, this formal security comes at the cost of performance.
--mike
-----
www.michaelfreedman.org www.coralcdn.org
More information about the P2p-hackers
mailing list