<?xml version="1.0" encoding="utf-8"?><!-- generator="wordpress/2.2.2" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Millionaire&#8217;s Problem</title>
	<link>http://www.bryanmills.net/archives/2007/10/millionaires-problem/</link>
	<description></description>
	<pubDate>Thu, 09 Sep 2010 06:42:05 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2.2</generator>

	<item>
		<title>By: Samrobb</title>
		<link>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2836</link>
		<author>Samrobb</author>
		<pubDate>Sat, 17 Nov 2007 07:11:40 +0000</pubDate>
		<guid>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2836</guid>
		<description>I've thought about the problem ("Is X interested in Y") in terms of sparse bloom filters, with an agreement on how data is canonicalized before it is hashed.  This allows you to construct a binary blob that you can hand to me, and that I can then query, without knowing exactly what data went in to making up the blob.  It still requires some OOB data - but that's an agreement on protocol, not subject.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve thought about the problem (&#8221;Is X interested in Y&#8221;) in terms of sparse bloom filters, with an agreement on how data is canonicalized before it is hashed.  This allows you to construct a binary blob that you can hand to me, and that I can then query, without knowing exactly what data went in to making up the blob.  It still requires some OOB data - but that&#8217;s an agreement on protocol, not subject.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The Doctor What</title>
		<link>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2641</link>
		<author>The Doctor What</author>
		<pubDate>Mon, 05 Nov 2007 21:46:33 +0000</pubDate>
		<guid>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2641</guid>
		<description>&lt;b&gt;Re: the hashing for album collections:&lt;/b&gt;

How about using &lt;a href="http://en.wikipedia.org/wiki/W-shingling" rel="nofollow"&gt;shingles&lt;/a&gt; instead?  They can be compared.  It's used in search engines a lot.

&lt;b&gt;Re: millionares problem&lt;/b&gt;

I remember a paper that came out around 1992 or so.  The problem was to do the classic "flip a coin, call it in the air" with two remote people and prevent lying.  It was similar, except that I actually understood (eventually) the math.

&lt;b&gt;Bug:&lt;/b&gt;

&lt;blockquote&gt;STEP 2: Bob takes the C value he computed and adds his wealth value J and subtracts one, C - J + 1 = 749 - 6 + 1 = 744. Bob then sends Alice this value.&lt;/blockquote&gt;

I assume that should have the words "adds" and "substracts" flipped...

Ciao!</description>
		<content:encoded><![CDATA[<p><b>Re: the hashing for album collections:</b></p>
<p>How about using <a href="http://en.wikipedia.org/wiki/W-shingling" rel="nofollow">shingles</a> instead?  They can be compared.  It&#8217;s used in search engines a lot.</p>
<p><b>Re: millionares problem</b></p>
<p>I remember a paper that came out around 1992 or so.  The problem was to do the classic &#8220;flip a coin, call it in the air&#8221; with two remote people and prevent lying.  It was similar, except that I actually understood (eventually) the math.</p>
<p><b>Bug:</b></p>
<blockquote><p>STEP 2: Bob takes the C value he computed and adds his wealth value J and subtracts one, C - J + 1 = 749 - 6 + 1 = 744. Bob then sends Alice this value.</p></blockquote>
<p>I assume that should have the words &#8220;adds&#8221; and &#8220;substracts&#8221; flipped&#8230;</p>
<p>Ciao!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bryan Mills</title>
		<link>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2549</link>
		<author>Bryan Mills</author>
		<pubDate>Wed, 31 Oct 2007 20:24:07 +0000</pubDate>
		<guid>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2549</guid>
		<description>Agreed, the interesting item to know is that I own a Neil Diamond album or just that I have some interest in X.  Using the music taste example its really not important that you know that I own an album but more important to know that people (aggregate) that own similar albums as you do also own some other album you don't have.  This means you need a way of grouping people without revealing identities.

I've looked at are partial hashing of collections in the hope of grouping multiple people together.  You can't use exact hashing because I might own 1 album different than you which would cause us not to be grouped.  If you do partial hashing of your collection of music then you can group with one another and then having a mapping to other hash values that contain other albums.  The hash works as just a way of aggregating preferences in an anonymous way.  I actually have a working prototype of this and will try to post this soon.

The problem with my little prototype is it current looks at all permutations of your album collection to and creates hash entries for each.  I've looking at using two hash functions and creating a 2d space where you then map your entries into that instead of the 1d hash function.  Then you can do things like range queries over the 2d space that should help cut down the number of permutations needed.  This still needs some more thought, although if your interested you should look at how you can do range queries using a DHT.</description>
		<content:encoded><![CDATA[<p>Agreed, the interesting item to know is that I own a Neil Diamond album or just that I have some interest in X.  Using the music taste example its really not important that you know that I own an album but more important to know that people (aggregate) that own similar albums as you do also own some other album you don&#8217;t have.  This means you need a way of grouping people without revealing identities.</p>
<p>I&#8217;ve looked at are partial hashing of collections in the hope of grouping multiple people together.  You can&#8217;t use exact hashing because I might own 1 album different than you which would cause us not to be grouped.  If you do partial hashing of your collection of music then you can group with one another and then having a mapping to other hash values that contain other albums.  The hash works as just a way of aggregating preferences in an anonymous way.  I actually have a working prototype of this and will try to post this soon.</p>
<p>The problem with my little prototype is it current looks at all permutations of your album collection to and creates hash entries for each.  I&#8217;ve looking at using two hash functions and creating a 2d space where you then map your entries into that instead of the 1d hash function.  Then you can do things like range queries over the 2d space that should help cut down the number of permutations needed.  This still needs some more thought, although if your interested you should look at how you can do range queries using a DHT.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Samrobb</title>
		<link>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2507</link>
		<author>Samrobb</author>
		<pubDate>Mon, 29 Oct 2007 17:58:11 +0000</pubDate>
		<guid>http://www.bryanmills.net/archives/2007/10/millionaires-problem/#comment-2507</guid>
		<description>This is near, but it took me a while to identify something that was bugging me about it: it's simply a way to compare two numbers.  The fact that we're comparing monetary amounts is OOB data that needs to be communicated and agreed on between the two parties.  I think that it's generally the OOB data that's the interesting part of the interaction.  Going back to a previous post of yours.. I don't really care how many Neil Diamond albums you own.  The fact that you do own a Neil Diamond album, on the other hand, is an interesting bit (literally) of data.  In order to ask/answer that question, though, we still need to agree on the subject.

For me, an interesting problem is figuring out how to answer those binary questions without revealing additional information, *including* the OOB information about the subject.</description>
		<content:encoded><![CDATA[<p>This is near, but it took me a while to identify something that was bugging me about it: it&#8217;s simply a way to compare two numbers.  The fact that we&#8217;re comparing monetary amounts is OOB data that needs to be communicated and agreed on between the two parties.  I think that it&#8217;s generally the OOB data that&#8217;s the interesting part of the interaction.  Going back to a previous post of yours.. I don&#8217;t really care how many Neil Diamond albums you own.  The fact that you do own a Neil Diamond album, on the other hand, is an interesting bit (literally) of data.  In order to ask/answer that question, though, we still need to agree on the subject.</p>
<p>For me, an interesting problem is figuring out how to answer those binary questions without revealing additional information, *including* the OOB information about the subject.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
