Representing Sets of Pure Order
If a man can expect to meet exactly N eligible women in his
life, what strategy should he use to maximize his chances of
choosing the best one? This is sometimes called "Bachelor's
Problem". The usual answer is that for sufficiently large N
he should wait until he has seen N/e candidates and then select
the next candidate that is better than any he's seen previously.
This gives a probability of about 1/e for selecting the best
candidate. (However, see the note Optimizing Your Wife for
a critique of this strategy).
More interesting than the problem itself are the assumptions
on which it's based. Obviously it's assumed the man can never
go back and rekindle a romance once it's been terminated, and
he can't have more than one relationship at a time. Also,
it's assumed that he meets the prospective brides in random
order of suitability. These assumptions are certainly
debateable in real life, but they're at least conceivable.
This brings us to the most interesting assumption underlying
the Bachelor's Problem: he can only discern RELATIVE suitability
between two "known quantities"; he has no means whatsoever of
assessing absolute suitability. For example, even if he's
deliriously happy with the first woman, he still has to assume
she may be the LEAST suitable, because he hasn't met any of the
others, and for all he knows, EVERY woman may be as nice, or
nicer, than this first one.
Even after he breaks up with the first woman and has a chance to
evaluate the second woman, we're asked to assume that he still
can't deduce anything about the underlying population (so to
speak). He can determine which of those two was better, but he
has no idea how these two women stand relative to the women he
has yet to meet. The same is true with the third and fourth
and so on. At each stage he's only able to determine the
RELATIVE ORDER of the women he has evaluated so far.
This is a very interesting premise, and it has clearly taxed the
ingenuity of problem-posers as they try to present a situation
in which this assumption would be strictly valid. For example,
in a recent email message someone presented the problem in
the form:
"Suppose you have a bag with N pieces of paper in it.
On every piece of paper there is a real number. The
numbers are all different and there is no upper or
lower bound. You draw out numbers one at a time until
you decide to stop (or you've drawn all the numbers),
and the last number you drew is your selection. What
strategy will maximize your chances of selecting the
largest number in the bag?"
It's often been observed that this type of formulation is
somewhat ill-posed, because there's no uniform probability
distribution over the set of all numbers. The very fact
that the N numbers have somehow been chosen implies that they
were drawn from some non-uniform distribution, which in turn
suggests that perhaps some information about the absolute
ordering of the numbers can be inferred from the early samples.
However, since we're not told what that distribution is, we
can't quantify any such inference. In it's simplest form
this connundrum is exposed in the notorious "High-Low Number
Problem" (see Choice Without Context?).
People who worry about things like this usually eliminate the
problematical aspect of the question by stating that when a
number is drawn the man is not allowed to actually see it.
Instead, a disinterested third party looks at the number and
tells the man where it falls, in order, relative to the numbers
already drawn. Thus the man really has access only to the
relative ordering, and knows nothing about the absolute
ordering.
This is okay, but it makes me wonder if it's the only way in
which purely relative ordering can be represented. Why do we
need to introduce a third party? Surely it must be possible to
create N packets of information such that we can unambiguously
establish the relative order of any packets we've examined, but
we have zero information about their positions relative to the
packets we have not examined.
The simplest way I can see to do this is not very simple. For
concretness, suppose we have 5 cards on which we want to place
marks that will allow relative ordering but that will convey zero
information about absolute order within the set of 5. Define ten
symbols, let's say the letters A through J, and put four symbols
on each card as follows
Card 1 Card 2 Card 3 Card 4 Card 5
------ ------ ------ ------ ------
A A B C D
B E F E G
C F H H I
D G I J J
The four letters on each card should be randomly arranged,
and we should apply a randomly selected permutation to the 10
characters. Anyway, each card has exactly one character in
common with each other card. Now to each of the 10 characters
we assign a number randomly selected from the set {0,1,2}.
This number is placed next to the corresponding character
on the first card where it appears, and the number j+1 (mod 3)
is placed next to the character on the second card where it
appears. The result is a set of 5 cards that look like this
Card 1 Card 2 Card 3 Card 4 Card 5
------ ------ ------ ------ ------
E 0 G 1 D 0 G 2 J 1
B 0 H 0 C 1 C 2 F 1
H 2 D 2 E 1 B 1 A 2
A 1 J 0 F 0 I 2 I 0
Given any two cards, the relative order is determined by comparing
the numbers n,m written next to the common character. The number
n comes before m if and only if m-n = 1 (mod 3). For example,
given Cards 2 and 5 we would note that the common character is J,
for which the numbers are 0 and 1 respectively. This shows that
Card 2 comes before Card 5. But nothing on these two cards gives
us any clue as to their order relative to the remaining three
cards.
This seems overly complicated, and there's probably a more
economical way of doing it, but the fact that it can be done is
interesting. In particular, this can be used to define a set of
n purely ordered elements, for any natural number n. All that's
required is that each element is characterized by n-1 characters
(and associated digits) from an alphabet of n(n-1)/2 characters.
In a sense, we could define these "cards" as "numbers", and they
closely correspond to the set of numbers [1,2,..,n] (i.e., each
element except the first has a unique predecessor, and so on).
What happens if we try to let n go to infinity? This seems
fairly straightforward for the set [1,..,n]. We just imagine
it extending to infinitely large values of n, and we arrive
at our notion of the complete infinite set "N" of ALL natural
numbers. However, for our purely ordered set of "cards" it
seems a bit trickier. Presumably we CANNOT extrapolate this
to a complete infinite set of purely ordered elements, because
that would contradict the fact that there is no uniform
distribution on the natural numbers.
What prevents us from conceiving an infinite set of purely
ordered elements? We might think the problem is that we don't
have a large enough alphabet to cover infinitely many cards, but
why not just use the natural numbers themselves? In other words,
A=1, B=2, and so on. We might begin to construct our infinite
set of purely ordered numbers like this
Card1 Card2 Card3 Card4 Card5 Card6 Card7 Card8 Card9
------ ----- ----- ----- ----- ----- ----- ----- -----
2 2 4 8 16 32 64 128 256
4 3 9 27 81 243 729 2187
8 9 5 25 125 625 3125
16 27 25 7 49 343
32 81 625 49 11
64 243 3125 343 etc.
128 729 15625
256 2187
512
Clearly each card shares exactly one "character" with each
other card, and we can uniformly select an "order code" from
the set {0,1,2} for each of these characters. So now all that
remains is to apply an arbitrary permutation to the labels of
these characters. But this is where we run into a problem,
because to perform a truly arbitrary permutation of our infinite
set of charcters we would need a uniform distribution on all
these characters, which we don't have.
Nevertheless, this illustrates an interesting point. We can
imagine infinitely many orderings of the elements of an infinite
set, but evidently we can't conceive of ALL POSSIBLE orderings of
an infinite set, at least not in the symmetrical way that the
symmetry of the definitions would lead us to expect. In other
words, even though no individual element and no particular ordering
of the elements of the set N (for example) is a_priori preferred
or distinguished over any other element or ordering, it appears that
some orderings of the natural numbers are more natural than others.
Return to MathPages Main Menu
Сайт управляется системой
uCoz