I have not been following this thread carefully. Just in case you are
interested in further reading, you could check this paper:
"A Bi-Level Bernoulli Scheme for Database Sampling"
Peter Haas, Christian Koenig (SIGMOD 2004)
--
Pip-pip
Sailesh
http://www.cs.berkeley.edu/~sailesh