Strong and weak sampling

From HandWiki

Strong and weak sampling are two sampling approach[1] in Statistics, and are popular in computational cognitive science and language learning.[2] In strong sampling, it is assumed that the data are intentionally generated as positive examples of a concept,[3] while in weak sampling, it is assumed that the data are generated without any restrictions.[4]

Formal Definition

In strong sampling, we assume observation is randomly sampled from the true hypothesis:

P(x|h)={1|h|, if xh0, otherwise

In weak sampling, we assume observations randomly sampled and then classified:

P(x|h)={1, if xh0, otherwise

Consequence: Posterior computation under Weak Sampling

P(h|x)=P(x|h)P(h)hP(x|h)P(h)={P(h)h:xhP(h), if xh0, otherwise

Therefore the likelihood P(x|h) for all hypotheses h will be "ignored".

References