... a t is, if the training and test sets are extracted from the same corpus, they will probably contain the same kind of errors in the same kind of situations This may cause the training procedure ... uncertainty in the evaluation m a y be, in some cases, larger than the reported improvements from one system to another, so invalidating the conclusions of the comparison Model since the tagger ... of u If the test and training corpora are independent, the probability of making the same error, given that both are wrong, will be the random / ( a - ) If the corpora are not independent, the...