Reproducing results: how big is the problem?