This is a seminal paper on reproducibility in cancer biology. It should be a gold standard for reproducible research work. Therefore, it should be attempted to reproduce it. Supposedly, this will be pretty easy to reproduce and can be used as a *positive control* in repro hacks!
The methods are widely applicable to other DNA sequence clustering problems. Someone may obtain contradicting results with a new algorithm. In such a case, rerunning our scripts on the same or new data may help elucidate the source of the differences between the results.
I tried as hard as possible to make it reproducible, which it is on my computer. I would be happy to see if this still works on other computers. Moreover, by allowing easy reproducibility, I hope that other people may easily build research on top of this work.