The methods are broadly applicable to other DNA sequence clustering problems. If a new algorithm yields results that contradict ours, rerunning our scripts on the same or new data may help identify the source of the discrepancy.