Unsupervised identification of significant lineages of SARS-CoV-2 through scalable machine learning methods