cd-hit/4.8.1: a very widely used program for clustering and comparing protein or nucleotide sequences
Link to homepage: https://sites.google.com/view/cd-hit