Could someone explain to me how this cluster command works in the backend? I couldn't find any resource that explain the technique/algorithm behind this cluster command.
How does it cluster the matches (termlist/termset/ngramset)?
How is t be calculated? It doesn't seem to be probability based.
What kind of clustering algorithm it uses?
It would be the best if someone can explain the full algorithm for this cluster command. Much thanks
Bump