Type/token ratio

This tool estimates the lexical diversity of one or more corpora in the form of a progression curve.
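As a rough illustration of what such a progression curve measures, the sketch below computes a cumulative type/token ratio (distinct word forms divided by tokens read so far) at regular intervals through a text. It is not the tool's own implementation: the function name `ttr_curve`, the `step` parameter, and the simple lowercase word-splitting are assumptions made for this example, and a lemmatized corpus would count lemmas rather than raw word forms.

```python
import re

def ttr_curve(text, step=100):
    """Return (tokens_read, type_token_ratio) points, one every `step` tokens.

    Hypothetical helper for illustration only: tokens are lowercase runs of
    word characters, which is cruder than TXM lemmatization.
    """
    tokens = re.findall(r"\w+", text.lower())
    seen = set()      # distinct types encountered so far
    points = []
    for i, tok in enumerate(tokens, start=1):
        seen.add(tok)
        # Record a point at each interval and at the final token.
        if i % step == 0 or i == len(tokens):
            points.append((i, len(seen) / i))
    return points

sample = "the cat sat on the mat and the dog sat on the rug"
for n, ratio in ttr_curve(sample, step=4):
    print(n, round(ratio, 3))
```

Plotting the first value of each point against the second gives the curve. The ratio generally falls as more text is read, because repeated words accumulate faster than new ones, so curves are best compared at equal token counts.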


Load corpora in XML-TEI format (lemmatized with TXM, more accurate) or as UTF-8 TXT (less accurate)

To load one or more XML-TEI files lemmatized in advance with TXM, navigate to the folder
TXM-VERSION / corpora / NAME-OF-THE-CORPUS / txm / NAME-OF-THE-CORPUS