Wiktionary:Frequency lists/Esperanto/Wikipedia 2023


This list is based on the words found in the Esperanto Wikipedia dump of 2023-03-01. All words are reduced to their base form: plural -j and accusative -n are stripped, and the verb endings -as/-is/-os/-us/-u are changed to the infinitive -i. Each word is listed in its most typical case form (lower-case, capitalized, or all-caps). Proper names that have not been Esperantized are mostly omitted, unless they are listed in common dictionaries. The corpus contains more than 43 million words in total.
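The reduction to base forms described above can be sketched roughly as follows. This is only an illustration of the stated rules, not the script that was used to build the list: the function name `base_form` and both exception sets are hypothetical and deliberately incomplete, and the sketch lower-cases everything instead of keeping the most typical case form.

```python
# Hypothetical, incomplete exception list: function words whose final
# letters merely resemble grammatical endings and must stay unchanged.
INVARIANT = {"kaj", "en", "kun", "sen", "nun", "jen", "tuj", "tamen",
             "ĝis", "ĵus", "ĉu", "du", "unu", "plu", "ju", "nu"}

# Correlatives in -u take -j and -n but are not imperative verbs
# (also an assumed, incomplete list).
NOT_VERBS = {"tiu", "kiu", "ĉiu", "iu", "neniu"}

VERB_ENDINGS = ("as", "is", "os", "us", "u")


def base_form(word: str) -> str:
    """Reduce an Esperanto word form to a base form (naive sketch)."""
    w = word.lower()  # simplification: case information is discarded
    if w in INVARIANT:
        return w
    # Accusative -n comes last in the word, so strip it before plural -j:
    # "librojn" -> "libroj" -> "libro".
    if w.endswith("n"):
        w = w[:-1]
    if w.endswith("j"):
        w = w[:-1]
    if w in NOT_VERBS:
        return w
    # Finite verb endings are replaced by the infinitive ending -i.
    for ending in VERB_ENDINGS:
        if w.endswith(ending) and len(w) > len(ending):
            return w[: -len(ending)] + "i"
    return w
```

For example, `base_form("librojn")` gives `"libro"` and `base_form("estas")` gives `"esti"`. Without the exception sets, words such as "kaj" or "tiu" would be mangled, so a real implementation presumably needs a much longer list or a dictionary lookup.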


First hundred by frequency

Together these 100 words cover 51.93% of the whole corpus.
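A coverage figure of this kind is the share of all word occurrences in the corpus that belong to the N most frequent words. A minimal sketch of that calculation, assuming the base-form counts are already available in a `collections.Counter` (the function name `coverage` and the toy counts are illustrative, not taken from the actual corpus):

```python
from collections import Counter


def coverage(counts: Counter, top_n: int) -> float:
    """Percentage of all tokens accounted for by the top_n most frequent words."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    covered = sum(count for _, count in counts.most_common(top_n))
    return 100.0 * covered / total


# Toy data only; the real corpus has more than 43 million tokens.
toy_counts = Counter({"la": 5, "de": 3, "kaj": 2})
top_one = coverage(toy_counts, 1)   # 5 of 10 tokens
top_two = coverage(toy_counts, 2)   # 8 of 10 tokens
```

With the toy counts, the single most frequent word covers 50% of the tokens and the top two cover 80%.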

Second hundred

Together these 200 words cover 58.22% of the whole corpus.

Third hundred

Together these 300 words cover 62.10% of the whole corpus.

Fourth hundred

Together these 400 words cover 64.99% of the whole corpus.

Fifth hundred

Together these 500 words cover 67.32% of the whole corpus.

Frequency rank of 501–1000

Together these 1000 words cover 74.73% of the whole corpus.

Frequency rank of 1001–2000

Together these 2000 words cover 81.81% of the whole corpus.

Frequency rank of 2001–3000

Together these 3000 words cover 85.60% of the whole corpus.

Frequency rank of 3001–4000

Together these 4000 words cover 88.02% of the whole corpus.

Frequency rank of 4001–5000

Together these 5000 words cover 89.73% of the whole corpus.

Frequency rank of 5001–6000

Together these 6000 words cover 91.03% of the whole corpus.

Frequency rank of 6001–7000

Together these 7000 words cover 92.06% of the whole corpus.

Frequency rank of 7001–8000

Together these 8000 words cover 92.90% of the whole corpus.

Frequency rank of 8001–9000

Together these 9000 words cover 93.59% of the whole corpus.

Frequency rank of 9001–10000

Together these 10000 words cover 94.18% of the whole corpus.

Frequency rank of 10001–11000

Together these 11000 words cover 94.69% of the whole corpus.

Frequency rank of 11001–12000

Together these 12000 words cover 95.14% of the whole corpus.

Frequency rank of 12001–13000

Together these 13000 words cover 95.53% of the whole corpus.

Frequency rank of 13001–14000

Together these 14000 words cover 95.88% of the whole corpus.

Frequency rank of 14001–15000

Together these 15000 words cover 96.19% of the whole corpus.
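The cumulative figures can also be read as the gain contributed by each band of ranks. The sketch below derives those gains by simple subtraction from the percentages stated in the section summaries; the names `CUMULATIVE` and `marginal_gains` are illustrative, and results are rounded to two decimals because the inputs are.

```python
# Cumulative coverage copied from the section summaries on this page:
# number of top-ranked words -> percent of the corpus covered.
CUMULATIVE = {
    100: 51.93, 200: 58.22, 300: 62.10, 400: 64.99, 500: 67.32,
    1000: 74.73, 2000: 81.81, 3000: 85.60, 4000: 88.02, 5000: 89.73,
    6000: 91.03, 7000: 92.06, 8000: 92.90, 9000: 93.59, 10000: 94.18,
    11000: 94.69, 12000: 95.14, 13000: 95.53, 14000: 95.88, 15000: 96.19,
}


def marginal_gains(cumulative):
    """Return (first_rank, last_rank, gain_in_percentage_points) per band."""
    gains = []
    prev_rank, prev_pct = 0, 0.0
    for rank in sorted(cumulative):
        pct = cumulative[rank]
        gains.append((prev_rank + 1, rank, round(pct - prev_pct, 2)))
        prev_rank, prev_pct = rank, pct
    return gains


for first, last, gain in marginal_gains(CUMULATIVE):
    print(f"ranks {first}-{last}: +{gain:.2f} percentage points")
```

Read this way, the second hundred words add 6.29 percentage points on top of the first hundred, while the thousand words ranked 14001–15000 add only 0.31.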