skip to content
> abdulla_
...

Analyzing 2M Words Across 200+ Languages from Wiktionary

// 1 min read

Language is complex. Different languages, dialects, and constant external influences mean it never stays static — and that creates real measurement challenges for linguists.

I analyzed a dataset of 2 million words from Wiktionary spanning 200+ languages. In the dashboard I tried to explain the analysis in as plain language as possible. But since the subject is inherently complex, some parts might not be immediately obvious.

If you want to dig deeper, I’m sharing links to both the dashboard and a Medium post where I go into more detail.

Dashboard: https://lnkd.in/emhNTPWD Medium: https://lnkd.in/ePjFgYCH

P.S. — if the Medium post is long but you found the analysis useful, please like it. It helps other people find it.