The CLARIN language resources can be explored via the individual repositories of the institutions / centers participating in the network. The following repositories are available to researchers on Eastern Europe:
With the support of the German Research Foundation (DFG), the Linguistics Information Service (FID Linguistik) grants licences for individual corpora to researchers within Germany. Among others, corpora for the following languages are offered: Bulgarian, Czech, Estonian, Greek, Croatian, Latvian, Lithuanian, Polish, Romanian, Slovak, Slovenian, Hungarian.
Text+ is a consortium of the nationwide initiative to establish a national research data infrastructure (Nationale Forschungsdateninfrastruktur, NFDI). It focuses on the long-term preservation of text- and language-based research data and on enabling their broad use in science.
CLARIN Estonia (Eesti Keeleressursside Keskus / Center of Estonian Language Resources)
Eesti keele spontaanse kõne foneetiline korpus (Phonetic Corpus of Estonian Spontaneous Speech)
Murdekorpus – Eesti murrete korpus (Estonian dialects corpus)
Vana kirjakeele korpus (Corpus of the old written language)
The index curated by the Specialized Information Service for Finno-Ugric / Uralic Languages, Literatures and Cultures contains language corpora of individual Uralic languages represented in Siberia or in the Volga-Kama area and adjacent regions, among others.