TEITOK visualization and search interface for Northern Kurdish (Kurmanji)


Language nameNorthern Kurdish (Kurmanji)nort2641
Language familyIndo-Europeanindo1319
Corpus creatorHaig, Geoff and Vollmer, Maria and Thiele, Hanna
Translations providedEnglish
Glossesall
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Northern Kurdish (Kurmanji) DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Northern Kurdish (Kurmanji) DoReCo dataset in publications please cite

Haig, Geoff and Vollmer, Maria and Thiele, Hanna. 2024. Northern Kurdish (Kurmanji) DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/nort2641 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Northern Kurdish (Kurmanji) corpus:

GlossLGRMeaning
11first person
22second person
33third person
ADDnoneadditive
ADPnoneadposition
COMPLCOMPcomplementizer
COPCOPcopula
DEMDEMdemonstrative
DRCTnonedirectional
EMPHnoneemphatic
EXCLnoneexclamative
EZnoneezafe
FFfeminine
FUTFUTfuture
IMPIMPimperative
IMPERIMPimperative
INDINDindicative
INDEFINDFindefinite
INDFINDFindefinite
INFINFinfinitive
MMmasculine
MODnonemodality
NCnonenot considered
ncnonenot considered
NEGNEGnegation
OBLOBLoblique
PLPLplural
POPnonepostposition
POSSPOSSpossessive
PPRFnonepluperfect
PRFPRFperfect
PROnonepronoun
PROGPROGprogressive
PROGRPROGprogressive
PRSPRSpresent
PRTnoneparticle
PSTPSTpast
PTCPPTCPparticiple
RECIPnonereciprocal
REDnonereduplication
REDUPLnonereduplication
REFLREFLreflexive
SBJSBJVsubjunctive
SGSGsingular
SJBSBJVsubjunctive
SUBJSBJVsubjunctive
VOCVOCvocative