TEITOK visualization and search interface for Komnzo


Language nameKomnzokomn1238
Language familyYammore1255
Corpus creatorDöhler, Christian
Translations providedEnglish
Glossesall
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Komnzo DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Komnzo DoReCo dataset in publications please cite

Döhler, Christian. 2024. Komnzo DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/komn1238 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Komnzo corpus:

GlossLGRMeaning
11first person
22second person
33third person
2|3nonesecond or third person (syncretism)
ABLABLablative
ABSABSabsolutive
ADLZRnoneadjectivalizer
ALLALLallative
ALRnoneiamative 'already'
ANDnoneandative
ANIMnoneanimate
APPRnoneapprehensive
ASSOCnoneassociative
CHARnonecharacteristic case
DATDATdative
DEMDEMdemonstrative
DIMnonediminuitive
DISTDISTdistal
DISTRDISTRdistributive
DUDUdual
DURDURdurative
EMPHnoneemphatic
ERGERGergative
ETCnoneet cetera 'and all'
FEMFEMfeminine
FUTFUTfuture
FUTIMPnonefuture imperative
HABnonehabitual
IMMnoneimmediate
IMNnoneimminent
IMPIMPimperative
INDFINDFindefinite
INSINSinstrumental
IOnoneindirect object
IPFVIPFVimperfective
IPSTnoneimmediate past
IRRIRRirrealis
ITERnoneiterative
LOCLOClocative
MASCMmasculine
MEDnonemedial demonstrative
NEGNEGnegation
NMLZNMLZnominalizer
NPSTnonenonpast
NSGnonenon-singular
ONLYnoneexclusive marker 'only' 'just'
PFVPFVperfective
PLPLplural
POSSPOSSpossessive
POTnonepotential
PRIVnoneprivative case
PROPnoneproprietive case
PROXPROXproximal
PSTPSTpast
PST:PST/hitOBJobject
PURPPURPpurposive
RECOGnonerecognitional pronoun
REDUPnonereduplication
RPSTnonerecept past
SBJSBJsubject
SGSGsingular
SIMILnonesimilative
STATnonestative
TEMPnonetemporal case
VENTnonevenitive