TEITOK visualization and search interface for Goemai


Language nameGoemaigoem1240
Language familyAfro-Asiaticafro1255
Corpus creatorHellwig, Birgit
Translations providedEnglish
Glossesall
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Goemai DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Goemai DoReCo dataset in publications please cite

Hellwig, Birgit. 2024. Goemai DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/goem1240 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Goemai corpus:

GlossLGRMeaning
11first person
22second person
33third person
ADVZnoneadverbializer
ANTnoneanterior
ASSOCnoneassociative
BENBENbenefactive
COMCOMcomitative
COMITCOMcomitative
COMPCOMPcomplementizer
CONDCONDconditional
DEFDEFdefinite
DEMDEMdemonstrative
DEM.DISTnonedemonstrative distal
DIMnonediminutive
DIRnonedirection/vicinity of
EMPHnoneemphasis
FOCFOCfocus
FUT.CLnoneclose future
GENGENgenitive
HABnonehabitual
HOW/WHEREnonemanner/locative nominalizer
Inoneindependent pronoun
INTERRnoneinterrogative
IRRIRRirrealis
LOCLOClocative
LOC.ANAPHnonelocative anaphor
LogAnonelogophoric addressee
LogSnonelogophoric speaker
MMmasculine
NEGNEGnegation
NOMZNMLZnominalizer
OOobject pronoun
OBLOBLoblique
ORDnoneordinal number
PAST.CLnoneclose past
PAST.HESTnoneyesterday past
PAST.REMnoneremote past
PERMnonepermissive
PLPLplural
POSSPOSSpossessive
PRESnonepresentative
PROGRPROGprogressive
PROHPROHprohibitive
PROXPROXproximal
PURnonepurposive
REFLREFLreflexive
RESULTRESresultative
Snoneintransitive subject
SAYnonereported speech
SAYnonereported speech
SEQnonesequential
SGSGsingular
SPECnonespecific-indefinite article
SUBnonesubordination
WHEREnonemanner/locative nominalizer