TEITOK visualization and search interface for Cabécar


Language name: Cabécar (Glottocode cabe1245)
Language family: Chibchan (Glottocode chib1249)
Corpus creator: Quesada, Juan Diego and Skopeteas, Stavros and Pasamonik, Carolina and Brokmann, Carolin and Fischer, Florian
Translations provided: English
Glosses: all
Annotation file licence: CC BY-NC

This is an interface for visualizing and searching the Cabécar DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using data from the Cabécar DoReCo dataset in publications, please cite:

Quesada, Juan Diego and Skopeteas, Stavros and Pasamonik, Carolina and Brokmann, Carolin and Fischer, Florian. 2024. Cabécar DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/cabe1245 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please also cite the following, in addition to the reference to the Cabécar DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

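Below is the list of language-specific glosses used in the Cabécar corpus. The LGR column gives the corresponding Leipzig Glossing Rules abbreviation, or "none" where the gloss has no equivalent there.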

Gloss    LGR    Meaning
1        1      first person
2        2      second person
3        3      third person
ABS      ABS    absolutive
AG       none   agent
ANAPH    none   anaphor
ANIM     none   animate
ANIM     none   animate classifier
ANT      none   anterior
CFL      CLF    classifier
CMPL     none   completive
COND     COND   conditional
DE       none   (unclear)
DEM      DEM    demonstrative
DEONT    none   deontic
DIM      none   diminutive
DIR      none   directional
DUB      none   dubitative
EMPH     none   emphatic
ERG      ERG    ergative
EXCL     EXCL   exclusive
FIN      none   purposive
FLAT     none   flat classifier
GER      none   gerund
HAB      none   habitual
IDEOPH   none   ideophone
IDPH     none   ideophone
IMP      IMP    imperative
INC      none   (unclear)
INCL     INCL   inclusive
INCMP    none   incompletive
INER     none   (unclear)
INGR     none   ingressive
INT      none   interrogative
INTR     INTR   intransitive
ITER     none   iterative
LONG     none   long classifier
M        none   (unclear)
MID      none   middle voice
NEG      NEG    negation
PART     none   (unclear)
PCNT     none   (unclear)
PFV      PFV    perfective
PL       PL     plural
PNCT     none   punctual
POS      none   (unclear)
POSS     POSS   possessive
PROG     PROG   progressive
PRPS     none   purpose
PRSP     none   prospective aspect
PUNCT    none   punctual
PVF      PFV    perfective
RECP     RECP   reciprocal
REFL     REFL   reflexive
ROUND    none   round classifier
SET      none   set classifier
SG       SG     singular
STR      none   (unclear)
SUB      none   subordinator
SUD      none   (unclear)
VOL      none   volitive mood
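
The table lists several corpus glosses that differ in spelling from their LGR label (for example CFL for CLF and PVF for PFV). When counting glosses across documents, it may help to normalize these first. The following Python sketch shows one possible way to do that. It is not part of TEITOK or DoReCo: the function name, the subset of the table copied into the dictionary, and the assumption that gloss labels appear as runs of capitals or digits separated by hyphens and spaces are all illustrative, and the sample gloss line is invented rather than taken from the corpus.

```python
import re

# Illustrative subset of the gloss table: corpus gloss -> LGR label.
# None stands for "none" in the LGR column (no LGR equivalent listed).
GLOSS_TO_LGR = {
    "1": "1", "2": "2", "3": "3",
    "ABS": "ABS", "ERG": "ERG",
    "CFL": "CLF",   # corpus spelling differs from the LGR label
    "PFV": "PFV",
    "PVF": "PFV",   # corpus spelling differs from the LGR label
    "PL": "PL", "SG": "SG",
    "AG": None, "HAB": None,
}

# Assumption: a gloss label is a whole token made of capitals and digits.
LABEL = re.compile(r"\b[A-Z0-9]+\b")


def normalize_gloss_line(gloss_line: str) -> str:
    """Replace each gloss label with its LGR label where the table gives one.

    Labels without an LGR equivalent, and labels missing from the
    dictionary, are left unchanged.
    """
    def repl(match: re.Match) -> str:
        label = match.group(0)
        return GLOSS_TO_LGR.get(label) or label

    return LABEL.sub(repl, gloss_line)


if __name__ == "__main__":
    # Invented example line with English placeholder stems, not corpus data.
    print(normalize_gloss_line("dog-PVF CFL-house go-HAB"))
```

Leaving unknown labels untouched is a deliberate choice here: glosses marked "(unclear)" in the table are passed through as they are instead of being guessed at.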