TEITOK visualization and search interface for Daakie


Language name: Daakie (port1286)
Language family: Austronesian (aust1307)
Corpus creator: Krifka, Manfred
Translations provided: Bislama/English
Glosses: some
Annotation file licence: Creative Commons Attribution License

This is an interface for visualizing and searching the Daakie DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using data from the Daakie DoReCo dataset in publications, please cite:

Krifka, Manfred. 2024. Daakie DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/port1286 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please also cite the following, in addition to the reference to the Daakie DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Daakie corpus:

Gloss   LGR     Meaning
1       1       first person
2       2       second person
3       3       third person
comp    COMP    complementizer
cop     COP     copula
cp      COP     copula
d       DU      dual
dem     DEM     demonstrative
detr    DET     determiner
dist    DIST    distal
dst     DIST    distal
emph    none    emphasizer
ex      EXCL    exclusive
foc     FOC     focus
fut     FUT     future
in      INCL    inclusive
ind     IND     indicative
indef   INDF    indefinite
ir      IRR     irrealis
irr     IRR     irrealis
lc      LOC     locative
loc     LOC     locative
neg     NEG     negation
nhum    none    non-human
nom     NOM     nominative
nre     IRR     nonrealis
p       PL      plural
pl      PL      plural
poss    POSS    possessive
prep    none    preposition
prog    PROG    progressive
pron    none    pronoun
prox    PROX    proximate
prx     PROX    proximate
re      none    realis
rel     REL     relative
s       SG      singular
sg      SG      singular
sp      none    (unclear)
t       none    paucal
tr      TR      transitive
trans   TR      transitive
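
For readers who want to normalize glosses after downloading the data, the table above can be expressed as a simple lookup. The following is a minimal Python sketch, not part of the DoReCo or TEITOK tooling. The names `GLOSS_TO_LGR` and `to_lgr` are hypothetical, and the assumption that composite glosses are joined with a period (as in `3.s.re`) is for illustration only and may not match the actual annotation files.

```python
# Language-specific gloss -> LGR label, transcribed from the table above.
# None marks glosses that the table lists with no LGR equivalent ("none").
GLOSS_TO_LGR = {
    "1": "1", "2": "2", "3": "3",
    "comp": "COMP", "cop": "COP", "cp": "COP", "d": "DU",
    "dem": "DEM", "detr": "DET", "dist": "DIST", "dst": "DIST",
    "emph": None, "ex": "EXCL", "foc": "FOC", "fut": "FUT",
    "in": "INCL", "ind": "IND", "indef": "INDF",
    "ir": "IRR", "irr": "IRR", "lc": "LOC", "loc": "LOC",
    "neg": "NEG", "nhum": None, "nom": "NOM", "nre": "IRR",
    "p": "PL", "pl": "PL", "poss": "POSS", "prep": None,
    "prog": "PROG", "pron": None, "prox": "PROX", "prx": "PROX",
    "re": None, "rel": "REL", "s": "SG", "sg": "SG",
    "sp": None, "t": None, "tr": "TR", "trans": "TR",
}


def to_lgr(gloss, separator="."):
    """Map each element of a composite gloss to its LGR label.

    Elements with no LGR equivalent, or absent from the table
    (for example lexical glosses), are returned unchanged.
    The separator is an assumption, not a documented convention.
    """
    parts = gloss.split(separator)
    return separator.join(GLOSS_TO_LGR.get(part) or part for part in parts)


if __name__ == "__main__":
    # Hypothetical composite glosses, used only to exercise the lookup.
    for example in ("3.s.re", "1.d.ex", "walk"):
        print(example, "->", to_lgr(example))
```

Glosses without an LGR equivalent are deliberately passed through rather than dropped, so no annotation information is lost in the conversion.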