TEITOK visualization and search interface for Totoli


Language name: Totoli (toto1304)
Language family: Austronesian (aust1307)
Corpus creator: Bardají, Maria and Bracks, Christoph and Leto, Claudia and Hasan, Datra and Riesberg, Sonja and Alamudi, Winarno S. and Himmelmann, Nikolaus P.
Translations provided: English
Glosses: all
Annotation file licence: CC BY-NC-SA

This is an interface for visualizing and searching the Totoli DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using data from the Totoli DoReCo dataset in publications, please cite:

Bardají, Maria and Bracks, Christoph and Leto, Claudia and Hasan, Datra and Riesberg, Sonja and Alamudi, Winarno S. and Himmelmann, Nikolaus P. 2024. Totoli DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/toto1304 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite the following in addition to the reference to the Totoli DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Totoli corpus:

Gloss     LGR             Meaning
1pe       1 ; PL ; EXCL   first person plural exclusive
1pi       1 ; PL ; INCL   first person plural inclusive
1s        1 ; SG          first person singular
2         2               second person
2p        2 ; PL          second person plural
2s        2 ; SG          second person singular
3p        3 ; PL          third person plural
3s        3 ; SG          third person singular
a         (none)          /a/-like filled pause/interjection
ACT       (none)          actor
ADIST     DIST            distal (adverb)
AH        (none)          /a/-like filled pause/interjection
AMED      (none)          medial (adverb)
AND       (none)          andative
APPL1     APPL            applicative 1
APPL2     APPL            applicative 2
APRX      (none)          approximal (adverbial)
AUTO.MOT  (none)          autonomous motion
AV        (none)          actor voice
CAU       CAUS            causative
CGE       (none)          Commission for General Elections
COLL      (none)          collective
CPL       COMPL           completive
DEM       DEM             demonstrative
DIST      DIST            distal (deictic)
e         (none)          /e/-like filled pause/interjection
EH        (none)          /e/-like filled pause/interjection
EMPH      (none)          emphatic
EXIST     (none)          existential quantifier
FILL      (none)          filler
GEN       GEN             genitive
GER       (none)          gerundive
HON       (none)          honorific
IMP       IMP             imperative
INCH      (none)          inchoative
INCPL     (none)          incompletive
INTJ      (none)          interjection
INVOL     (none)          involuntary action
ITJ       (none)          interjection
L.NR      (none)          locative nominalizer
LK        (none)          linker
LOC       LOC             locative
LV        (none)          locative voice
MED       (none)          medial (deictic)
NEG       NEG             negation
NR        NMLZ            nominalizer
o         (none)          /o/-like interjection
O.K       (none)          English OK
OK        (none)          English OK
ONE       (none)          one
ORD       (none)          ordinal number
PART      (none)          particle
PN        (none)          proper name
POSS      POSS            possessor
POT       (none)          potentive
PRX       PROX            proximative (deictic)
Q         Q               question word
QUOT      QUOT            quotative
RCP       RCP             reciprocal
RCP1      RCP             reciprocal 1
RDP~      (none)          reduplication
RDP1~     (none)          reduplication 1
RDP2~     (none)          reduplication 2
RDP3~     (none)          reduplication 3
RDP4~     (none)          reduplication 4
REL       REL             relative
RLS       (none)          realis
RQV       (none)          requestative
RSTR      (none)          restrictive
SF        (none)          stem formant
ST        (none)          stative
STR.SIM   DISTR ; (none)  distributive+simultaneous
ty        (none)          multiple of 10 (i.e. -ty as in "twenty")
UV1       (none)          undergoer voice 1
UV2       (none)          undergoer voice 2
VEN       (none)          venitive