TEITOK visualization and search interface for Movima


Language nameMovimamovi1243
Language familyIsolatena
Corpus creatorHaude, Katharina
Translations providedSpanish/English
Glossesall
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Movima DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Movima DoReCo dataset in publications please cite

Haude, Katharina. 2024. Movima DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/movi1243 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Movima corpus:

GlossLGRMeaning
11first person
22second person
33third person
anoneabsential
ABnoneabsential
ABSnoneabsolute state
ABSNnoneabsent referent
ABSTRnoneabstract
AGTnoneagentive
ANTnoneanterior
APPLAPPLapplicative
APPRnoneapproaching
BDPnonebodily process
BEnonebound element
BENBENbenefactive
BRnonebound root
CAUSCAUScausative
CLnoneclassifier
CLFnoneclassifier
COnoneco participant
CONJnoneconjunction
CSLnonecausal
dDISTdistal
Dnonedummy element
DEFnonedefinite
DEMDEMdemonstrative
DETDETdeterminer
DETRnonedetransitivization
DIRnonedirectional
DRnonebivalent direct
DR2nonesecondary direct marker
DSCnonediscontinutative
DUBnonedubitative
DURDURdurative
DYNnonedynamic
elnoneelevated
EMPHnoneemphatic
EVnoneevidential
EVDnoneevidential
EXCLEXCLexclusive
fFfeminine
FRUSTnonefrustrative
FUTFUTfuture
HESITnonehesitation
HORTnonehortative
HYPnonehypothetical
IJnoneinterjection
IMnoneimmediately/impossibly
IMPIMPimperative
INALnoneinalianable possession
INCINCLinclusive
INSTRnoneinstrumental nominalization
INTnoneintensifier
INTJnoneinterjection
INTRINTRintransitive
intrINTRintransitive
INVnonebivalent inverse
IRRIRRirrealis
ITNnoneintentional
LINKnonelinking nasal
LNnonelinking nasal
LOCnonelocation
LVnonelinking vowel
mMmasculine
MDnonemiddle voice
MLTnonemultiple event
MODnonemodal
nNneuter
Nnonenoun
NEGNEGnegation
NMLZNMLZnominalizer
NMLZ.AGTnoneagent nominalizer
NMZnoneaction/state nominalization
nstnonenon-standing
ntrnoneneutral
NTRnoneneutral
OBLOBLoblique
OBVnoneobviative
pPSTpast
PHnonephasal aspect
plPLplural
PRCnoneprocess nominalization/verbalization
PROnonefree pronoun
PROHPROHprohibitive
PSEUDOnonepseudo
PSTPSTpast
RREFLreflexive
REASnonereason
REDnonereduplication
RELnonerelativizer
RESnoneresultative
rtrnoneretreating
SNSnonesensation
SPKnoneproximate to speaker
stdnonestanding
TRCnonetruncated element
VBLZnoneverbalization
VBZnoneaction verbialization