TEITOK visualization and search interface for Vera'a


Language nameVera'avera1241
Language familyAustronesianaust1307
Corpus creatorSchnell, Stefan
Translations providedEnglish
Glossesall
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Vera'a DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Vera'a DoReCo dataset in publications please cite

Schnell, Stefan. 2024. Vera'a DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/vera1241 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Vera'a corpus:

GlossLGRMeaning
11first person
22second person
33third person
TLnonetrial
Anonetransitive subject
ABILnoneability
ABLABLablative
ADNnoneadnominal
ARTARTarticle
ASSnoneassociative
BEDnonepossessive classifier for bed possession
CARDnonecardinal numeral
CATnonecataphoric
CCnoneclause combining particle
CLCLFclassifier
COMCOMcomitative
CPLCOMPcomplementizer
CPLnonecomplementizer
CSnoneconstruct suffix
DATDATdative
DEICTnonedeictic
DELnonedelimitative
DEMDEMdemonstrative
DIRnonedirectional
DISnonedissociative possessive suffix
DISCnonediscourse particle
DLDUdual
DOMnonepossessive classifier for domestic possession
DRINKnonepossessive classifier for drink possession
EATnonepossessive classifier for eating possession
EMPHnoneemphatic
EMPHnoneemphatic particle
EVnoneepenthetic vowel
EXEXCLexclusive
FUTFUTfuture
GENGENgenitive
HESnonehesitation
HOUSEnonepossessive classifier for house possession
IMMnoneimmediacy
ININCLinclusive
INABIL1noneinability
INTENSnoneintensifier
INTERJnoneinterjection
INTERJnoneinterjection
LOCLOClocative
MANnonemanner adverb
MULTnonemultiplicative
NCnonenot considered
NEGNEGnegation
NMLZNMLZnominalization
NSGnonenonsingular
NUMnonenumeral prefix
NYnonenot yet' negation
OBLOBLoblique
ORDnoneordinal quantifier
PARTnonepartitive article
PERSnonepersonal
PFVPFVperfective
PLPLplural
POSSPOSSpossessive
PROnonepronoun
PROHnoneprohibitive
PROSPnoneprospective marker
PROXPROXproximal
PURPnonepurpose
QUOTQUOTquotative
RCPRECPreciprocal
RECnonereciprocal prefix
REDnonereduplication
RELnonerelativizer
REMnoneremote past
SGSGsingular
SIMnonesimultaneity
SPnonespecific
STATnonestative
TAMnonetense aspect mood
TAM1nonetense aspect mood 1
TEMPnonetemporial adverb
THINGnoneplaceholder word
VESnonepossessive classifier for vessel possession
VOCVOCvocative
ZEROnonezero