TEITOK visualization and search interface for Fanbyak


Language nameFanbyakorko1234
Language familyAustronesianaust1307
Corpus creatorFranjieh, Michael
Translations providedEnglish
Glossessome
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Fanbyak DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Fanbyak DoReCo dataset in publications please cite

Franjieh, Michael. 2024. Fanbyak DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/orko1234 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Fanbyak corpus:

GlossLGRMeaning
11first person
22second person
33third person
ABSABSabsolutive case
ANAnoneanaphor
ASPECTnoneaspect
ASSnoneassociative
AVEnone(unclear)
CL1noneclassifier/possesive class
CL2noneclassifier/possesive class
CL3noneclassifier/possesive class
CONJnoneconjunction
CONTnone(unclear)
COPCOPcopula
CSTnone(unclear)
CVnone(unclear)
DEMDEMdemonstrative
DISTDISTdistal
DLnonedual
EXnoneexclusive
FOCFOCfocus
FUTFUTfuture
GENGENgenitive case
INnoneinclusive
INCEPTnoneinceptive
INDINDindicative
INTJnoneinterjection
IRRIRRirrealis
LOCLOClocative case
MEDnonemedial
NEGNEGnegation
NEG1NEGnegation
NEG2NEGnegation
NMLZNMLZnominalizer
NRECnonenon-recent
PARTnonepartitive
PCnonepaucal
PERFPRFperfect
pinoneplural inclusive
PLPLplural
POSSPOSSpossesive
POTnonepotentive
PROXPROXproximate
PSTPSTpast
RECnonerecent
RELRELrelative
SGSGsingular
SPECnonespecificity marker
SUBnonesubordinate marker
TOPTOPtopic
TRTRtransitive