TEITOK visualization and search interface for Jejuan


Language name: Jejuan (jeju1234)
Language family: Koreanic (kore1284)
Corpus creator: Kim, Soung-U
Translations provided: Korean/English
Glosses: all
Annotation file licence: Creative Commons Attribution License

This is an interface for visualizing and searching the Jejuan DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using data from the Jejuan DoReCo dataset in publications, please cite:

Kim, Soung-U. 2024. Jejuan DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/jeju1234 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite the following in addition to the Jejuan DoReCo dataset reference:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Jejuan corpus, each with its Leipzig Glossing Rules (LGR) equivalent where one exists ("none" otherwise):

Gloss    LGR     Meaning
1        1       first person
2        2       second person
ABIL     none    ability
ACC      ACC     accusative
ADD      none    additive
ADN      none    adnominal
ADVL     none    adverbializer
ASSOC    none    associative
AUX      AUX     auxiliary
CAUS     CAUS    causative
CG       none    common ground suffix
CHNG     none    change
CLF      CLF     classifier
CND      COND    conditional
CNT      none    content
COM      COM     comitative
COMP     COMP    complementizer
COP      COP     copula
CVB      CVB     converb
DAT      DAT     dative
DECL     DECL    declarative
DELIM    none    delimiting
DEM      DEM     demonstrative
DIR      none    directional
DIST     DIST    distal
EACH     none    'each' particle
EGO      none    egophoric
EMPH     none    emphasis
EP       none    epenthetic
EPN      none    epenthetic
EV       none    evidential
EXIST    none    existential copula
FOC      FOC     focus
GEN      GEN     genitive
HON      none    honorific
HORT     none    hortative
ILLOC    none    illocutionary force
IMP      IMP     imperative
IND      IND     indicative
INSTR    INS     instrumental
INTENT   none    intentional
INTF     none    interfix
IPF      IPFV    imperfective
IRR      IRR     irrealis
LOC      LOC     locative
MED      none    medial
MIR      none    mirative
MOD      none    modal
NEG      NEG     negation
NMLZ     NMLZ    nominalizer
NOM      NOM     nominative
NPST     none    nonpast
ORD      none    ordinal
PASS     PASS    passive
PERF     PFV     perfective
PF       PFV     perfective
PL       PL      plural
PLR      none    polar
POL      none    politeness marker
POT      none    potential
PROG     PROG    progressive
PROX     PROX    proximal
PRS      PRS     present
PST      PST     past
PURP     PURP    purposive
Q        none    question
QUOT     QUOT    quotative
REAL     none    realis
REDUP    none    reduplication
REP      none    repetition
RS       none    reason
SEQ      none    sequential
SG       SG      singular
SIM      none    simultaneous
SRC      none    source
STN      none    stance
TOP      TOP     topic
TRSP     none    (unclear)
VOC      VOC     vocative
VOL      none    volitional
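
For users who want to compare Jejuan glosses with other corpora, the list above can be read as a lookup from corpus-specific labels to their LGR equivalents. The following is a minimal sketch of that idea, not part of the DoReCo or TEITOK tooling. The names `GLOSS_TO_LGR` and `normalize_gloss` are hypothetical, the dictionary holds only a small excerpt of the list, and the hyphen-separated gloss string is an assumed input format that may differ from the actual annotation files.

```python
# Hypothetical excerpt of the gloss list: corpus gloss -> LGR equivalent.
# None marks glosses listed with "none" in the LGR column.
GLOSS_TO_LGR = {
    "CND": "COND",
    "INSTR": "INS",
    "IPF": "IPFV",
    "PERF": "PFV",
    "PF": "PFV",
    "ACC": "ACC",
    "PST": "PST",
    "EGO": None,
    "MIR": None,
}


def normalize_gloss(gloss_line, sep="-"):
    """Replace each known label in a separator-delimited gloss string
    with its LGR form; unknown labels and labels without an LGR
    equivalent are kept unchanged."""
    normalized = []
    for part in gloss_line.split(sep):
        lgr = GLOSS_TO_LGR.get(part)
        normalized.append(lgr if lgr is not None else part)
    return sep.join(normalized)


if __name__ == "__main__":
    # Assumed, invented gloss string for illustration only.
    print(normalize_gloss("eat-PERF-CND"))
```

Labels with no LGR equivalent are deliberately left as they are, so no information from the original annotation is lost.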