TEITOK visualization and search interface for Tabasaran


Language nameTabasarantaba1259
Language familyNakh-Daghestaniannakh1245
Corpus creatorBogomolova, Natalia and Ganenkov, Dmitry and Schiborr, Nils Norman
Translations providedEnglish
Glossesall
Annotation file licenceCreative Commons Attribution License

This is an interface for visualizing and searching the Tabasaran DoReCo dataset. For more information about this dataset, including metadata, consult the DoReCo dataset page, where you can also download the data. Use the links in the left-side menu to search through this dataset, or to access individual documents for visualization.

When using actual data from the Tabasaran DoReCo dataset in publications please cite

Bogomolova, Natalia and Ganenkov, Dmitry and Schiborr, Nils Norman. 2024. Tabasaran DoReCo dataset. In Seifart, Frank, Ludger Paschen and Matthew Stave (eds.). Language Documentation Reference Corpus (DoReCo) 2.0. Lyon: Laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2). https://doreco.huma-num.fr/languages/taba1259 (Accessed on 23/01/2026). DOI:10.34847/nkl.6eaf5laq

When using results obtained from DoReCo's TEITOK version in publications, such as frequency counts obtained through the TEITOK search function, please cite — in addition to the reference to the Bora DoReCo dataset:

Janssen, Maarten & Frank Seifart. 2025. Searchable Language Documentation Corpora: DoReCo meets TEITOK. In: Éric Le Ferrand, Elena Klyachko, Anna Postnikova, Tatiana Shavrina, Oleg Serikov, Ekaterina Voloshina & Ekaterina Vylomova (eds.), Proceedings of the Fourth Workshop on NLP Applications to Field Linguistics, 58–64. Vienna, Austria: Association for Computational Linguistics. https://aclanthology.org/2025.fieldmatters-1.5/.

Gloss Abbreviations

Below is the list of language-specific glosses used in the Tabasaran corpus:

GlossLGRMeaning
11first person
22second person
33third person
ABSABSabsolutive
ABSTRnoneabstract
ADDnoneadditive
ADVADVadverbial
AGnoneagent
AORnoneaorist
APUDnone"spatial case ‘by
ATTRnoneattributive
CITnoneverb k’ur ‘say.FUT’ used as a quotative
COMCOMcomitative
COMPnonecomparative particle
CONDCONDconditional
CONTnonecontinuative
CONTRnonecontrastive
COPCOPcopula
DATDATdative
DEFDEFdefinite
DIRnonespatial case ‘to’
DISTDISTdistal
DOWNnoneprefixal marker in demonstrative pronoun ‘downwards’
ELATnoneelative
EMPHnoneemphasis
ERGERGergative
EXCLEXCLexclusive
FOCFOCfocal particle
FUTFUTfuture
GENGENgenitive
HSGnonehuman singular
ICVBnoneimperfective converb
IMPIMPimperative
INnonespatial case ‘in’
INCLINCLinclusive
INDEFINDFindefinite
INFINFinfinitive
INTERnonespatial case ‘between’
INTERJnoneinterjection
IPFVIPFVimperfective
JUSSnonejussive
LATnonelative
LOCLOClocative
MSDnonemasdar
NEGNEGnegative
NMLZNMLZnominalization
NSGnoneneuter singular
ORDnoneordinal number
PATnonepatient
PCVBnoneperfective converb
PFVPFVperfective
PLPLplural
POSSPOSSpossessive
POSTnone"spatial case ‘behind
PROHPROHprohibitive
PROXPROXproximal
PRSPRSpresent
PRTnoneparticle
PSTPSTpast
PTCPPTCPparticiple
QQquestion particle
REFLREFLreflexive
RESRESresultative
SGSGsingular
SUBnonespatial case ‘under’
SUPERnonespatial case ‘on’
TEMPnonetemporal marker ‘when’
VOCVOCvocative