TS Corpus Tools

Web-based lexical and linguistic tools for Turkish language processing.

TS Tokenizer

TS Tokenizer is a hybrid (lexicon-based and rule-based) tokenizer designed specifically for tokenizing Turkish texts.

Open
TS POS Tagger

TS POS Tagger is a Turkish Part-of-Speech tagger — a custom neural model trained from scratch on TS Corpus data.
It operates as a hybrid pipeline, combining rule-based token classification from TS-Tokenizer with data-driven POS inference from a spaCy-based neural tagger.

Open
Concordancer

Keyword-in-context search and collocation explorer for Turkish corpora.

Coming Soon