got
Gothic 𐌲𐌿𐍄𐌹𐍃𐌺
Words in vocabulary: 10,445
Best compression: 2.88x
Best isotropy: 0.1831
Sample text
Excerpts from Gothic Wikipedia articles.
𐌺𐌰𐌽𐌰𐌳𐌰 𐌹𐍃𐍄 𐌻𐌰𐌽𐌳 𐌰𐌽𐌰 𐌰𐌹𐍂𐌸𐌰𐌳𐌰𐌹𐌻𐌰𐌹 𐌽𐌰𐌿𐍂𐌸𐌰𐌼𐌰𐌹𐍂𐌹𐌺𐌰 𐌾𐌰𐌷 𐌲𐌰𐌼𐌰𐍂𐌺𐍉𐌸 𐌲𐌰𐌲𐌰𐌷𐌰𐍆𐍄𐌹𐌳𐌰 𐍂𐌴𐌹𐌺𐌾𐌰𐌹. ...
𐌰𐍀𐌻𐍃 – 𐌰𐌺𐍂𐌰𐌽 𐌰𐍀𐌻𐌰𐌱𐌰𐌲𐌼𐌴 𐌾𐌰𐌷 𐍅𐌰𐌹𐌻𐌰𐌺𐌿𐌽𐌸𐌰 𐍆𐍉𐌳𐌴𐌹𐌽𐍃 𐌹𐍃𐍄·
𐌺𐌰𐌿𐌻𐌿𐌼𐌱𐌾𐌰 (Colombia) 𐌹𐍃𐍄 𐌻𐌰𐌽𐌳 𐌹𐌽 𐍃𐌿𐌽𐌸𐍂𐌰𐌰𐌼𐌰𐌹𐍂𐌹𐌺𐌰𐌹. 𐌰𐌼𐌴𐍂𐌹𐌺𐌰 ...
Most common words
The 20 most frequently used words in Gothic Wikipedia.
Interactive playground
Explore Gothic interactively with browser-based demos.
Performance dashboard
Key metrics for all model types at a glance.
Quick start

Tokenizer

```python
from wikilangs import tokenizer

tok = tokenizer('latest', 'got', 32000)
tokens = tok.tokenize("Your text here")
```

N-gram

```python
from wikilangs import ngram

ng = ngram('latest', 'got', gram_size=3)
score = ng.score("Your text here")
```

Markov chain

```python
from wikilangs import markov

mc = markov('latest', 'got', depth=3)
text = mc.generate(length=50)
```

Vocabulary

```python
from wikilangs import vocabulary

vocab = vocabulary('latest', 'got')
info = vocab.lookup("word")
```

Embeddings

```python
from wikilangs import embeddings

emb = embeddings('latest', 'got', dimension=64)
vec = emb.embed_word("word")
```

Available models
| Model Type | Variants | Description |
|---|---|---|
| Tokenizers | 8k, 16k, 32k, 64k | BPE tokenizers with different vocabulary sizes |
| N-gram (Word) | 2, 3, 4, 5-gram | Word-level language models |
| N-gram (Subword) | 2, 3, 4, 5-gram | Subword-level language models |
| Markov (Word) | Depth 1–5 | Word-level text generation |
| Markov (Subword) | Depth 1–5 | Subword-level text generation |
| Vocabulary | — | Word dictionary with frequency and IDF |
| Embeddings | 32d, 64d, 128d | Position-aware word embeddings |
Model evaluation
Tokenizer performance
Compression ratios and token statistics across vocabulary sizes.
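The compression ratio reported here is characters of raw text per emitted token (higher is better). A minimal sketch of that computation, using a naive whitespace tokenizer as a stand-in for the real BPE models and a romanized sample string rather than the actual corpus:

```python
# Stand-in tokenizer: whitespace split instead of a trained BPE model.
def toy_tokenize(text):
    return text.split()

def compression_ratio(text, tokenize):
    """Characters of input per token produced (higher = better compression)."""
    tokens = tokenize(text)
    return len(text) / max(len(tokens), 1)

sample = "kanada ist land ana airthadailai naurthamairika"
ratio = compression_ratio(sample, toy_tokenize)  # 47 characters over 6 tokens
```

A real BPE tokenizer would split into subword units rather than words, so the ratio for the released models reflects subword granularity, not whitespace tokens.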

N-gram evaluation
Perplexity and entropy metrics across n-gram sizes.
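Perplexity and entropy are two views of the same quantity: perplexity equals 2 raised to the entropy in bits. A hedged sketch of the computation from per-token probabilities; the probabilities below are invented for illustration and do not come from the released models:

```python
import math

def perplexity(token_probs):
    # exp of the mean negative log-probability per token
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

def entropy_bits(token_probs):
    # mean surprisal in bits; perplexity == 2 ** entropy_bits
    return -sum(math.log2(p) for p in token_probs) / len(token_probs)

probs = [0.25, 0.1, 0.5, 0.05]  # made-up next-token probabilities
pp = perplexity(probs)
```

Lower perplexity means the n-gram model is less surprised by held-out text, so the metric falls as gram size rises until data sparsity sets in.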

Markov chain evaluation
Entropy and branching factor by context depth.
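Both metrics can be read off a context table: the branching factor is the number of distinct continuations seen after a context, and the entropy is the uncertainty of that continuation distribution. A minimal depth-1 sketch over a toy romanized corpus (not the wikilangs training data):

```python
import math
from collections import Counter, defaultdict

corpus = "jah ist jah land jah ist".split()  # toy corpus

# Count next-word frequencies for each one-word context.
follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def entropy_bits(counter):
    total = sum(counter.values())
    return -sum((c / total) * math.log2(c / total) for c in counter.values())

branching = {ctx: len(nexts) for ctx, nexts in follows.items()}
entropies = {ctx: entropy_bits(nexts) for ctx, nexts in follows.items()}
```

Deeper contexts make continuations more predictable, which is why entropy and branching factor generally drop as depth grows.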

Vocabulary analysis
Word frequency distribution and Zipf's law analysis.
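Zipf's law predicts frequency proportional to 1/rank, i.e. a slope near -1 when log frequency is plotted against log rank. A sketch of that fit, with made-up counts standing in for the real vocabulary:

```python
import math

# Made-up frequency counts, sorted descending (roughly 1000 / rank).
freqs = [1000, 480, 320, 250, 200, 160, 140, 125, 110, 100]

xs = [math.log(rank) for rank in range((1), len(freqs) + 1)]
ys = [math.log(f) for f in freqs]
n = len(xs)
mx, my = sum(xs) / n, sum(ys) / n

# Least-squares slope of log(frequency) against log(rank).
slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
```

A fitted slope close to -1 indicates a Zipfian distribution; strong deviations usually show up at the very head and tail of the rank list.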


Embeddings evaluation
Isotropy and vector space quality metrics.
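One common isotropy measure (in the style of Mu et al., 2018) compares the partition function Z(u) = sum over words w of exp(u · w) across unit directions u: a min/max ratio near 1 means the embedding mass is spread evenly in all directions. A toy sketch with random vectors rather than the released Gothic embeddings, and with random unit directions approximating the eigenvector-based original:

```python
import math
import random

random.seed(0)
dim, n_words = 8, 50
# Random stand-ins for word vectors.
words = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_words)]

def unit(v):
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def z(u):
    # Partition function along direction u.
    return sum(math.exp(sum(a * b for a, b in zip(u, w))) for w in words)

directions = [unit([random.gauss(0, 1) for _ in range(dim)]) for _ in range(20)]
zs = [z(u) for u in directions]
isotropy = min(zs) / max(zs)  # in (0, 1]; higher = more isotropic
```

Scores well below 1, like the 0.1831 reported above, indicate the vectors cluster in a narrow cone, which is typical of small-corpus embeddings.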

Full research report
Access the complete ablation study with all metrics, visualizations, and generated text samples on HuggingFace.
View on HuggingFace →