List Corpus & Models
List of PyThaiNLP corpora and models.
blackboard_pt_tagger
part-of-speech tagging (perceptron) from blackboar ...
blackboard_unigram_tagger
part-of-speech tagging (unigram) from blackboard t ...
g2p
Thai grapheme to phoneme
lst20-cls
lst20-cls (LST20)
ltw2v
LTW2V: The Large Thai Word2Vec
ltw2v_v1.0_15_window
LTW2V: The Large Thai Word2Vec v1.0 (15 window)
ltw2v_v1.0_5_window
LTW2V: The Large Thai Word2Vec v1.0 (5 window)
onnx_lst20ner
LST20 NER model
oscar_icu
Thai unigram word frequency from OSCAR Corpus (icu ...
pos_lst20_perceptron
Perceptron POS tagger (LST20)
pos_lst20_unigram
Unigram POS tagger (LST20)
scb_1m_en-th_moses
SCB_1M+TBASE_en-th_moses-spm.
scb_1m_en-th_spm
scb_1m_en-th_spm
scb_1m_th-en_newmm
SCB_1M+TBASE_th-en_newmm-moses.
scb_1m_th-en_spm
SCB_1M+TBASE_th-en_spm-spm.
scb_en_th
scb_1m_en-th_spm
scb_th_en
scb_1m_en-th_spm
test_zip
It's a test file.
thai-g2p
Thai grapheme to phoneme (PyTorch)
thai2fit_wv
thai2vec word embeddings
thai2rom-dataset
Thai romanization model
thai2rom-pytorch
Thai romanization model (LSTM)
thai2rom-pytorch-attn
Thai romanization model (LSTM-Attention)
thai_dict
This dataset is collected from Thai Wiktionary.
thai_nner
Thai Nested Named Entity Recognition
thai_synonym
Synonyms for Thai (open source & open data)
thai_w2p
Thai Word-to-Phoneme (W2P) converter
thainer
Thai Named Entity Recognition
thainer-1.4
Thai Named Entity Recognition 1.4 for PyThaiNLP 2. ...
tnc_bigram_word_freqs
Bigram word frequency from Thai National Corpus (T ...
tnc_trigram_word_freqs
Trigram word frequency from Thai National Corpus ( ...
wiki_itos_lstm
ULMFit index to text for LSTM
wiki_lm_lstm
Wikipedia-pretrained ULMFit language model for LST ...
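
The listing above alternates between an entry name and a one-line description. As a minimal sketch, assuming that two-line layout holds for every entry, the text can be paired into a name-to-description mapping. The helper names `parse_listing` and `truncated_entries` are hypothetical and not part of PyThaiNLP; the sample uses three entries copied from the list.

```python
from typing import Dict, List


def parse_listing(lines: List[str]) -> Dict[str, str]:
    """Pair alternating name/description lines into a {name: description} dict."""
    # Drop blank lines and surrounding whitespace before pairing.
    cleaned = [line.strip() for line in lines if line.strip()]
    if len(cleaned) % 2 != 0:
        raise ValueError("expected an even number of non-empty lines")
    # Even positions hold names, odd positions hold descriptions.
    return dict(zip(cleaned[0::2], cleaned[1::2]))


def truncated_entries(catalog: Dict[str, str]) -> List[str]:
    """Return the names whose description was cut off (ends with '...')."""
    return [name for name, desc in catalog.items() if desc.endswith("...")]


# Three entries copied from the listing (hypothetical helper usage).
sample = [
    "g2p",
    "Thai grapheme to phoneme",
    "oscar_icu",
    "Thai unigram word frequency from OSCAR Corpus (icu ...",
    "thainer",
    "Thai Named Entity Recognition",
]

catalog = parse_listing(sample)
print(catalog["g2p"])
print(truncated_entries(catalog))
```

Entry names such as `thainer` are the identifiers one would normally pass to PyThaiNLP's corpus utilities (for example `pythainlp.corpus.download`). That call is deliberately left out of the sketch because it needs the package installed and network access.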