oscar_icu
latest version: 1.0Description: Thai unigram word frequency from OSCAR Corpus (icu word tokenize)
Long Description: Thai unigram word frequency from OSCAR Corpus (icu word tokenize)
HomePage: https://web.facebook.com/groups/colab.thailand/permalink/1524070061101680/?_rdc=1&_rdr
Authors: Korakot Chaovavanich
Download and Use
Download
from pythainlp.corpus import download
download('oscar_icu')
Use
It's get path file of corpus.
from pythainlp.corpus import get_corpus_path
get_corpus_path('oscar_icu')
If get_corpus_path('oscar_icu')
is None
then you have not downloaded oscar_icu
.
Release history
1.0
File Name: oscar_word_freq.csv
md5: -
PyThaiNLP version: >=2.2
Link Download: oscar_word_freq.csv