oscar_icu

latest version: 1.0


Description: Thai unigram word frequencies from the OSCAR corpus (tokenized with ICU word tokenization)

Long Description: Thai unigram word frequencies from the OSCAR corpus (tokenized with ICU word tokenization)

HomePage: https://web.facebook.com/groups/colab.thailand/permalink/1524070061101680/?_rdc=1&_rdr

Authors: Korakot Chaovavanich


Download and Use

Download

from pythainlp.corpus import download
download('oscar_icu')

Use

This returns the file path of the corpus.

from pythainlp.corpus import get_corpus_path
get_corpus_path('oscar_icu')

If get_corpus_path('oscar_icu') returns None, you have not downloaded oscar_icu yet.
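
Once the path is available, the file can be read like any other CSV. The following is a minimal sketch, not an official PyThaiNLP API: load_word_freqs and corpus_word_freqs are hypothetical helper names, and the file layout (a header row followed by word,count rows) is an assumption that you should check against your downloaded copy of oscar_word_freq.csv.

```python
import csv
from collections import Counter


def load_word_freqs(path, has_header=True):
    """Read a two-column word,count CSV into a Counter.

    Hypothetical helper. The word,count layout and the header row
    are assumptions about the file, not documented guarantees.
    """
    freqs = Counter()
    # utf-8-sig also accepts files that start with a byte order mark
    with open(path, "r", encoding="utf-8-sig", newline="") as f:
        reader = csv.reader(f)
        if has_header:
            next(reader, None)  # skip the assumed header row
        for row in reader:
            if len(row) < 2:
                continue  # ignore blank or malformed rows
            try:
                count = int(row[1])
            except ValueError:
                continue  # ignore rows whose count is not an integer
            freqs[row[0]] += count
    return freqs


def corpus_word_freqs():
    """Load oscar_icu through PyThaiNLP (hypothetical wrapper)."""
    # Imported here so load_word_freqs works without PyThaiNLP installed
    from pythainlp.corpus import get_corpus_path

    path = get_corpus_path("oscar_icu")
    if path is None:
        raise FileNotFoundError(
            "oscar_icu is not downloaded; run download('oscar_icu') first"
        )
    return load_word_freqs(path)
```

Calling corpus_word_freqs().most_common(10) would then list the ten most frequent tokens, provided the corpus has been downloaded and the assumed layout holds.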


Release history

1.0

File Name: oscar_word_freq.csv

md5: -

PyThaiNLP version: >=2.2

Download link: oscar_word_freq.csv