High-Performance Open-Source Archive
label_wrap_jp() and
label_wrap_jp_gen() for Japanese word wrapping in ggplot2
labellers.label_date_jp() and
label_date_jp_gen() for Japanese calendar date labels in
ggplot2.strj_parse_date() to parse Japanese calendar date
strings into POSIXct values.mecab and sudachipy engines and
related arguments from strj_tokenize().global_idf3.bind_tf_idf2.
norm=TRUE. Cosine nomalization is
now performed on tf_idf values as in the RMeCab
package.tf="itf" and idf="df" options.pack for performance.tokenize_mecab and
tokenize_sudachipy.bind_lr function which can calculate the ‘LR’
value of bigrams.pack now always returns a tibble, not a
data.frame.bind_tf_idf2 can calculate and bind the term frequency,
inverse document frequency, and tf-idf of the tidy text dataset.collapse_tokens, mute_tokens, and
lexical_density can be used for handling a tidy text
dataset of tokens.strj_tokenize now preserves the original order of text
names.prettify now can get delim argument.strj_fill_iter_mark function.
strj_fill_iter_mark now replaces a sequence of
iteration marks recursively.strj_tokenize function.
strj_tokenize now can retrieve engine
argument to switch tokenizers for splitting text into tokens.ngram_tokenizer function.pack function.
pack function.
pack now accepts pull as its second argument
and n as its third argument.pull now can accept a symbol.NEWS.md file to track changes to the
package.
Need mirroring services?
Contact our team at info@vpspulse.com.
Mirror powered by VPSpulse
Infrastructure sponsored by VPSPulse & Secure Payments by ArionPay.