An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
Ultima versiune de pe oct. 07, 2016汉语言处理包
Ultima versiune de pe nullA Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Ultima versiune de pe dec. 14, 2016HanLP: Han Language Processing
Ultima versiune de pe dec. 27, 2020A Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Ultima versiune de pe dec. 14, 2016