jieba-rs
🚀 Help me to become a full-time open-source developer by sponsoring me on GitHub
The Jieba Chinese Word Segmentation Implemented in Rust
Installation
Add it to your Cargo.toml
:
[]
= "0.7"
then you are good to go. If you are using Rust 2015 you have to extern crate jieba_rs
to your crate root as well.
Example
use Jieba;
Enabling Additional Features
default-dict
feature enables embedded dictionary, this features is enabled by defaulttfidf
feature enables TF-IDF keywords extractortextrank
feature enables TextRank keywords extractor
[]
= { = "0.7", = ["tfidf", "textrank"] }
Run benchmark
Benchmark: Compare with cppjieba
jieba-rs
bindings
@node-rs/jieba
NodeJS bindingjieba-php
PHP bindingrjieba-py
Python bindingcang-jie
Chinese tokenizer for tantivytantivy-jieba
An adapter that bridges between tantivy and jieba-rsjieba-wasm
the WebAssembly binding
License
This work is released under the MIT license. A copy of the license is provided in the LICENSE file.