Commit Graph

113 Commits (1e20c89b66f56c9301b0feed211733ffaa1bd72a)

Author SHA1 Message Date
fxsjy eb37e048da update version to 0.41 5 years ago
JesseyXujin 381b0691ac Add enable_paddle interface to install paddle and import packages (#802)
* enable_paddle_interface

* Add enable_paddle interface to install paddle and import packages

* Add enable_paddle interface to install paddle and import packages

* add posseg lcut for paddle mode

* fix vocabulary
5 years ago
JesseyXujin 0b74b6c2de add jieba upgrade not in README.md and change import imp to import importlib in _compat.py (#794) 5 years ago
Sun Junyi 2fdee89883
Update README.md 5 years ago
Sun Junyi 9dc8e6d992
Update README.md 5 years ago
JesseyXujin 5b3bb4b7f2 加入paddle分词和词性标注功能 (#788)
* paddle cut release

* 修改README.md,提示用户安装paddlepaddle.tiny

* 删除两个init.py文件中utf头文件

* 修改readme细节
5 years ago
Paul Meng 3645a5bb5d Update README.md (#745) 6 years ago
Sun Junyi 8212b6c572
Update README.md 6 years ago
Sun Junyi 7653db2e33
Update README.md 7 years ago
OOC b485ae916c Update README.md 8 years ago
OOC ee0ce32bbd Update 8 years ago
Yanyi Wu 76ae798137 Update README.md 9 years ago
Dingyuan Wang 87734d3785 support POS tagging in __main__ 9 years ago
Dingyuan Wang 99d0fb1a8a use regex and fix encoding related issues in load_userdict 9 years ago
andersc 343bfe9783 Add introduction to jieba.NET port. 10 years ago
Dingyuan Wang d0e68974bf improved doc for tmp_dir and cache_file 10 years ago
Dingyuan Wang ceb5c26be4 fix self.FREQ in cut_for_search; make pair object iterable 10 years ago
Dingyuan Wang 3b76328f2a allow ignoring word frequency while providing pos tag 10 years ago
Dingyuan Wang 94840a734c wraps most globals in classes
API changes:
* class jieba.Tokenizer, jieba.posseg.POSTokenizer
* class jieba.analyse.TFIDF, jieba.analyse.TextRank
* global functions are mapped to jieba.(posseg.)dt, the default (POS)Tokenizer
* multiprocessing only works with jieba.(posseg.)dt
* new lcut, lcut_for_search functions that returns a list
* jieba.analyse.textrank now returns 20 items by default

Tests:
* added test_lock.py to test multithread locking
* demo.py now contains most of the examples in README
10 years ago
Fukuball Lin e712a4de61 更新 README
增加结巴分词 PHP 版本相關資訊
10 years ago
Dingyuan Wang eeaab012bf update docs 10 years ago
Dingyuan Wang 4fa2728fb6 update README about new features 10 years ago
Dingyuan Wang f29430f49e details in textrank; update README 10 years ago
Yanyi Wu 5d321cbccd Update README.md 10 years ago
Dingyuan Wang 22bcf8be7a Merge master and jieba3k, make the code Python 2/3 compatible 10 years ago
Dingyuan Wang 4197dfb8fa store int directly in FREQ; small improvements 10 years ago
yanyiwu 1d91072498 add iosjieba 10 years ago
Dingyuan Wang 7a6caa0c3c port extract_tags, etc to jieba3k; add auto2to3 script 10 years ago
Qin Wenfeng 77a831b8c1 Add jiebaR in README 10 years ago
Dingyuan Wang 751ff35eb5 improve extract_tags; unify extract_tags and testrank 10 years ago
Dingyuan Wang fd9f1f2c0e update README, textrank, etc. 10 years ago
Dingyuan Wang 9d2818b440 fix English part of README 10 years ago
Dingyuan Wang 31b7d11809 improve README 10 years ago
Dingyuan Wang a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank 10 years ago
Sun Junyi 0049b0c5b4 Merge pull request #191 from sing1ee/master
add some introduction of textrank
10 years ago
zhangcheng 138d713e98 add some introduction of textrank 10 years ago
Sun Junyi 1850bd6d37 Update README.md 10 years ago
fxsjy f5ca87e088 merge change of @fukuball 10 years ago
Sun Junyi 10b86e90fb Update README.md 10 years ago
Dingyuan Wang 6fad5fbb2c update to v0.33 11 years ago
Sun Junyi fc511de012 Merge pull request #176 from fukuball/master
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
11 years ago
Sun Junyi 99ea59e88d Update README.md 11 years ago
Fukuball Lin d432789cb4 fix typo 11 years ago
Fukuball Lin cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白
將 Readme 中文和半形的英文、數字、符號之間插入空白,增加可讀性
11 years ago
Fukuball Lin e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
11 years ago
Dingyuan Wang c04ccd0d12 Update to v0.32 according to the master branch. 11 years ago
Sun Junyi 28621e8b00 Update README.md 11 years ago
Yanyi Wu ccfa54530e add nodejieba into README.md
add nodejieba into README.md
11 years ago
Sun Junyi 6a66620088 Update README.md 12 years ago
Sun Junyi 00bc72c877 Update README.md 12 years ago