Commit Graph

10 Commits (7653db2e33abfcc2150430e7932e991bad6b5b4a)

Author SHA1 Message Date
Dingyuan Wang ceb5c26be4 fix self.FREQ in cut_for_search; make pair object iterable 10 years ago
Dingyuan Wang 94840a734c wraps most globals in classes
API changes:
* class jieba.Tokenizer, jieba.posseg.POSTokenizer
* class jieba.analyse.TFIDF, jieba.analyse.TextRank
* global functions are mapped to jieba.(posseg.)dt, the default (POS)Tokenizer
* multiprocessing only works with jieba.(posseg.)dt
* new lcut, lcut_for_search functions that returns a list
* jieba.analyse.textrank now returns 20 items by default

Tests:
* added test_lock.py to test multithread locking
* demo.py now contains most of the examples in README
10 years ago
Dingyuan Wang 22bcf8be7a Merge master and jieba3k, make the code Python 2/3 compatible 10 years ago
Dingyuan Wang 51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 11 years ago
Dingyuan Wang 6fad5fbb2c update to v0.33 11 years ago
cloudaice e0434871eb 修改demo.py的代码格式,使得符合pep8规范 12 years ago
Sun Junyi 9c07d80edb first py3k version of jieba 12 years ago
Sun Junyi e0bd9a6a50 version chage; doc update 12 years ago
Sun Junyi 15a5a2d50e add a sample script about tags extraction 13 years ago
fxsjy 64b3c0d0e0 add one more example 13 years ago