Commit Graph

106 Commits (8212b6c5725d08311952a3a08e5509eeaee33eb7)

Author SHA1 Message Date
Sun Junyi 8212b6c572
Update README.md
Sun Junyi 7653db2e33
Update README.md
OOC b485ae916c Update README.md
OOC ee0ce32bbd Update
Yanyi Wu 76ae798137 Update README.md
Dingyuan Wang 87734d3785 support POS tagging in __main__
Dingyuan Wang 99d0fb1a8a use regex and fix encoding related issues in load_userdict
andersc 343bfe9783 Add introduction to jieba.NET port.
Dingyuan Wang d0e68974bf improved doc for tmp_dir and cache_file
Dingyuan Wang ceb5c26be4 fix self.FREQ in cut_for_search; make pair object iterable
Dingyuan Wang 3b76328f2a allow ignoring word frequency while providing pos tag
Dingyuan Wang 94840a734c wraps most globals in classes
API changes:
* class jieba.Tokenizer, jieba.posseg.POSTokenizer
* class jieba.analyse.TFIDF, jieba.analyse.TextRank
* global functions are mapped to jieba.(posseg.)dt, the default (POS)Tokenizer
* multiprocessing only works with jieba.(posseg.)dt
* new lcut, lcut_for_search functions that returns a list
* jieba.analyse.textrank now returns 20 items by default

Tests:
* added test_lock.py to test multithread locking
* demo.py now contains most of the examples in README
Fukuball Lin e712a4de61 更新 README
增加结巴分词 PHP 版本相關資訊
Dingyuan Wang eeaab012bf update docs
Dingyuan Wang 4fa2728fb6 update README about new features
Dingyuan Wang f29430f49e details in textrank; update README
Yanyi Wu 5d321cbccd Update README.md
Dingyuan Wang 22bcf8be7a Merge master and jieba3k, make the code Python 2/3 compatible
Dingyuan Wang 4197dfb8fa store int directly in FREQ; small improvements
yanyiwu 1d91072498 add iosjieba
Dingyuan Wang 7a6caa0c3c port extract_tags, etc to jieba3k; add auto2to3 script
Qin Wenfeng 77a831b8c1 Add jiebaR in README
Dingyuan Wang 751ff35eb5 improve extract_tags; unify extract_tags and testrank
Dingyuan Wang fd9f1f2c0e update README, textrank, etc.
Dingyuan Wang 9d2818b440 fix English part of README
Dingyuan Wang 31b7d11809 improve README
Dingyuan Wang a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank
Sun Junyi 0049b0c5b4 Merge pull request from sing1ee/master
add some introduction of textrank
zhangcheng 138d713e98 add some introduction of textrank
Sun Junyi 1850bd6d37 Update README.md
fxsjy f5ca87e088 merge change of @fukuball
Sun Junyi 10b86e90fb Update README.md
Dingyuan Wang 6fad5fbb2c update to v0.33
Sun Junyi fc511de012 Merge pull request from fukuball/master
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
Sun Junyi 99ea59e88d Update README.md
Fukuball Lin d432789cb4 fix typo
Fukuball Lin cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白
將 Readme 中文和半形的英文、數字、符號之間插入空白,增加可讀性
Fukuball Lin e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
Dingyuan Wang c04ccd0d12 Update to v0.32 according to the master branch.
Sun Junyi 28621e8b00 Update README.md
Yanyi Wu ccfa54530e add nodejieba into README.md
add nodejieba into README.md
Sun Junyi 6a66620088 Update README.md
Sun Junyi 00bc72c877 Update README.md
ZoeyYoung dce353f88b merge from master
Sun Junyi 92c6c3d9cd Update README.md
Sun Junyi 0bb2ddcc1b Update README.md
Sun Junyi e1c1d46324 Update README.md
Sun Junyi 915b3164b0 Update README.md
Sun Junyi 45e6594a09 Update README.md
Sun Junyi 465e475460 Update README.md