Commit Graph

453 Commits (3ec4c43788777aed57d413bffb4d21e904ebf5be)
 

Author SHA1 Message Date
Dingyuan Wang bb1e6000c6 fix version; fix spaces at end of line 11 years ago
Dingyuan Wang 14671d4feb fix __main__.py 11 years ago
Dingyuan Wang b367690eeb use prefix dict instead of trie, add a command line interface, and a few small improvements 11 years ago
Dingyuan Wang 51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 11 years ago
fxsjy eb98eb9248 fix performance problem of extrag_tags 11 years ago
Sun Junyi 7f965e0aa3 Merge pull request #184 from keroro520/master
fix issues 125 (https://github.com/fxsjy/jieba/issues/125)
11 years ago
keroro520 77b442fa88 fix issues (https://github.com/fxsjy/jieba/issues/125) 11 years ago
Sun Junyi 8f52419386 Merge pull request #183 from gumblex/jieba3k
Jieba3k update to v0.33
11 years ago
Dingyuan Wang 626b415152 fix dict.itervalues mistake 11 years ago
Dingyuan Wang 6a3f228c72 fix python3 stuff 11 years ago
Dingyuan Wang b16cf0d63f fix indent typo 11 years ago
Dingyuan Wang 6fad5fbb2c update to v0.33 11 years ago
Sun Junyi fc511de012 Merge pull request #176 from fukuball/master
Update the documentation on switching jieba's idf corpus and stop words corpus
11 years ago
Sun Junyi 99ea59e88d Update README.md 11 years ago
fxsjy 6eb43acc10 pip install jieba3k 11 years ago
fxsjy 40adb1c591 version 0.33 11 years ago
Fukuball Lin d432789cb4 fix typo 11 years ago
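Fukuball Lin cf31a99bf6 Insert spaces between Chinese text and half-width English letters, digits, and symbols in the Readme
Insert spaces between Chinese text and half-width English letters, digits, and symbols in the Readme to improve readability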
11 years ago
Fukuball Lin e4d323c78b Update the documentation on switching jieba's idf corpus and stop words corpus
Update the documentation on switching jieba's idf corpus and stop words corpus
11 years ago
Sun Junyi 16d626d347 Merge pull request #174 from fukuball/master
Allow jieba to switch the idf corpus and stop words corpus
11 years ago
Fukuball Lin b658ee69cb Allow jieba to use a custom stop words corpus
1. Add a sample stop words corpus
2. To let jieba switch the stop words corpus, add a set_stop_words method and rewrite extract_tags
3. Add the extract_tags_stop_words.py test example to test
11 years ago
Fukuball Lin 7198d562f1 Allow jieba to switch the idf corpus
1. Add a Traditional Chinese idf corpus
2. To let jieba switch the idf corpus, add get_idf and set_idf_path methods and rewrite extract_tags
3. Add extract_tags_idfpath to test
11 years ago
Sun Junyi 91e5b26f5f Merge pull request #165 from gumblex/jieba3k
fix the u'xxx' string.
11 years ago
Dingyuan Wang 8b07bce568 fix the u'xxx' string. 11 years ago
Sun Junyi 0d99ebce54 Merge pull request #164 from gumblex/jieba3k
Jieba3k v0.32 update
11 years ago
Dingyuan Wang c04ccd0d12 Update to v0.32 according to the master branch. 11 years ago
Dingyuan Wang 81f77d7a08 Fix the re in enable_parallel. 11 years ago
Sun Junyi 473ac1df75 Merge pull request #162 from ShuraChow/master
fix issue #161
11 years ago
ShuraChow 7583f7760a fix issue #161
On each call, posseg checks the length of jieba.user_word_tag_tab to determine whether new words have been loaded; if so, it updates word_tag_tab and then clears jieba.user_word_tag_tab
11 years ago
Sun Junyi 2726a7c89b Merge pull request #158 from davidlihm/patch-1
Thanks
11 years ago
davidlihm 5b2ec920ed Update __init__.py 11 years ago
Sun Junyi 5574304a9e Merge pull request #152 from jagt/jieba3k
close cache file to avoid warning message.
11 years ago
jagt 7f3513edb7 close cache file to avoid warning message. 11 years ago
Sun Junyi 28621e8b00 Update README.md 11 years ago
Sun Junyi 1f144ebf55 Merge pull request #141 from windch/jieba3k
use logging instead of print in __init__ file of py3k branch
11 years ago
wind 7488b114e7 use logging instead of print in init file 11 years ago
fxsjy 2682e887b8 Merge branch 'master' of https://github.com/fxsjy/jieba 11 years ago
fxsjy 9d4ac26f16 fix the bug of issue#137 11 years ago
Sun Junyi 6942795fae Merge pull request #135 from aszxqw/patch-1
add nodejieba into README.md
11 years ago
Yanyi Wu ccfa54530e add nodejieba into README.md
add nodejieba into README.md
11 years ago
Sun Junyi 3e430e9769 Update __init__.py 11 years ago
Sun Junyi 6946b00f14 Merge pull request #134 from Honghe/master
Fix a bug about can not import ChineseAnalyzer
11 years ago
Honghe Wu 7720fbc1d8 fix a bug about can not import ChineseAnalyzer with change tab to 4 white spaces under PEP8 11 years ago
fxsjy cc708de40c version 0.32 released 11 years ago
fxsjy dafc73425e fix a little problem of dict.txt 11 years ago
fxsjy 7cc7e70843 Merge branch 'master' of https://github.com/fxsjy/jieba 11 years ago
fxsjy 18678d50c6 fix bug issue #132 11 years ago
Sun Junyi 62240c5add Merge pull request #131 from aholic/master
better indent
11 years ago
aholic e2c796088f better indent 11 years ago
fxsjy 5e6a2c4661 fix a bug of add_word 11 years ago