Commit Graph

417 Commits (f3a53dd2da58d55db687eb3429e6d5939ad518a1)
 

Author SHA1 Message Date
Dingyuan Wang 9d2818b440 fix English part of README 11 years ago
Dingyuan Wang 31b7d11809 improve README 11 years ago
Dingyuan Wang a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank 11 years ago
Sun Junyi 0049b0c5b4 Merge pull request #191 from sing1ee/master
add some introduction of textrank
11 years ago
zhangcheng 138d713e98 add some introduction of textrank 11 years ago
Sun Junyi 4030d8ed86 Merge pull request #190 from sing1ee/master
add a simple implementation of textrank
11 years ago
zhangcheng 6eb9f6149c add a simple implementation of textrank 11 years ago
Sun Junyi 1850bd6d37 Update README.md 11 years ago
fxsjy f5ca87e088 merge change of @fukuball 11 years ago
Sun Junyi 10b86e90fb Update README.md 11 years ago
fxsjy ba87fcb01f remove trie, use prefix set instead 11 years ago
fxsjy 82bfffb6ed version update to 0.34 11 years ago
Sun Junyi 56e8336af1 Merge pull request #188 from gumblex/jieba3k
不用Trie,同#187
11 years ago
Sun Junyi 4a93f21918 Merge pull request #187 from gumblex/master
不用Trie,减少内存加快速度;优化代码细节
11 years ago
Dingyuan Wang bb1e6000c6 fix version; fix spaces at end of line 11 years ago
Dingyuan Wang 14671d4feb fix __main__.py 11 years ago
Dingyuan Wang b367690eeb use prefix dict instead of trie, add a command line interface, and a few small improvements 11 years ago
Dingyuan Wang 51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 11 years ago
fxsjy eb98eb9248 fix performance problem of extrag_tags 11 years ago
Sun Junyi 7f965e0aa3 Merge pull request #184 from keroro520/master
fix issues 125 (https://github.com/fxsjy/jieba/issues/125)
11 years ago
keroro520 77b442fa88 fix issues (https://github.com/fxsjy/jieba/issues/125) 11 years ago
Sun Junyi 8f52419386 Merge pull request #183 from gumblex/jieba3k
Jieba3k update to v0.33
11 years ago
Dingyuan Wang 626b415152 fix dict.itervalues mistake 11 years ago
Dingyuan Wang 6a3f228c72 fix python3 stuff 11 years ago
Dingyuan Wang b16cf0d63f fix indent typo 11 years ago
Dingyuan Wang 6fad5fbb2c update to v0.33 11 years ago
Sun Junyi fc511de012 Merge pull request #176 from fukuball/master
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
11 years ago
Sun Junyi 99ea59e88d Update README.md 11 years ago
fxsjy 6eb43acc10 pip install jieba3k 11 years ago
fxsjy 40adb1c591 version 0.33 11 years ago
Fukuball Lin d432789cb4 fix typo 11 years ago
Fukuball Lin cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白
將 Readme 中文和半形的英文、數字、符號之間插入空白,增加可讀性
11 years ago
Fukuball Lin e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
11 years ago
Sun Junyi 16d626d347 Merge pull request #174 from fukuball/master
讓 jieba 可以切換 idf 語料庫及 stop words 語料庫
11 years ago
Fukuball Lin b658ee69cb 讓 jieba 可以自行增加 stop words 語料庫
1. 增加範例 stop words 語料庫
2. 為了讓 jieba 可以切換 stop words 語料庫,新增 set_stop_words 方法,並改寫 extract_tags
3. test 增加 extract_tags_stop_words.py 測試範例
11 years ago
Fukuball Lin 7198d562f1 讓 jieba 可以切換 idf 語料庫
1. 新增繁體中文 idf 語料庫
2. 為了讓 jieba 可以切換 iff 語料庫,新增 get_idf, set_idf_path 方法,並改寫 extract_tags
3. test 增加 extract_tags_idfpath
11 years ago
Sun Junyi 91e5b26f5f Merge pull request #165 from gumblex/jieba3k
fix the u'xxx' string.
11 years ago
Dingyuan Wang 8b07bce568 fix the u'xxx' string. 11 years ago
Sun Junyi 0d99ebce54 Merge pull request #164 from gumblex/jieba3k
Jieba3k v0.32 update
11 years ago
Dingyuan Wang c04ccd0d12 Update to v0.32 according to the master branch. 11 years ago
Dingyuan Wang 81f77d7a08 Fix the re in enable_parallel. 11 years ago
Sun Junyi 473ac1df75 Merge pull request #162 from ShuraChow/master
fix issue #161
11 years ago
ShuraChow 7583f7760a fix issue #161
posseg每次根据jieba.user_word_tag_tab的长度判断是否有新词载入,如果有,则更新word_tag_tab,然后清空jieba.user_word_tag_tab
11 years ago
Sun Junyi 2726a7c89b Merge pull request #158 from davidlihm/patch-1
Thanks
11 years ago
davidlihm 5b2ec920ed Update __init__.py 11 years ago
Sun Junyi 5574304a9e Merge pull request #152 from jagt/jieba3k
close cache file to avoid warning message.
11 years ago
jagt 7f3513edb7 close cache file to avoid warning message. 11 years ago
Sun Junyi 28621e8b00 Update README.md 11 years ago
Sun Junyi 1f144ebf55 Merge pull request #141 from windch/jieba3k
use logging instead of print in __init__ file of py3k branch
11 years ago
wind 7488b114e7 use logging instead of print in init file 11 years ago