Commit Graph

523 Commits (master)
 

Author SHA1 Message Date
Sun Junyi cf2aa88122 Merge pull request #195 from gumblex/master
统一获取关键词接口,优化缓存命名
10 years ago
Dingyuan Wang 751ff35eb5 improve extract_tags; unify extract_tags and testrank 10 years ago
Dingyuan Wang e3f3dcccba improve the loading and caching process 10 years ago
Sun Junyi 4cb1924d09 Merge pull request #193 from gumblex/jieba3k
jieba3k 对应更新 #192
10 years ago
Sun Junyi d6ef07a472 Merge pull request #192 from gumblex/master
更新、完善说明;命令行加入自定义词典功能
10 years ago
Dingyuan Wang fd9f1f2c0e update README, textrank, etc. 10 years ago
Dingyuan Wang 9d2818b440 fix English part of README 10 years ago
Dingyuan Wang 31b7d11809 improve README 10 years ago
Dingyuan Wang a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank 10 years ago
Sun Junyi 0049b0c5b4 Merge pull request #191 from sing1ee/master
add some introduction of textrank
10 years ago
zhangcheng 138d713e98 add some introduction of textrank 10 years ago
Sun Junyi 4030d8ed86 Merge pull request #190 from sing1ee/master
add a simple implementation of textrank
10 years ago
zhangcheng 6eb9f6149c add a simple implementation of textrank 10 years ago
Sun Junyi 1850bd6d37 Update README.md 10 years ago
fxsjy f5ca87e088 merge change of @fukuball 10 years ago
Sun Junyi 10b86e90fb Update README.md 10 years ago
fxsjy ba87fcb01f remove trie, use prefix set instead 10 years ago
fxsjy 82bfffb6ed version update to 0.34 10 years ago
Sun Junyi 56e8336af1 Merge pull request #188 from gumblex/jieba3k
不用Trie,同#187
10 years ago
Sun Junyi 4a93f21918 Merge pull request #187 from gumblex/master
不用Trie,减少内存加快速度;优化代码细节
10 years ago
Dingyuan Wang bb1e6000c6 fix version; fix spaces at end of line 10 years ago
Dingyuan Wang 14671d4feb fix __main__.py 10 years ago
Dingyuan Wang b367690eeb use prefix dict instead of trie, add a command line interface, and a few small improvements 10 years ago
Dingyuan Wang 51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 10 years ago
fxsjy eb98eb9248 fix performance problem of extrag_tags 10 years ago
Sun Junyi 7f965e0aa3 Merge pull request #184 from keroro520/master
fix issues 125 (https://github.com/fxsjy/jieba/issues/125)
11 years ago
keroro520 77b442fa88 fix issues (https://github.com/fxsjy/jieba/issues/125) 11 years ago
Sun Junyi 8f52419386 Merge pull request #183 from gumblex/jieba3k
Jieba3k update to v0.33
11 years ago
Dingyuan Wang 626b415152 fix dict.itervalues mistake 11 years ago
Dingyuan Wang 6a3f228c72 fix python3 stuff 11 years ago
Dingyuan Wang b16cf0d63f fix indent typo 11 years ago
Dingyuan Wang 6fad5fbb2c update to v0.33 11 years ago
Sun Junyi fc511de012 Merge pull request #176 from fukuball/master
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
11 years ago
Sun Junyi 99ea59e88d Update README.md 11 years ago
fxsjy 6eb43acc10 pip install jieba3k 11 years ago
fxsjy 40adb1c591 version 0.33 11 years ago
Fukuball Lin d432789cb4 fix typo 11 years ago
Fukuball Lin cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白
將 Readme 中文和半形的英文、數字、符號之間插入空白,增加可讀性
11 years ago
Fukuball Lin e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
11 years ago
Sun Junyi 16d626d347 Merge pull request #174 from fukuball/master
讓 jieba 可以切換 idf 語料庫及 stop words 語料庫
11 years ago
Fukuball Lin b658ee69cb 讓 jieba 可以自行增加 stop words 語料庫
1. 增加範例 stop words 語料庫
2. 為了讓 jieba 可以切換 stop words 語料庫,新增 set_stop_words 方法,並改寫 extract_tags
3. test 增加 extract_tags_stop_words.py 測試範例
11 years ago
Fukuball Lin 7198d562f1 讓 jieba 可以切換 idf 語料庫
1. 新增繁體中文 idf 語料庫
2. 為了讓 jieba 可以切換 iff 語料庫,新增 get_idf, set_idf_path 方法,並改寫 extract_tags
3. test 增加 extract_tags_idfpath
11 years ago
Sun Junyi 91e5b26f5f Merge pull request #165 from gumblex/jieba3k
fix the u'xxx' string.
11 years ago
Dingyuan Wang 8b07bce568 fix the u'xxx' string. 11 years ago
Sun Junyi 0d99ebce54 Merge pull request #164 from gumblex/jieba3k
Jieba3k v0.32 update
11 years ago
Dingyuan Wang c04ccd0d12 Update to v0.32 according to the master branch. 11 years ago
Dingyuan Wang 81f77d7a08 Fix the re in enable_parallel. 11 years ago
Sun Junyi 473ac1df75 Merge pull request #162 from ShuraChow/master
fix issue #161
11 years ago
ShuraChow 7583f7760a fix issue #161
posseg每次根据jieba.user_word_tag_tab的长度判断是否有新词载入,如果有,则更新word_tag_tab,然后清空jieba.user_word_tag_tab
11 years ago
Sun Junyi 2726a7c89b Merge pull request #158 from davidlihm/patch-1
Thanks
11 years ago