75 Commits (ebd40ed65e09ec918786603fff53fc173c79e96f)

Author SHA1 Message Date
ZoeyYoung d49542c06e fix bug 12 years ago
ZoeyYoung dce353f88b merge from master 12 years ago
ZoeyYoung 2857ae45cc Merge branch 'master' into jieba3k
Conflicts:
	Changelog
	jieba/__init__.py
	jieba/finalseg/__init__.py
	jieba/posseg/__init__.py
	setup.py
	test/parallel/test_file.py
	test/test_file.py
12 years ago
fxsjy b77645b3aa modify test_file.py; use less memory 12 years ago
Linker Lin 5d83855088 Automatically detect the number of CPUs and start a matching number of processes. 12 years ago
Linker Lin 2ceb981da0 Automatically detect the number of CPUs and start a matching number of processes. 12 years ago
Sun Junyi 6549deabbd merge change from master 12 years ago
Cheng wei 6035bb6320 fix invalid syntax for python3 12 years ago
Sun Junyi 9d0ea771a5 fix bug; decimals & digit-english mixed 12 years ago
Sun Junyi ba5114dc95 update whoosh example 12 years ago
Sun Junyi f424862222 clean the files in tmp 12 years ago
Sun Junyi b18d56d2a3 Merge pull request #72 from linkerlin/master
Add a tmp directory so that test_whoosh.py can run.
12 years ago
Sun Junyi b9b1f1a418 fix conflict of merging 12 years ago
miao.lin becd32b178 made test_whoosh.py happy.
Add a tmp directory so that test_whoosh.py can run.
12 years ago
Sun Junyi c01680c6a8 merge the new file 12 years ago
Sun Junyi b62f052927 PEP8 12 years ago
Sun Junyi 45daf561c7 follow PEP8: change tab to 4 white spaces 12 years ago
Sun Junyi dbec3ad9df add some comments 12 years ago
Sun Junyi efc784312c add ChineseAnalyzer for whoosh search engine 12 years ago
Sun Junyi f08690a2df add 'search mode' for jieba.tokenize 12 years ago
Sun Junyi cb1b0499f7 unittest for jieba.tokenize 12 years ago
Sun Junyi 11a3b10755 new method: jieba.tokenize 12 years ago
Sun Junyi ca97b19951 merge change from master 12 years ago
Sun Junyi c0816b9bb0 more mixed words 12 years ago
Sun Junyi c9e8da9e63 add more mix words to dict.txt 12 years ago
fxsjy 08bfabb9d7 Merge branch 'jieba3k' of https://github.com/fxsjy/jieba into jieba3k 12 years ago
fxsjy be1686654d merge master to jieba3k 12 years ago
fxsjy 0087a4e7e3 adjust prob_trans for better support of name entity; fix some bad cases 12 years ago
Sun Junyi 4300f79788 add an example of using sklearn+jieba 12 years ago
Sun Junyi a8f902545c fix some bad cases 12 years ago
cloudaice 9ee20a5293 add generator test 12 years ago
cloudaice 0c050b5eb2 add jieba.posseg test case 12 years ago
cloudaice b0f9e6721e add cutall test case 12 years ago
cloudaice a7ff398edc add three test cases: cut, set_dictionary, cut_for_search 12 years ago
cloudaice 667203a9ae replace tabs with spaces; use join instead of a loop 12 years ago
cloudaice a2d2078465 replace tabs with spaces; use is to check whether an object is None 12 years ago
cloudaice e0434871eb reformat the code in demo.py to conform to PEP8 12 years ago
Sun Junyi c1bf815343 update test case 12 years ago
Sun Junyi 0e833cd441 fix a bug in py3k test case 12 years ago
Sun Junyi 273996f7d4 fix a test script in jieba3k 12 years ago
fxsjy aae91b6fb6 merge change from master to jieba3k 12 years ago
Sun Junyi 94d455b079 hot fix of cut_all=True 12 years ago
Sun Junyi 59d5d3b811 fix bug and change version 12 years ago
fxsjy 8666428fb0 fix a bug of changing dictionary 12 years ago
fxsjy 9bebe6120b utf-8 output is more friendly to Linux 12 years ago
Sun Junyi d3339633d5 in the speed test: initialize first to ignore the time of dict loading 12 years ago
fxsjy bc049090a5 make lazy load thread safe 12 years ago
fxsjy b46166f768 use CRLF as separator to make chunks in parallel mode 12 years ago
fxsjy 6b83593b5a rm stub.log 12 years ago
fxsjy 62cf22121f new feature: parallel segment with multiprocessing 12 years ago