Commit Graph

53 Commits (ca97b1995173453c70125ea6694dc4eaaaf2caca)

Author SHA1 Message Date
Sun Junyi ca97b19951 merge change from master 12 years ago
Sun Junyi c0816b9bb0 more mixed words 12 years ago
Sun Junyi c9e8da9e63 add more mix words to dict.txt 12 years ago
fxsjy 08bfabb9d7 Merge branch 'jieba3k' of https://github.com/fxsjy/jieba into jieba3k 12 years ago
fxsjy be1686654d merge master to jieba3k 12 years ago
fxsjy 0087a4e7e3 adjust prob_trans for better support of name entity; fix some bad cases 12 years ago
Sun Junyi 4300f79788 add an example of using sklearn+jieba 12 years ago
Sun Junyi a8f902545c fix some bad cases 12 years ago
cloudaice 9ee20a5293 add generator test 12 years ago
cloudaice 0c050b5eb2 add jieba.posseg test case 12 years ago
cloudaice b0f9e6721e add cutall test case 12 years ago
cloudaice a7ff398edc add three test cases: cut, set_dictionary, cut_for_search 12 years ago
cloudaice 667203a9ae replace tabs with spaces; use join instead of a loop 12 years ago
cloudaice a2d2078465 replace tabs with spaces; use is to check whether an object is None 12 years ago
cloudaice e0434871eb reformat demo.py to comply with PEP 8 12 years ago
Sun Junyi c1bf815343 update test case 12 years ago
Sun Junyi 0e833cd441 fix a bug in py3k test case 12 years ago
Sun Junyi 273996f7d4 fix a test script in jieba3k 12 years ago
fxsjy aae91b6fb6 merge change from master to jieba3k 12 years ago
Sun Junyi 94d455b079 hot fix of cut_all=True 12 years ago
Sun Junyi 59d5d3b811 fix bug and change version 12 years ago
fxsjy 8666428fb0 fix a bug of changing dictionary 12 years ago
fxsjy 9bebe6120b utf-8 output is more friendly to Linux 12 years ago
Sun Junyi d3339633d5 in the speed test: initialize first to ignore the time of dict loading 12 years ago
fxsjy bc049090a5 make lazy load thread safe 12 years ago
fxsjy b46166f768 use CRLF as separator to make chunks in parallel mode 12 years ago
fxsjy 6b83593b5a rm stub.log 12 years ago
fxsjy 62cf22121f new feature: parallel segment with multiprocessing 12 years ago
Sun Junyi 6da857b554 merge changes from master branch 12 years ago
Sun Junyi 8d89e8afda handle 的 12 years ago
fxsjy 45591bb9ab support flag '_'; ignore white space 12 years ago
Sun Junyi c77823aa1d merge improvement to Py3k branch 12 years ago
Sun Junyi 94ad7e7035 support decimal point 12 years ago
Sun Junyi a383f035ba support decimal point: example PI=3.141569 = > PI / = / 3.14159 12 years ago
Sun Junyi 8e49199993 keep punctuation marks 12 years ago
Sun Junyi 58c363655c support user defined word tag 12 years ago
Sun Junyi 6cc0e95759 rm 1.log 12 years ago
Sun Junyi d2634a049b fix a bug in pypy 12 years ago
Sun Junyi 0f4f9067c3 fix bugs in jieba for py3k 12 years ago
Sun Junyi 06ebc6f71c en-chn mix words in POS 12 years ago
Sun Junyi a8ae0398b4 add one example 12 years ago
Sun Junyi 6517119110 remove 1.log 12 years ago
Sun Junyi 8c05efed68 remove tlbb.txt 12 years ago
Sun Junyi 379cd4933a support en-chn mixed words, like B超 12 years ago
Sun Junyi 9c07d80edb first py3k version of jieba 12 years ago
Sun Junyi e0bd9a6a50 version change; doc update 12 years ago
Sun Junyi 176c49d15c remove some files 12 years ago
Sun Junyi 59c3efeb2f improve speed of tagging 12 years ago
fxsjy 1a2a64a13f one more example of POS tagging 12 years ago
fxsjy 90cd4b3014 improve POS tagging 12 years ago