结巴分词:做最好的Python中文分词。
此次release包含以下更新:
1. 新增分词控制选项:可以关闭新词发现功能;详见:https://github.com/fxsjy/jieba/blob/master/test/test_no_hmm.py#L8
2. 修复词性标注子模块的Bug;详见: https://github.com/fxsjy/jieba/issues/111 https://github.com/fxsjy/jieba/issues/132
3. ChineseAnalyzer提供了更好的英文支持(感谢@jannson),例如单词Stemming; 详见:https://github.com/fxsjy/jieba/pull/106
项目主页:https://github.com/fxsjy/jieba
来自:开源中国社区

