com.hankcs.hanlp.tokenizer
Class NLPTokenizer
java.lang.Object
com.hankcs.hanlp.tokenizer.NLPTokenizer
public class NLPTokenizer
- extends Object
A tokenizer intended for natural language processing tasks.
- Author:
- hankcs
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
SEGMENT
public static final Segment SEGMENT
- The preconfigured segmenter
NLPTokenizer
public NLPTokenizer()
segment
public static List<Term> segment(String text)
segment
public static List<Term> segment(char[] text)
- Segments the text into terms.
- Parameters:
text - the text to segment
- Returns:
- the segmentation result
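A minimal sketch of how the list returned by `segment` might be consumed. So that it compiles without HanLP on the classpath, it defines a local stand-in `Term` class (an assumption: one word plus a part-of-speech tag held as a `String`) and a hypothetical helper `render`. The sample terms are hand-written for illustration and are not actual tokenizer output.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class SegmentSketch {
    // Stand-in for the library's Term type (assumption: a word and its tag).
    public static class Term {
        public final String word;
        public final String nature;

        public Term(String word, String nature) {
            this.word = word;
            this.nature = nature;
        }
    }

    // Hypothetical helper: renders terms as "word/nature", separated by spaces.
    public static String render(List<Term> terms) {
        return terms.stream()
                .map(t -> t.word + "/" + t.nature)
                .collect(Collectors.joining(" "));
    }

    public static void main(String[] args) {
        // With HanLP available, the list would instead come from:
        //   List<Term> terms = NLPTokenizer.segment(text);
        List<Term> terms = Arrays.asList(
                new Term("我", "r"),
                new Term("爱", "v"),
                new Term("北京", "ns"));
        System.out.println(render(terms));
    }
}
```

The `char[]` overload would be called the same way, for example with `text.toCharArray()`.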
seg2sentence
public static List<List<Term>> seg2sentence(String text)
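- Segments the text and groups the resulting terms by sentence.
- Parameters:
text - the text to segment
- Returns:
- a list of sentences, each a list of terms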
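The nested list returned by `seg2sentence` can be walked one sentence at a time. The sketch below assumes a local stand-in `Term` class (hypothetical, holding only a word and a tag) and a hypothetical helper `joinSentences` that rebuilds each sentence's text from its terms. The input is hand-built rather than produced by the tokenizer.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SentenceSketch {
    // Stand-in for the library's Term type (assumption: a word and its tag).
    public static class Term {
        public final String word;
        public final String nature;

        public Term(String word, String nature) {
            this.word = word;
            this.nature = nature;
        }
    }

    // Hypothetical helper: concatenates the words of each sentence back into a string.
    public static List<String> joinSentences(List<List<Term>> sentences) {
        List<String> result = new ArrayList<>();
        for (List<Term> sentence : sentences) {
            StringBuilder sb = new StringBuilder();
            for (Term term : sentence) {
                sb.append(term.word);
            }
            result.add(sb.toString());
        }
        return result;
    }

    public static void main(String[] args) {
        // With HanLP available, the nested list would instead come from:
        //   List<List<Term>> sentences = NLPTokenizer.seg2sentence(text);
        List<List<Term>> sentences = Arrays.asList(
                Arrays.asList(new Term("你好", "l"), new Term("。", "w")),
                Arrays.asList(new Term("再见", "v"), new Term("。", "w")));
        for (String s : joinSentences(sentences)) {
            System.out.println(s);
        }
    }
}
```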
Copyright © 2014–2015 码农场. All rights reserved.