com.hankcs.hanlp.seg.CRF
类 CRFSegment
java.lang.Object
com.hankcs.hanlp.seg.Segment
com.hankcs.hanlp.seg.CharacterBasedGenerativeModelSegment
com.hankcs.hanlp.seg.CRF.CRFSegment
public class CRFSegment
- extends CharacterBasedGenerativeModelSegment
基于CRF的分词器
- 作者:
- hankcs
| 从类 com.hankcs.hanlp.seg.Segment 继承的方法 |
atomSegment, combineByCustomDictionary, enableAllNamedEntityRecognize, enableCustomDictionary, enableIndexMode, enableJapaneseNameRecognize, enableMultithreading, enableMultithreading, enableNameRecognize, enableOffset, enableOrganizationRecognize, enablePartOfSpeechTagging, enablePlaceRecognize, enableTranslatedNameRecognize, mergeNumberQuantifier, quickAtomSegment, seg, seg, seg2sentence, simpleAtomSegment |
| 从类 java.lang.Object 继承的方法 |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
CRFSegment
public CRFSegment()
segSentence
protected List<Term> segSentence(char[] sentence)
- 从类
Segment 复制的描述
- 给一个句子分词
- 指定者:
- 类
Segment 中的 segSentence
- 参数:
sentence - 待分词句子
- 返回:
- 单词列表
toTermList
protected static List<Term> toTermList(List<Vertex> vertexList,
boolean offsetEnabled)
- 将一条路径转为最终结果
- 参数:
vertexList - offsetEnabled - 是否计算offset
- 返回:
atomSegment
public static List<String> atomSegment(char[] sentence)
atomSegmentToTable
public static String[][] atomSegmentToTable(char[] sentence)
enableNumberQuantifierRecognize
public Segment enableNumberQuantifierRecognize(boolean enable)
- 从类
Segment 复制的描述
- 是否启用数词和数量词识别
即[二, 十, 一] => [二十一],[十, 九, 元] => [十九元]
- 覆盖:
- 类
Segment 中的 enableNumberQuantifierRecognize
- 返回:
Copyright © 2014–2015 鐮佸啘鍦�/a>. All rights reserved.