com.hankcs.hanlp.corpus.occurrence
类 Occurrence

java.lang.Object
  继承者 com.hankcs.hanlp.corpus.occurrence.Occurrence

public class Occurrence
extends Object

词共现统计,最多统计到三阶共现

作者:
hankcs

字段摘要
static char RIGHT
          两个词的正向连接符 中国 RIGHT 人民
 
构造方法摘要
Occurrence()
           
 
方法摘要
 void addAll(List<Term> resultList)
           
 void addAll(String text)
           
 void addAll(String[] termList)
           
 void addPair(String first, String second)
          添加一个共现
 void addTerm(String key)
          统计词频
 void addTria(String first, String second, String third)
           
 void compute()
          输入数据完毕,执行计算
 double computeLeftEntropy(PairFrequency pair)
          计算左熵
 double computeMutualInformation(PairFrequency pair)
           
 double computeMutualInformation(String first, String second)
           
 double computeRightEntropy(PairFrequency pair)
          计算右熵
 int getPairFrequency(String first, String second)
           
 List<PairFrequency> getPhraseByLe()
           
 List<PairFrequency> getPhraseByMi()
           
 List<PairFrequency> getPhraseByRe()
           
 List<PairFrequency> getPhraseByScore()
           
 int getTermFrequency(String term)
          获取词频
 String toString()
           
 
从类 java.lang.Object 继承的方法
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

字段详细信息

RIGHT

public static final char RIGHT
两个词的正向连接符 中国 RIGHT 人民

另请参见:
常量字段值
构造方法详细信息

Occurrence

public Occurrence()
方法详细信息

addPair

public void addPair(String first,
                    String second)
添加一个共现

参数:
first - 第一个词
second - 第二个词

addTerm

public void addTerm(String key)
统计词频

参数:
key - 增加一个词

addTria

public void addTria(String first,
                    String second,
                    String third)

getTermFrequency

public int getTermFrequency(String term)
获取词频

参数:
term -
返回:

getPairFrequency

public int getPairFrequency(String first,
                            String second)

addAll

public void addAll(String[] termList)

getPhraseByMi

public List<PairFrequency> getPhraseByMi()

getPhraseByLe

public List<PairFrequency> getPhraseByLe()

getPhraseByRe

public List<PairFrequency> getPhraseByRe()

getPhraseByScore

public List<PairFrequency> getPhraseByScore()

addAll

public void addAll(List<Term> resultList)

addAll

public void addAll(String text)

toString

public String toString()
覆盖:
Object 中的 toString

computeMutualInformation

public double computeMutualInformation(String first,
                                       String second)

computeMutualInformation

public double computeMutualInformation(PairFrequency pair)

computeLeftEntropy

public double computeLeftEntropy(PairFrequency pair)
计算左熵

参数:
pair -
返回:

computeRightEntropy

public double computeRightEntropy(PairFrequency pair)
计算右熵

参数:
pair -
返回:

compute

public void compute()
输入数据完毕,执行计算



Copyright © 2014–2015 码农场. All rights reserved.