Lucene 评分机制 - 代码天地

Lucene 评分机制

企业开发 2018-05-12 15:20:20 阅读次数: 0

1. tf(t in d) correlates to the term's frequency, defined as the number of times term t appears in the currently scored document d. Documents that have more occurrences of a given term receive a higher score.

2. idf(t) stands for Inverse Document Frequency. This value correlates to the inverse of docFreq (the number of documents in which the term t appears). This means rarer terms give higher contribution to the total score

3. coord(q,d) is a score factor based on how many of the query terms are found in the specified document

4. norm(t,d) encapsulates a few (indexing time) boost and length factors:

Document boost - set by calling doc.setBoost() before adding the document to the index.
Field boost - set by calling field.setBoost() before adding the field to a document.
lengthNorm(field) - computed when the document is added to the index in accordance with the number of tokens of this field in the document, so that shorter fields contribute more to the score. When a document is added to the index, all the above factors are multiplied. If the document has multiple fields with the same name, all their boosts are multiplied together

对于wildcard search， Lucene默认会屏蔽掉norm部分的分数。如果想让它参与的话，可以重新设置它的rewrite method。

for (BooleanClause clause : ((BooleanQuery) query).getClauses()) {
        if (clause.getQuery() instanceof MultiTermQuery) {
        	((MultiTermQuery) clause.getQuery()).setRewriteMethod(MultiTermQuery.SCORING_BOOLEAN_QUERY_REWRITE);
        } else {        		
        	enableCalScore(clause.getQuery());
	}
}

猜你喜欢

转载自fenglingxuewqk.iteye.com/blog/2100418

Lucene 评分机制

Lucene Scoring 评分机制

<转>Lucene Scoring 评分机制

solr 评分机制

Lucene Similarity (Lucene 文档评分score机制详解)

Wifi 评分机制分析

Lucene的几种评分方式

Lucene评分源码调研

Lucene 评分(score)机制--Document Boost和Field Boost

三、Android 网络评分机制

【NLP】MT中BLEU评分机制

Elasticsearch中的相似度评分机制

向量空间模型与Lucene的打分机制以及影响打分的几种方式

Lucene TFIDFSimilarity评分公式详解

深入 Lucene 索引机制

网络连接评分机制之NetworkMonitor

（一百九十五）Android Q 学习WiFi的评分机制（一）

（一百九十七）Android Q 学习WiFi的评分机制（四）

（一百九十六）Android Q 学习WiFi的评分机制（三）

（一百九十六）Android Q 学习WiFi的评分机制（二）

Android P WiFi自动连接评分机制

Android Studio RadioButton实现单选试题评分机制

影响lucene的评分的几种方法

Lucene5学习之评分Scoring

【转】深入 Lucene 索引机制

Lucene

Lucene3.0源码分析(二) Lucene中获得评分前N的DocumentID算法

lucene4.5源码分析系列：lucene的默认评分算法-向量空间模型（Vector Space Model）

Lucene笔记25-Lucene的使用-根据域进行评分设定

Lucene笔记24-Lucene的使用-自定义评分简介

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)