SGPT: GPT Sentence Embeddings for Semantic Search - 代码天地

SGPT: GPT Sentence Embeddings for Semantic Search

业界资讯 2023-12-17 22:19:49 阅读次数: 0

在这里插入图片描述

简介

语义搜索分为两个部分：
1.搜索和query 相关的topk文档。
2.理解文档和query后面隐藏的语义信息，而不是字面含义。
这篇论文提出了SGPT模型，只用decoder-only的transformer来进行语义搜索和sentence向量的提取。
1.SGPT-BE：来对文档和query进行粗略的相关度计算，由于可以对文档的向量进行缓存，所以计算量和文档的数量线性相关，SGPT使用了BitFit的方式只对模型bias等少部分参数进行微调，大部分模型参数在微调的过程中是被冻结的，所以能够大大提升模型的训练效率。
2.SGPT-CE：对文档和query进行concat拼接，拼接后输入到gpt模型中去，对模型输出的query token的概率进行sum pooling的方式，作为文档的得分。由于CE的方式每一个query都需要重复计算很多次，所以计算量比较大，所以一般是在BE之后，对top的文档进行encoder概率计算。

SGPT Cross-Encoder

在这里插入图片描述

SGPT Bi-Encoder

在这里插入图片描述

猜你喜欢

转载自blog.csdn.net/WitsMakeMen/article/details/133862074

SGPT: GPT Sentence Embeddings for Semantic Search

Deep Fragment Embeddings for Bidirectional Image Sentence Mapping

SCD Self-Contrastive Decorrelation for Sentence Embeddings

《Learning Semantic Concepts and Order for Image and Sentence Matching》

Easy Semantic Search

论文阅读：《a simple but tough-to-beat baseline for sentence embeddings》

论文解读：PromptBERT: Improving BERT Sentence Embeddings with Prompts

文献阅读笔记 # SimCSE: Simple Contrastive Learning of Sentence Embeddings

论文阅读 | Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

文献阅读笔记 # Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

UNE BASE SIMPLE MAIS PARFAITE POUR SENTENCE EMBEDDINGS(一个简单但很难超越的Sentence Embedding基线方法)

[句边界检测/标点符号预测]A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection

文献阅读笔记 # Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation

sentence

Sentence A

[sentence]

Real-time Personalization using Embeddings for Search Ranking at Airbnb

Elasticsearch：语义搜索 - Semantic Search in python

《Cross-Modal Retrieval in the Cooking Context__Learning Semantic Text-Image Embeddings》

读论文，多区块处理：Learning Semantic Concepts and Order forImage and Sentence Matching

How Contextual are Contextualized Word Representations in BERT、ELMO and GPT-2 Embeddings

【阅读笔记】Real-time Personalization using Embeddings for Search Ranking at Airbnb

推荐系统之Airbnb推荐：Real-time Personalizaton using Embeddings for Search Ranking at Airbnb

空间语义图像检索: Spatial-Semantic Image Search by Visual Feature Synthesis

《Learning Deep Structured Semantic Models for Web Search using Clickthrough Data 》论文总结

论文笔记系列-Auto-DeepLab:Hierarchical Neural Architecture Search for Semantic Image Segmentation

Semantic Segmentation---Auto-DeepLab: Hierarchical Neural Architecture Search for Semant ...（论文解读十六）

【DSSM】Learning Deep Structured Semantic Models for Web Search using Clickthrough Data

【语义分割】Auto-DeepLab Hierarchical Neural Architecture Search for Semantic Image Segmentation阅读翻译

Daily Sentence

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

让自己的头脑极度开放

CentOS 6.5(x64) 和Redhat6.5操作系误删libc

高可用注册中心

【日记】12.28/【题解】AtCoder AGC041

XML（5）_XML 约束_DTD

Java集合Map（四）

树梅派安装桌面环境教程

pipenv 的使用和安装

小程序白屏问题和内存研究

C语言简单选择排序

每日归档

更多

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)