Enriching BERT with Knowledge Graph Embeddings for Document Classification

Category: Other | 2020-02-12 08:51:08

arXiv github

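This is an experimental paper written for GermEval 2019 Task 1 – Shared task on hierarchical classification of blurbs. Broadly, the problem it addresses is text classification; given the dataset used, it is more specifically a multi-class text classification task with hierarchical labels. Compared with the datasets used for basic text classification, each sample in the GermEval 2019 dataset contains:

Title
Author list
Descriptive text
URL
ISBN
Publication date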

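The labels are organized hierarchically, and each level of the hierarchy has its own label set: the three levels contain 8, 93, and 242 classes respectively. The difficulty of classification therefore differs from level to level. The authors use BERT to obtain the document representation, and supply extra information to the final classifier from two sources: metadata statistics computed per sample (such as the number of authors, whether an academic title is present, and the number of words in the title) and a Wikidata-based graph embedding model. Together these improve the performance of the classification model.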
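The metadata statistics mentioned above can be sketched as a small feature-extraction function. This is a minimal illustration, not the authors' code: the function name `metadata_features`, the regular expression for detecting an academic title ("Dr." or "Prof." in an author's name), and the sample record are all assumptions made for the example.

```python
import re

# Assumption: an "academic title" is detected by a "Dr." or "Prof." prefix
# in an author's name. The exact rule used in the paper may differ.
ACADEMIC_TITLE = re.compile(r"\b(Dr|Prof)\.", re.IGNORECASE)


def metadata_features(title, authors):
    """Return [number of authors, academic-title flag, words in title]."""
    has_academic_title = any(ACADEMIC_TITLE.search(name) for name in authors)
    return [
        float(len(authors)),                 # number of authors
        1.0 if has_academic_title else 0.0,  # 1.0 if any author has a title
        float(len(title.split())),           # whitespace-separated word count
    ]


# Hypothetical sample record, made up for illustration.
features = metadata_features(
    title="Ein kurzer Beispieltitel",
    authors=["Prof. Dr. Erika Mustermann", "Max Mustermann"],
)
print(features)
```

Returning plain floats keeps the vector ready to be concatenated with the text representation later on.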

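The model diagram is shown below; the structure is clear and the principle is simple. BERT first produces a representation vector for the Title and Text. Before the MLP makes its prediction, the two kinds of additional information described above are concatenated with that Title and Text representation. A fully connected layer with Softmax then predicts the class.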

[Figure: model architecture diagram]
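The concatenate-then-classify step can be sketched as a toy forward pass. This is a standard-library sketch under stated assumptions, not the authors' implementation: the vector sizes, the random untrained weights, the single hidden layer, and the helper names (`init_layer`, `classify`) are all hypothetical, and the "BERT" vector is a random placeholder. A real model would use a deep learning framework and trained parameters.

```python
import math
import random


def linear(x, weights, bias):
    """Dense layer: one output value per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x):
    return [max(0.0, v) for v in x]


def softmax(x):
    """Numerically stable softmax over a list of scores."""
    m = max(x)
    exps = [math.exp(v - m) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


def init_layer(rng, n_in, n_out):
    """Random, untrained weights; for illustration only."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def classify(text_vec, meta_vec, author_vec, hidden, output):
    # Concatenate the text representation with the two extra feature vectors.
    x = text_vec + meta_vec + author_vec
    h = relu(linear(x, *hidden))          # MLP hidden layer
    return softmax(linear(h, *output))    # class probabilities


rng = random.Random(0)
text_vec = [rng.gauss(0, 1) for _ in range(16)]   # placeholder for the BERT output
meta_vec = [2.0, 1.0, 3.0]                        # e.g. metadata statistics
author_vec = [rng.gauss(0, 1) for _ in range(8)]  # placeholder graph embedding
n_classes = 8                                     # size of the top label level

hidden = init_layer(rng, len(text_vec) + len(meta_vec) + len(author_vec), 32)
output = init_layer(rng, 32, n_classes)
probs = classify(text_vec, meta_vec, author_vec, hidden, output)
print(len(probs), round(sum(probs), 6))
```

The point of the sketch is the data flow: the extra features never pass through BERT, they are simply appended to its output before the classifier layers.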

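The experimental results are shown below. The ablation study shows that the additional information does help improve the performance of the classification model.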

[Figure: experimental results]

Because this is only an experimental paper, it does not introduce many new ideas; see the original paper for details.

Forlogen

Reposted from blog.csdn.net/Forlogen/article/details/103909801
