主题模型聚类匹配2018TKDE阅读笔记（Topic Models for Unsupervised Cluster Matching）

其他 2018-06-14 12:07:21 阅读次数: 2

本文作者：合肥工业大学管理学院钱洋 email：[email protected] 内容可能有不到之处，欢迎交流。

未经本人允许禁止转载。

论文来源

Iwata T, Hirao T, Ueda N. Topic Models for Unsupervised Cluster Matching[J]. IEEE Transactions on Knowledge and Data Engineering, 2018, 30(4): 786-795.

作者是日本人Iwata T，也是个机器学习大牛，每年都有一系列的文章出来，还是很厉害的。这篇文章是作者18年在TKDE上发表的。

论文简介

这篇文章的目的是利用主题模型，将不同领域数据进行聚类，并形成聚类结果的匹配。例如，在没有对应信息的情况下，发现英语和德语文章聚类之间的对应关系，例如不同语言下使用词汇的对应关系、同语义语句对应关系等。在作者的模型中，所有语言中的文档具有共同的主题，主题在所有语言中是共享的。每篇文档有其特定的一个主题分布以及特定语言中主题的词分布。为了学习文档主题分布，作者将不同语言的文档分配到共同的簇中，每个簇有其对应的主题分布。文档(不同的语言)被分配到一个相同的簇中被认为是匹配。作者使用的方法是collapsed Gibbs sampling。

论文详细介绍

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

这里写图片描述

猜你喜欢

转载自blog.csdn.net/qy20115549/article/details/80031523

主题模型聚类匹配2018TKDE阅读笔记（Topic Models for Unsupervised Cluster Matching）

【论文阅读笔记】Recursive Unsupervised Learning of Finite Mixture Models

131.005 Unsupervised Learning - Cluster | 非监督学习 - 聚类

[文献阅读]—Improving the Lexical Ability of Pretrained Language Models for Unsupervised NMT

论文笔记：Cluster Alignment with a Teacher for Unsupervised Domain Adaptation

Language Models are Unsupervised Multitask Learners 论文纪要

Language Models are Unsupervised Multitask Learners翻译

NIPS20 基于在线聚类的表征学习 SwAV《Unsupervised Learning of Visual Features by Contrasting Cluster Assignment》

CS231n Lecture 13 | Unsupervised Learning and Generative Models

GPT2.0 Language Models are Unsupervised Multitask Learners 论文解读

【NLP经典论文精读】Language Models are Unsupervised Multitask Learners

Hybrid Contrastive Learning with Cluster Ensemble for Unsupervised Person Re-identification

《Unsupervised Image Captioning》阅读笔记

动态主题模型（Dynamic Topic Models）

RainDiffusion:When Unsupervised Learning Meets Diffusion Models for Real-world Image Deraining

模式匹配Pattern Matching

稳定匹配 - Stable Matching

文本匹配（Text Matching）

完美匹配（matching）

String Matching（模式匹配）

人脸匹配（face matching）

2022 TIP: Cluster-guided Asymmetric Contrastive Learning for Unsupervised Person Re-Identification

聚类 Cluster

Introduction to Probabilistic Topic Models

Unsupervised Translation of Programming Languages阅读笔记

Django基础__( models模型 )

django 模型models

tensorflow/models模型下载

Models模型（下）

Django的Models模型

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)