Mahout: distributed item-based algorithm 1 - 代码天地

Mahout: distributed item-based algorithm 1

企业开发 2018-05-12 21:03:32 阅读次数: 0

co-occurrence matrix

Instead of computing the similarity between every pair of items, it’ll compute the number of times each pair of items occurs together in some user’s list of preferences, in order to fill out the matrix.

Co-occurrence is like similarity; the more two items turn up together, the more related or similar they probably are. The co-occurrence matrix plays a role like that of ItemSimilarity in the nondistributed item-based algorithm.

user vectors

Likewise, in a data model with n items, user preferences are like a vector over n dimensions, with one dimension for each item. The user’s preference values for items are the values in the vector. Items that the user expresses no preference for map to a 0 value in the vector. Such a vector is typically quite sparse, and mostly zeroes, because users typically express a preference for only a small subset of all items.

Producing the recommendations

The product of the co-occurrence matrix and a user vector is itself a vector whose dimension is equal to the number of items. The values in this resulting vector, R, lead us directly to recommendations: the highest values in R correspond to the best recommendations.

That third row contains co-occurrences between item 103 and all other items. Intuitively, if item 103 co-occurs with many items that user 3 expresses a preference for, then it’s probably something that user 3 would like.

References

http://en.wikipedia.org/wiki/Matrix_multiplication

http://haselgrove.id.au/wikipedia.htm

猜你喜欢

转载自ylzhj02.iteye.com/blog/2059507

Mahout: distributed item-based algorithm 1

Mahout: distributed item-based algorithm 2

Mahout: distributed item-based algorithm 3

Parallel and Distributed Algorithm-1

Item-based recommendation

Parallel and Distributed Algorithm-3

Parallel and Distributed Algorithm-2

1）Apache Mahout 理论介绍

mahout

Distributed

基于Item-Based协同过滤推荐

阅读笔记：Item-based Collaborative Filtering Recommendation Algorithms

论文笔记：Item-Based Collaborative Filtering Recommendation Algorithms

mahout中bayes分类分析—1

mahout之1-Canopy聚类

mahout从入门到放弃--安装（1）

Interview(1)Algorithm Book

Sorting Algorithm(1)

Algorithm：No1 Sorting

c++ algorithm（1）

algorithm learning for Leetcode (1)

[Algorithm]Algorithm章1 排序算法

Privacy Protection in Distributed Fingerprint-based Authentication

MyCAT – Distributed database middleware based on MySQL

【译】Distributed Deep Learning - Part 1 - An Introduction

Storm based realtime recommendation algorithm

Clustering：Model-Based Algorithm

PyQt（Python+Qt）学习随笔:基于项的项部件（Item Widgets（Item-Based））概述

协同过滤User-based算法与Item-based算法对比

nowcoder basic algorithm chapter 1

今日推荐

Linus “吃狗粮”最积极！

开源日报 | Winamp播放器即将开源；生成式AI之战升级第二轮；Linus“吃狗粮”最积极；AI进入泡沫前期；吴泳铭为阿里云带来了什么？

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

周排行

SVN服务端安装在阿里云

实战 | 相机标定

webpack核心概念

note20——》只要肯低头吃苦，人生就会有救

PAT甲级 1062 Talent and Virtue （25 分）排序

NG Toolset开发笔记--5GNR Resource Grid（26）

如何对待上司

oracle命令

第9章 STL迭代器

logstash使用es映射模板

每日归档

更多

2024-05-20(36)

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)