Understanding matrix factorization for recommendation

Category: Other | 2019-10-08 17:33:51

http://nicolas-hug.com/blog/matrix_facto_4
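
For reference, the SGD updates in the code below can be read as gradient steps on the squared error of a single rating. This is a short sketch of that derivation; the symbol $\alpha$ for the learning rate is notation introduced here, and the constant factor 2 from the derivative is assumed to be absorbed into it.

```latex
\begin{aligned}
e_{ui} &= r_{ui} - p_u \cdot q_i \\
\frac{\partial}{\partial p_u} e_{ui}^2 &= -2\, e_{ui}\, q_i, \qquad
\frac{\partial}{\partial q_i} e_{ui}^2 = -2\, e_{ui}\, p_u \\
p_u &\leftarrow p_u + \alpha\, e_{ui}\, q_i \\
q_i &\leftarrow q_i + \alpha\, e_{ui}\, p_u
\end{aligned}
```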

import numpy as np
import surprise  # run 'pip install scikit-surprise' to install surprise
from surprise.model_selection import cross_validate


class MatrixFacto(surprise.AlgoBase):
    '''A basic rating prediction algorithm based on matrix factorization.'''

    def __init__(self, learning_rate, n_epochs, n_factors):

        surprise.AlgoBase.__init__(self)  # initialize the base class

        self.lr = learning_rate  # learning rate for SGD
        self.n_epochs = n_epochs  # number of iterations of SGD
        self.n_factors = n_factors  # number of factors

    def fit(self, trainset):
        '''Learn the vectors p_u and q_i with SGD'''

        print('Fitting data with SGD...')

        # Randomly initialize the user and item factors.
        p = np.random.normal(0, .1, (trainset.n_users, self.n_factors))
        q = np.random.normal(0, .1, (trainset.n_items, self.n_factors))

        # SGD procedure
        for _ in range(self.n_epochs):
            for u, i, r_ui in trainset.all_ratings():
                err = r_ui - np.dot(p[u], q[i])
                # Update vectors p_u and q_i. Keep a copy of p_u so that
                # the update of q_i uses the previous (non-updated) value.
                p_u_old = p[u].copy()
                p[u] += self.lr * err * q[i]
                q[i] += self.lr * err * p_u_old

        self.p, self.q = p, q
        self.trainset = trainset

        return self  # Surprise expects fit() to return the fitted algorithm

    def estimate(self, u, i):
        '''Return the estimated rating of user u for item i.'''

        # return scalar product between p_u and q_i if user and item are known,
        # else return the average of all ratings
        if self.trainset.knows_user(u) and self.trainset.knows_item(i):
            return np.dot(self.p[u], self.q[i])
        else:
            return self.trainset.global_mean


# Data loading. We use the MovieLens 100k dataset (https://grouplens.org/datasets/movielens/100k/).
# Surprise offers to download it if it is not already on disk.
data = surprise.Dataset.load_builtin('ml-100k')
# No manual split is needed: cross_validate below creates the folds.


algo = MatrixFacto(learning_rate=.01, n_epochs=10, n_factors=10)
# Evaluate with 5-fold cross-validation (this replaces the older surprise.evaluate call).
cross_validate(algo, data, measures=['RMSE', 'MAE'], cv=5, verbose=True)
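
To see the SGD loop in isolation, here is a minimal sketch that runs the same update rule on a tiny hand-made rating list, using only NumPy. The helper names `sgd_factorize` and `rmse`, the toy ratings, and the hyperparameter values are all assumptions made for illustration; they are not part of Surprise or of the original post.

```python
import numpy as np


def sgd_factorize(ratings, n_users, n_items, n_factors=2,
                  learning_rate=0.01, n_epochs=200, seed=0):
    """Learn user factors p and item factors q from (u, i, r_ui) triples.

    Hypothetical helper: it mirrors the loop in MatrixFacto.fit but takes a
    plain list of triples instead of a Surprise trainset.
    """
    rng = np.random.default_rng(seed)
    # Randomly initialize the user and item factors.
    p = rng.normal(0, .1, (n_users, n_factors))
    q = rng.normal(0, .1, (n_items, n_factors))
    for _ in range(n_epochs):
        for u, i, r_ui in ratings:
            err = r_ui - np.dot(p[u], q[i])
            # Copy p_u so q_i is updated with the previous value of p_u.
            p_u_old = p[u].copy()
            p[u] += learning_rate * err * q[i]
            q[i] += learning_rate * err * p_u_old
    return p, q


def rmse(ratings, p, q):
    """Root mean squared error of the predictions p_u . q_i on the ratings."""
    errors = [r_ui - np.dot(p[u], q[i]) for u, i, r_ui in ratings]
    return float(np.sqrt(np.mean(np.square(errors))))


# Made-up ratings for 3 users and 3 items: (user, item, rating).
toy_ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0),
               (1, 2, 1.0), (2, 1, 2.0), (2, 2, 5.0)]

# With n_epochs=0 the function returns the random initialization unchanged.
p0, q0 = sgd_factorize(toy_ratings, 3, 3, n_epochs=0)
p, q = sgd_factorize(toy_ratings, 3, 3, n_epochs=500)

rmse_before = rmse(toy_ratings, p0, q0)
rmse_after = rmse(toy_ratings, p, q)
print('RMSE before training:', rmse_before)
print('RMSE after training: ', rmse_after)
```

The error here is measured on the training ratings themselves, so it only shows that the updates reduce the fit error; it says nothing about generalization, which is what cross_validate measures above.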

Reposted from www.cnblogs.com/pengwang52/p/11636698.html
