ML.NET 示例：推荐之One Class 矩阵分解

发布时间：2023-09-06 02:26责任编辑：傅花花关键词：.NET

写在前面
准备近期将微软的machinelearning-samples翻译成中文，水平有限，如有错漏，请大家多多指正。
如果有朋友对此感兴趣，可以加入我：https://github.com/feiyun0112/machinelearning-samples.zh-cn

产品推荐 - 矩阵分解问题示例

ML.NET 版本	API 类型	状态	应用程序类型	数据类型	场景	机器学习任务	算法
v0.8	动态 API	最新版本	控制台应用程序	.txt 文件	推荐	矩阵分解	MatrixFactorizationTrainer (One Class)

在这个示例中，您可以看到如何使用ML.NET来构建产品推荐方案。

本示例中的推荐方式基于共同购买或经常一起购买的产品，这意味着它将根据客户的购买历史向客户推荐一组产品。

在这个示例中，基于经常一起购买的学习模型来推荐产品。

问题

在本教程中，我们将使用亚马逊共同购买产品数据集。

我们将使用One-Class因式分解机来构建我们的产品推荐器，它使用协同过滤方法。

我们介绍的one-class和其他因式分解机的区别在于，在这个数据集中，我们只有购买历史的信息。

我们没有评分或其他详细信息，如产品描述等。

“协同过滤”是在一个基本假设的情况下运作的，即如果某人A在一个问题上与某人B具有相同的意见，则在另一个问题上，相对其他随机选择的人，A更倾向于B的观点。

数据集

原始数据来自SNAP:
https://snap.stanford.edu/data/amazon0302.html

ML 任务 - 矩阵分解 (推荐)

这个示例的ML任务是矩阵分解，它是一个执行协同过滤的有监督的机器学习任务。

解决方案

要解决此问题，您需要在现有训练数据上建立和训练ML模型，评估其有多好（分析获得的指标），最后您可以使用/测试模型来预测给定输入数据变量的需求。

1. 建立模型

建立模型包括:

从 https://snap.stanford.edu/data/amazon0302.html 下载并复制数据集文件Amazon0302.txt。
使用以下内容替换列名：ProductID ProductID_Copurchased
在读取器中，我们已经提供了KeyRange，并且产品ID已经编码，我们需要做的就是使用几个额外的参数调用MatrixFactorizationTrainer。

下面是用于建立模型的代码：

 ????//STEP 1: Create MLContext to be shared across the model creation workflow objects ????var ctx = new MLContext(); ???//STEP 2: Create a reader by defining the schema for reading the product co-purchase dataset ???// ???????Do remember to replace amazon0302.txt with dataset from ??????????????https://snap.stanford.edu/data/amazon0302.html ???var reader = ctx.Data.TextReader(new TextLoader.Arguments() ???{ ???????Separator = "tab", ???????HasHeader = true, ???????Column = new[] ???????{ ???????????????new TextLoader.Column("Label", DataKind.R4, 0), ???????????????new TextLoader.Column("ProductID", DataKind.U4, new [] { new TextLoader.Range(0) }, new KeyRange(0, 262110)), ???????????????new TextLoader.Column("CoPurchaseProductID", DataKind.U4, new [] { new TextLoader.Range(1) }, new KeyRange(0, 262110)) ???????????} ???????}); ???????//STEP 3: Read the training data which will be used to train the movie recommendation model ???????var traindata = reader.Read(new MultiFileSource(TrainingDataLocation)); ???????//STEP 4: Your data is already encoded so all you need to do is call the MatrixFactorization Trainer with a few extra hyperparameters: ???????// ???????LossFunction, Alpa, Lambda and a few others like K and C as shown below. ????????var est = ctx.Recommendation().Trainers.MatrixFactorization("ProductID", "CoPurchaseProductID", ??????????????????????????????????????labelColumn: "Label", ????????????????????????????????????advancedSettings: s => ????????????????????????????????????{ ????????????????????????????????????????s.LossFunction = MatrixFactorizationTrainer.LossFunctionType.SquareLossOneClass; ????????????????????????????????????????s.Alpha = 0.01; ????????????????????????????????????????s.Lambda = 0.025; ????????????????????????????????????????// For better results use the following parameters ????????????????????????????????????????//s.K = 100; ????????????????????????????????????????//s.C = 0.00001; ????????????????????????????????????});

2. 训练模型

一旦定义了评估器，就可以根据可用的训练数据对评估器进行训练。

这将返回一个训练过的模型。

 ???//STEP 5: Train the model fitting to the DataSet ???//Please add Amazon0302.txt dataset from https://snap.stanford.edu/data/amazon0302.html to Data folder if FileNotFoundException is thrown. ???var model = est.Fit(traindata);

3. 使用模型

我们将通过创建预测引擎/函数来执行此模型的预测，如下所示。

 ???public class Copurchase_prediction ???{ ???????public float Score { get; set; } ???} ???public class ProductEntry ???{ ???????[KeyType(Contiguous = true, Count = 262111, Min = 0)] ???????public uint ProductID { get; set; } ???????[KeyType(Contiguous = true, Count = 262111, Min = 0)] ???????public uint CoPurchaseProductID { get; set; } ???????}

一旦创建了预测引擎，就可以预测两个产品被共同购买的分数。

 ???//STEP 6: Create prediction engine and predict the score for Product 63 being co-purchased with Product 3. ???// ???????The higher the score the higher the probability for this particular productID being co-purchased ????var predictionengine = model.MakePredictionFunction<ProductEntry, Copurchase_prediction>(ctx); ???var prediction = predictionengine.Predict( ????????????????????????????new ProductEntry() ????????????????????????????{ ????????????????????????????ProductID = 3, ????????????????????????????CoPurchaseProductID = 63 ????????????????????????????});

ML.NET 示例：推荐之One Class 矩阵分解

原文地址：https://www.cnblogs.com/feiyun0112/p/10110581.html

ML.NET 示例：推荐之One Class 矩阵分解

写在前面

产品推荐 - 矩阵分解问题示例

问题

数据集

ML 任务 - 矩阵分解 (推荐)

解决方案

1. 建立模型

2. 训练模型

3. 使用模型

知识推荐