“`” 参考回答:

推荐算法:

基于人口学的推荐、基于内容的推荐、基于用户的协同过滤推荐、基于项目的协同过滤推荐、基于模型的协同过滤推荐、基于关联规则的推荐

FM:

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637289817_6CBE803838BD47DA6A26DD810D78B3AC"">

LR:

逻辑回归本质上是线性回归,只是在特征到结果的映射中加入了一层逻辑函数g(z),即先把特征线性求和,然后使用函数g(z)作为假设函数来预测。g(z)可以将连续值映射到0 和1。g(z)为sigmoid function.

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637314282_ABB45846A64BE026999B9DABBA362E7B"">

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637336566_C010D5A3D04510FDEF55D95CD161A772"">

sigmoid function 的导数如下:

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637357116_8D7C8B52CDF9D7EC482A998311A2B818"">

逻辑回归用来分类0/1 问题,也就是预测结果属于0 或者1 的二值分类问题。这里假设了二值满足伯努利分布,也就是

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637381679_F86D134E5F5D14B70687DC5056207F35"">

其也可以写成如下的形式:

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637414266_2341937DFE3AB4397DA2D3208D941214"">

对于训练数据集,特征数据x={x1, x2, … , xm}和对应的分类标签y={y1, y2, … , ym},假设m个样本是相互独立的,那么,极大似然函数为:

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637431278_30275CD5F73DDC724BBF2B58F9D45873"">

log似然为:

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637486261_09AE66DE1E5BC09CEE7BA7671FD57127"">

如何使其最大呢?与线性回归类似,我们使用梯度上升的方法(求最小使用梯度下降),那么<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637470767_2C54D430EF1DB11745101A428E1DFF8A"">。

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637510300_EDB8DB5C2AFC07D939F66F3EB0446E91"">

如果只用一个训练样例(x,y),采用随机梯度上升规则,那么随机梯度上升更新规则为:

<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637659967_D9809693D0D46FB380CD1561583B08A0"">

Embedding:

Embedding在数学上表示一个maping:<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637687727_E68A0F3E25D4E8A01B9489357EE9A93E"">,也就是一个function。其中该函数满足两个性质:1)injective (单射的):就是我们所说的单射函数,每个Y只有唯一的X对应;2)structure-preserving(结构保存):比如在X所属的空间上<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637707145_3E12A2902C5F5C6E8ED1380D7A7DAA6F"">,那么映射后在Y所属空间上同理<img alt=""img"" referrerpolicy=""no-referrer"" src=""https://uploadfiles.nowcoder.com/images/20190315/311436_1552637574195_F515A3790698D5F759A645E8216D1177"">。

那么对于word embedding,就是找到一个映射(函数)将单词(word)映射到另外一个空间(其中这个映射具有injective和structure-preserving的特点),生成在一个新的空间上的表达,该表达就是word representation。

<pre><code> "“`

Was this helpful?

0 / 0

发表回复 0

Your email address will not be published.