python - 如何在 TensorFlow 中使用 tf.nn.embedding_lookup

python - 如何在 TensorFlow 中使用 tf.nn.embedding_lookup_sparse？

转载作者：太空狗更新时间：2023-10-30 01:50:29

27

4

我们已经尝试使用 tf.nn.embedding_lookup 并且它有效。但它需要密集的输入数据，现在我们需要 tf.nn.embedding_lookup_sparse 来进行稀疏输入。

我写了下面的代码，但出现了一些错误。

import tensorflow as tf
import numpy as np

example1 = tf.SparseTensor(indices=[[4], [7]], values=[1, 1], shape=[10])
example2 = tf.SparseTensor(indices=[[3], [6], [9]], values=[1, 1, 1], shape=[10])

vocabulary_size = 10
embedding_size = 1
var = np.array([0.0, 1.0, 4.0, 9.0, 16.0, 25.0, 36.0, 49.0, 64.0, 81.0])
#embeddings = tf.Variable(tf.ones([vocabulary_size, embedding_size]))
embeddings = tf.Variable(var)

embed = tf.nn.embedding_lookup_sparse(embeddings, example2, None)

with tf.Session() as sess:
    sess.run(tf.initialize_all_variables())

    print(sess.run(embed))

错误日志如下所示。

现在我不知道如何正确修复和使用此方法。如有任何意见，我们将不胜感激。

深入了解 safe_embedding_lookup_sparse 的单元测试后，我更困惑的是，如果给出稀疏权重，为什么我会得到这个结果，尤其是为什么我们得到像 embedding_weights[0][3 这样的东西] 其中 3 没有出现在上面的代码中。

最佳答案

tf.nn.embedding_lookup_sparse() 使用 Segmentation组合嵌入，这需要 SparseTensor 的索引从 0 开始并增加 1。这就是为什么会出现此错误。

您的稀疏张量不需要 bool 值，只需保存您要从嵌入中检索的每一行的索引。这是您调整后的代码:

import tensorflow as tf
import numpy as np

example = tf.SparseTensor(indices=[[0], [1], [2]], values=[3, 6, 9], dense_shape=[3])

vocabulary_size = 10
embedding_size = 1
var = np.array([0.0, 1.0, 4.0, 9.0, 16.0, 25.0, 36.0, 49.0, 64.0, 81.0])
embeddings = tf.Variable(var)

embed = tf.nn.embedding_lookup_sparse(embeddings, example, None)

with tf.Session() as sess:
    sess.run(tf.initialize_all_variables())
    print(sess.run(embed)) # prints [  9.  36.  81.]

此外，您可以使用 tf.SparseTensor() 中的索引，使用允许的 tf.nn.embedding_lookup_sparse() 之一组合词嵌入。组合器:

"sum" computes the weighted sum of the embedding results for each row.

"mean" is the weighted sum divided by the total weight.

"sqrtn" is the weighted sum divided by the square root of the sum of the squares of the weights.

例如:

example = tf.SparseTensor(indices=[[0], [0]], values=[1, 2], dense_shape=[2])
...
embed = tf.nn.embedding_lookup_sparse(embeddings, example, None, combiner='sum')
...
print(sess.run(embed)) # prints [ 5.]

关于python - 如何在 TensorFlow 中使用 tf.nn.embedding_lookup_sparse？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/39207587/

27

4

0

文章推荐： database - 组合缓存方法 - 基于内存缓存/磁盘

文章推荐： language-agnostic - 选择特定数据库管理系统的因素有哪些？

文章推荐： ruby-on-rails - rails : Auto-Detecting Database Adapter

文章推荐： ruby-on-rails - Rails 3 - 如何删除已创建的数据库表？

regex - 是否有正则表达式来替换 VIM 中 nn :nn:nn. nn 中的前导零(最后一个除外)和冒号？
在 Vim 中，我打开了一个基本结构如下的文件: 3677137 00:01:47.04 666239 00:12:57.86 4346 00:00:01.77 418 00:00:0
python - [nn.nn] 或 [nn] 的正则表达式，具有更正的分组
我正在尝试构建一个正则表达式来处理以字符串形式呈现给我的数据类型，有两种可能的格式: 字符串[nmin..nmax] 字符串[nmax] 其中 nmin 和 nmax 是一些数字。我构建了适合我的正
logging - tensorflow log_softmax tf.nn.log(tf.nn.softmax(predict)) tf.nn.softmax_cross_entropy_with_logits
我尝试按照 tensorflow 教程实现 MNIST CNN 神经网络，并找到这些实现 softmax 交叉熵的方法给出了不同的结果: (1) 不好的结果 softmax = tf.nn.softm
pytorch - 什么时候应该使用 nn.ModuleList，什么时候应该使用 nn.Sequential？
我是 Pytorch 的新手，我不太了解的一件事是 nn.ModuleList 的用法。和 nn.Sequential .我能知道什么时候应该使用一个而不是另一个吗？谢谢。最佳答案 nn.Modul
pytorch - 一起使用 nn.Linear() 和 nn.BatchNorm1d()
我不明白当数据为 3D 时 BatchNorm1d 如何工作(批量大小、H、W)。示例输入大小:(2,50,70) 图层:nn.Linear(70,20) 输出大小:(2,50,20) 如果我随后
python - NLTK 正则表达式模式中 * 和 * 之间有什么区别？
我浏览了chapter 7 NLTK 书中的内容正在寻找解决方案，但到目前为止我还不清楚。 *表示 0 个或多个名词 *正如书中所解释的，意思是0个或多个任何类型的名词 NLTK 中是 NN , NN
python - nn.MaxPool2d 与 nn.function.max_pool2d 之间的区别？
:nn.MaxPool2d(kernel_size, stride) 和 nn.function.max_pool2d(t, kernel_size, stride) 之间有什么区别？我在模块中定义
Hadoop 高可用性。配置了自动故障转移，但备用 NN 在 NN 再次启动之前不会变为事件状态
我正在使用 Hadoop 2.6.0-cdh5.6.0。我已经配置了 HA。我显示了事件(NN1)和备用名称节点(NN2)。现在，当我向事件名称节点(NN1)发出终止信号时，备用名称节点(NN2)不会
Pytorch:为什么在 nn.modules.loss 和 nn.functional 模块中都实现了损失函数？
Pytorch 中的许多损失函数都在 nn.modules.loss 和 nn.functional 中实现。例如，下面的两行返回相同的结果。 import torch.nn as nn impor
Tensorflow，tf.nn.softmax_cross_entropy_with_logits 和 tf.nn.sparse_softmax_cross_entropy_with_logits 的区别
我已阅读 docs of both functions ，但据我所知，对于函数 tf.nn.softmax_cross_entropy_with_logits(logits, labels, dim=
tensorflow - tf.nn.fused_batch_norm 返回的方差与 tf.nn.moments 不同
当我尝试比较 tf.nn.fused_batch_norm 的方差输出和 tf.nn.moments 的方差输出时，对于相同的输入，我没有相同的值。 import numpy as np import
tensorflow - tf.nn.fused_batch_norm 返回的方差与 tf.nn.moments 不同
当我尝试比较 tf.nn.fused_batch_norm 的方差输出和 tf.nn.moments 的方差输出时，对于相同的输入，我没有相同的值。 import numpy as np import
python - torch.nn.sequential 与多个 torch.nn.linear 的组合
这个问题在这里已经有了答案: Are there any computational efficiency differences between nn.functional() Vs nn.seq
java - 一旦主 NN 出现故障，自动从 Java 应用程序连接到 HDFS 辅助 NN
我有一个简单的 Java 客户端，可以将文件保存到 HDFS - 配置了 1 个名称节点。为此，我使用 hadoop 配置，指定默认文件系统，如: org.apache.hadoop.conf.Con
sql - 在 SQL Server 中，转换具有这种格式的 varchar (nnn :nn:nn)
我将此 varchar 格式作为时间累积，我想将其转换为整数以执行 SUM 并获得一组的总时间。第一部分可以是1、2、3、4甚至5位数字，代表小时数的累加，然后用冒号隔开。然后是第二部分，即分钟的累积
lstm - nn.LSTMCell 的 torch 0.4.0 nn.LayerNorm 示例的任何示例？
在 pytorch 0.4.0 版本中，有一个 nn.LayerNorm模块。我想在我的 LSTM 网络中实现这一层，尽管我在 LSTM 网络上找不到任何实现示例。 pytorch 贡献者暗示这 n
python-3.x - PyTorch 中的 nn.functional() 与 nn.sequential() 之间是否存在计算效率差异
以下是使用 PyTorch 中的 nn.functional() 模块的前馈网络 import torch.nn as nn import torch.nn.functional as F class
nhibernate - 当数据库中的列为 Null 时，是什么导致了 "Invalid index nn for this SqlParameterCollection with Count=nn"？
对于住宿实体，我们有两列可以为空:CollectionType和 AccommodationUnitType . 但是我注意到在数据中它们被设置为零而不是空，导致 NHibernate 尝试查找 id
python - 如何在 Python 中使用 NLTK 仅将具有以下模式 * *"run in" 的句子分块？
我只需要分块那些只有那种模式的短语，而不是再分块一次。我在 Python 中使用 NLTK 库完成了它，但不起作用 import nltk import re document="they run
python - 是否可以自动调整 PyTorch 中 torch.nn.Sequential 中 torch.nn.Flatten 之后的层的后续输入？
例如，如果我有以下模型类: class MyTestModel(nn.Module): def __init__(self): super(MyTestModel, self)

首页

博学

6Ren·AI

商城

python - 如何在 TensorFlow 中使用 tf.nn.embedding_lookup_sparse？