如何通过TensorFlow的feed字典传递标量

我的TensorFlow模型使用tf.random_uniform来初始化变量。我希望在开始训练时指定范围，因此我为初始化值创建了一个占位符。

init = tf.placeholder(tf.float32, name="init")v = tf.Variable(tf.random_uniform((100, 300), -init, init), dtype=tf.float32)initialize = tf.initialize_all_variables()

我在训练开始时这样初始化变量。

session.run(initialize, feed_dict={init: 0.5})

这会给我以下错误：

ValueError: initial_value must have a shape specified: Tensor("Embedding/random_uniform:0", dtype=float32)

我无法找出传递给tf.placeholder的正确shape参数。我认为对于标量，我应该这样做init = tf.placeholder(tf.float32, shape=0, name="init")，但这会导致以下错误：

ValueError: Incompatible shapes for broadcasting: (100, 300) and (0,)

如果我在调用tf.random_uniform时用字面值0.5替换init，它会工作。

我如何通过feed字典传递这个标量初始值？

回答：

TL;DR: 按如下方式定义init为标量形状：

init = tf.placeholder(tf.float32, shape=(), name="init")

这看起来像是tf.random_uniform()的一个不幸的实现细节：它当前使用tf.add()和tf.multiply()来将随机值从[-1, +1]重新缩放到[minval, maxval]，但如果minval或maxval的形状未知，tf.add()和tf.multiply()无法推断出正确的形状，因为可能涉及广播。

通过定义init为已知形状（标量是()或[]，而不是0），TensorFlow可以对tf.random_uniform()结果的形状进行正确的推断，你的程序应该按预期工作。

学技术

如何通过TensorFlow的feed字典传递标量

发表回复取消回复

相关文章：

Related Posts

使用LSTM在Python中预测未来值

如何在gensim的word2vec模型中查找双词组的相似性

dask_xgboost.predict 可以工作但无法显示 – 数据必须是一维的

ML Tuning – Cross Validation in Spark

如何在React JS中使用fetch从REST API获取预测

如何分析ML.NET中多类分类预测得分数组？

发表回复 取消回复

发表回复取消回复