tensorflow实现从.ckpt文件中读取任意变量

2023-07-24 05:31:04 426

思路有些混乱，希望大家能理解我的意思。

看了fasterrcnn的tensorflow代码，关于fix_variables的作用我不是很明白，所以写了以下代码，读取了预训练模型vgg16得fc6和fc7的参数，以及fasterrcnn中heat_to_tail中的fc6和fc7，将它们做了对比，发现结果不一样，说明vgg16的fc6和fc7只是初始化了fasterrcnn中heat_to_tail中的fc6和fc7，之后后者被训练。

具体读取任意变量的代码如下：

importtensorflowastf
importnumpyasnp
fromtensorflow.pythonimportpywrap_tensorflow

file_name='/home/dl/projectBo/tf-faster-rcnn/data/imagenet_weights/vgg16.ckpt'#.ckpt的路径
name_variable_to_restore='vgg_16/fc7/weights'#要读取权重的变量名
reader=pywrap_tensorflow.NewCheckpointReader(file_name)
var_to_shape_map=reader.get_variable_to_shape_map()
print('shape',var_to_shape_map[name_variable_to_restore])#输出这个变量的尺寸
fc7_conv=tf.get_variable("fc7",var_to_shape_map[name_variable_to_restore],trainable=False)#定义接收权重的变量名
restorer_fc=tf.train.Saver({name_variable_to_restore:fc7_conv})#定义恢复变量的对象
sess=tf.Session()
sess.run(tf.variables_initializer([fc7_conv],name='init'))#必须初始化
restorer_fc.restore(sess,file_name)#恢复变量
print(sess.run(fc7_conv))#输出结果

用以上的代码分别读取两个网络的fc6和fc7，对应参数尺寸和权值都不同，但参数量相同。

再看lib/nets/vgg16.py中的：

（注意注释）

deffix_variables(self,sess,pretrained_model):
print('FixVGG16layers..')
withtf.variable_scope('Fix_VGG16')asscope:
withtf.device("/cpu:0"):
#fixthevgg16issuefromconvweightstofcweights
#fixRGBtoBGR
fc6_conv=tf.get_variable("fc6_conv",[7,7,512,4096],trainable=False)
fc7_conv=tf.get_variable("fc7_conv",[1,1,4096,4096],trainable=False)
conv1_rgb=tf.get_variable("conv1_rgb",[3,3,3,64],trainable=False)#定义接收权重的变量，不可被训练
restorer_fc=tf.train.Saver({self._scope+"/fc6/weights":fc6_conv,
self._scope+"/fc7/weights":fc7_conv,
self._scope+"/conv1/conv1_1/weights":conv1_rgb})#定义恢复变量的对象
restorer_fc.restore(sess,pretrained_model)#恢复这些变量

sess.run(tf.assign(self._variables_to_fix[self._scope+'/fc6/weights:0'],tf.reshape(fc6_conv,
self._variables_to_fix[self._scope+'/fc6/weights:0'].get_shape())))
sess.run(tf.assign(self._variables_to_fix[self._scope+'/fc7/weights:0'],tf.reshape(fc7_conv,
self._variables_to_fix[self._scope+'/fc7/weights:0'].get_shape())))
sess.run(tf.assign(self._variables_to_fix[self._scope+'/conv1/conv1_1/weights:0'],
tf.reverse(conv1_rgb,[2])))#将vgg16中的fc6、fc7中的权重reshape赋给faster-rcnn中的fc6、fc7

我的理解：fasterrcnn的网络继承了分类网络的特征提取权重和分类器的权重，让网络从一个比较好的起点开始被训练，有利于训练结果的快速收敛。

补充知识：TensorFlow:加载部分ckpt文件变量&不同命名空间中加载模型

TensorFlow中，在加载和保存模型时，一般会直接使用tf.train.Saver.restore()和tf.train.Saver.save()

然而，当需要选择性加载模型参数时，则需要利用pywrap_tensorflow读取模型，分析模型内的变量关系。

例子：Faster-RCNN中，模型加载vgg16.ckpt，需要利用pywrap_tensorflow读取ckpt文件中的参数

fromtensorflow.pythonimportpywrap_tensorflow

model=VGG16()#此处构建vgg16模型
variables=tf.global_variables()#获取模型中所有变量

file_name='vgg16.ckpt'#vgg16网络模型
reader=pywrap_tensorflow.NewCheckpointReader(file_name)
var_to_shape_map=reader.get_variable_to_shape_map()#获取ckpt模型中的变量名
print(var_to_shape_map)

sess=tf.Session()

my_scope='my/'#外加的空间名
variables_to_restore={}#构建字典：需要的变量和对应的模型变量的映射
forvinvariables:
ifmy_scopeinv.nameandv.name.split(':')[0].split(my_scope)[1]invar_to_shape_map:
print('Variablesrestored:%s'%v.name)
variables_to_restore[v.name.split(':0')[0][len(my_scope):]]=v
elifv.name.split(':')[0]invar_to_shape_map:
print('Variablesrestored:%s'%v.name)
variables_to_restore[v.name]=v

restorer=tf.train.Saver(variables_to_restore)#将需要加载的变量作为参数输入
restorer.restore(sess,file_name)

实际中，FasterRCNN中所构建的vgg16网络的fc6和fc7权重shape如下：

,
,

vgg16.ckpt的fc6，fc7权重shape如下:

'vgg_16/fc6/weights':[7,7,512,4096],
'vgg_16/fc7/weights':[1,1,4096,4096],

因此,有如下操作：

fc6_conv=tf.get_variable("fc6_conv",[7,7,512,4096],trainable=False)
fc7_conv=tf.get_variable("fc7_conv",[1,1,4096,4096],trainable=False)

restorer_fc=tf.train.Saver({"vgg_16/fc6/weights":fc6_conv,
"vgg_16/fc7/weights":fc7_conv,
})
restorer_fc.restore(sess,pretrained_model)
sess.run(tf.assign(self._variables_to_fix['my/vgg_16/fc6/weights:0'],tf.reshape(fc6_conv,self._variables_to_fix['my/vgg_16/fc6/weights:0'].get_shape())))
sess.run(tf.assign(self._variables_to_fix['my/vgg_16/fc7/weights:0'],tf.reshape(fc7_conv,self._variables_to_fix['my/vgg_16/fc7/weights:0'].get_shape())))

以上这篇tensorflow实现从.ckpt文件中读取任意变量就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持毛票票。

tensorflow实现从.ckpt文件中读取任意变量

热门推荐

随机推荐