Tensorflow nmt的超参数

2024-02-09 05:08

文章标签 参数 tensorflow nmt

本文主要是介绍Tensorflow nmt的超参数，希望对大家解决编程问题提供一定的参考价值，需要的开发者们随着小编来一起学习吧！

Tensorflow nmt的超参数　　

超参数一般用来定义我们的神经网络的关键参数．　　

在tensorflow/nmt这个demo中，我们的超参数在 nmt.nmt 模块中配置．这也导致了nmt.py这个文件的代码行数比较多，我们完全可以把参数的配置放到单独的一个文件中去．nmt.py 这个文件也是整个项目的入口文件．如果你想了解这个demo的整体结构，请查看我的另一篇博客tensorflow/nmt的整体结构,　这就不展开了．　

下面我会列出nmt模型定义的超参数，并且追条解释，希望能加深你对这些参数的理解．　　

本demo的超参数使用的是argparse模块进行配置的，如果你喜欢，也可以使用tensorflow中的 tf.app.flags.DEFINE_xxx() 函数来配置，后者是前者的简单封装．　　

超参数列表　　

首先用表格的形式列出所有的超参数，对他们的解释放在下一小节．　　

超参数(hparams)	类型(type)	默认值(default)	简介(help)
`--num_units`	`int`	32	network size
`--num_layers`	`int`	2	network depth
`--num_encoder_layers`	`int`	`None`	encoder depth, equal to `num_layers` if `None`
`--num_decoder_layers`	`iny`	`None`	decoder depth, equal to `num_layers` if `None`
`--encoder_type`	`str`	`uni`	one of `uni`, `bi`, `gnmt`
`--residual`	`bool`	`False`	whether to add residual connections
`--time_major`	`bool`	`True`	whether to add time-major mode for dynamic RNN
`--num_embeddings_partitions`	`int`	0	number of partitions for embedding vars
`--attention`	`str`	`""`	one of `""`, `luong`, `scaled_luong`, `bahdanau`, `normed_bahdanau`
`--attention_architecture`	`str`	standard	one of `standard`, `gnmt`, `gnmt_v2`
`--output_attention`	`bool`	`True`	only used in standard attention_architecture
`--pass_hidden_state`	`bool`	`True`	whether to pass enco