actorcritic.nn¶
Contains utilities that concern TensorFlow and neural networks.
Functions
conv2d(input, params, stride, padding) |
Creates a 2D convolutional layer with bias (without activation). |
conv2d_params(num_input_channels, …) |
Creates weights and bias variables for a 2D convolutional layer. |
flatten(input) |
Flattens inputs but keeps the batch size. |
fully_connected(input, params) |
Creates a fully connected layer with bias (without activation). |
fully_connected_params(input_size, …) |
Creates weights and bias variables for a fully connected layer. |
linear_decay(start_value, end_value, step, …) |
Applies linear decay from start_value to end_value. |
Classes
ClipGlobalNormOptimizer(optimizer, clip_norm) |
A tf.train.Optimizer that wraps around another optimizer and minimizes the loss by clipping gradients using the global norm (tf.clip_by_global_norm()). |
-
class
actorcritic.nn.ClipGlobalNormOptimizer(optimizer, clip_norm, name=None)[source]¶ Bases:
tensorflow.python.training.optimizer.OptimizerA
tf.train.Optimizerthat wraps around another optimizer and minimizes the loss by clipping gradients using the global norm (tf.clip_by_global_norm()).See also
-
apply_gradients(grads_and_vars, global_step=None, name=None)[source]¶ Apply gradients to variables.
This is the second part of minimize(). It returns an Operation that applies gradients.
Parameters: - grads_and_vars – List of (gradient, variable) pairs as returned by compute_gradients().
- global_step – Optional Variable to increment by one after the variables have been updated.
- name – Optional name for the returned operation. Default to the name passed to the Optimizer constructor.
Returns: An Operation that applies the specified gradients. If global_step was not None, that operation also increments global_step.
Raises: TypeError– If grads_and_vars is malformed.ValueError– If none of the variables have gradients.
-
-
actorcritic.nn.conv2d(input, params, stride, padding)[source]¶ Creates a 2D convolutional layer with bias (without activation).
Parameters: - input (
tf.Tensor) – The input values. - params (
tupleof (tf.Variable,tf.Variable)) – A tuple of (weights, bias). Probably obtained byconv2d_params(). - stride (
int) – The stride of the convolution. - padding (
string) – The padding of the convolution. One of ‘VALID’, ‘SAME’.
Returns: tf.Tensor– The output values.- input (
-
actorcritic.nn.conv2d_params(num_input_channels, num_filters, filter_extent, dtype, weights_initializer, bias_initializer)[source]¶ Creates weights and bias variables for a 2D convolutional layer. These can be used in
conv2d().Parameters: - num_input_channels (
int) – The size of the input layer. - num_filters (
int) – The output size. Number of filters to apply. - filter_extent (
int) – The spatial extent of the filters. Determines the size of the weights. - dtype (
tf.DType) – The data type of the variables. - weights_initializer (
tf.keras.initializers.Initializer) – An initializer for the weights. - bias_initializer (
tf.keras.initializers.Initializer) – An initializer for the bias.
Returns: tupleof (tf.Variable,tf.Variable) – A tuple of (weights, bias).- num_input_channels (
-
actorcritic.nn.flatten(input)[source]¶ Flattens inputs but keeps the batch size.
Parameters: input ( tf.Tensor) – Input values of shape [batch_size, d_1, …, d_n].Returns: tf.Tensor– Flattened input values of shape [batch_size, d1 * … * d_n].
-
actorcritic.nn.fully_connected(input, params)[source]¶ Creates a fully connected layer with bias (without activation).
Parameters: - input (
tf.Tensor) – The input values. - params (
tupleof (tf.Variable,tf.Variable)) – A tuple of (weights, bias). Probably obtained byfully_connected_params().
Returns: tf.Tensor– The output values.- input (
-
actorcritic.nn.fully_connected_params(input_size, output_size, dtype, weights_initializer, bias_initializer)[source]¶ Creates weights and bias variables for a fully connected layer. These can be used in
fully_connected().Parameters: - input_size (
int) – The size of the input layer. - output_size (
int) – The output size. Number of units. - dtype (
tf.DType) – The data type of the variables. - weights_initializer (
tf.keras.initializers.Initializer) – An initializer for the weights. - bias_initializer (
tf.keras.initializers.Initializer) – An initializer for the bias.
Returns: tupleof (tf.Variable,tf.Variable) – A tuple of (weights, bias).- input_size (
-
actorcritic.nn.linear_decay(start_value, end_value, step, total_steps, name=None)[source]¶ Applies linear decay from start_value to end_value. The value at a specific step is computed as:
value = (start_value - end_value) * (1 - step / total_steps) + end_value
Parameters: Returns: tf.Tensor– The linear decayed value.