*Last updated: 2023-03-16.*

# tf_quant_finance.math.optimizer.conjugate_gradient_minimize

View source Minimizes a differentiable function. ```python tf_quant_finance.math.optimizer.conjugate_gradient_minimize( value_and_gradients_function, initial_position, tolerance=1e-08, x_tolerance=0, f_relative_tolerance=0, max_iterations=50, parallel_iterations=1, stopping_condition=None, params=None, name=None ) ``` Implementation of algorithm described in [HZ2006]. Updated formula for next search direction were taken from [HZ2013]. Supports batches with 1-dimensional batch shape. #### References: [HZ2006] Hager, William W., and Hongchao Zhang. "Algorithm 851: CG_DESCENT, a conjugate gradient method with guaranteed descent." http://users.clas.ufl.edu/hager/papers/CG/cg_compare.pdf [HZ2013] W. W. Hager and H. Zhang (2013) The limited memory conjugate gradient method. https://pdfs.semanticscholar.org/8769/69f3911777e0ff0663f21b67dff30518726b.pdf ### Usage: The following example demonstrates this optimizer attempting to find the minimum for a simple two dimensional quadratic objective function. ```python minimum = np.array([1.0, 1.0]) # The center of the quadratic bowl. scales = np.array([2.0, 3.0]) # The scales along the two axes. # The objective function and the gradient. def quadratic(x): value = tf.reduce_sum(scales * (x - minimum) ** 2) return value, tf.gradients(value, x)[0] start = tf.constant([0.6, 0.8]) # Starting point for the search. optim_results = conjugate_gradient.minimize( quadratic, initial_position=start, tolerance=1e-8) with tf.Session() as session: results = session.run(optim_results) # Check that the search converged assert(results.converged) # Check that the argmin is close to the actual value. np.testing.assert_allclose(results.position, minimum) ``` #### Args: * `value_and_gradients_function`: A Python callable that accepts a point as a real `Tensor` and returns a tuple of `Tensor`s of real dtype containing the value of the function and its gradient at that point. The function to be minimized. The input should be of shape `[..., n]`, where `n` is the size of the domain of input points, and all others are batching dimensions. The first component of the return value should be a real `Tensor` of matching shape `[...]`. The second component (the gradient) should also be of shape `[..., n]` like the input value to the function. * `initial_position`: Real `Tensor` of shape `[..., n]`. The starting point, or points when using batching dimensions, of the search procedure. At these points the function value and the gradient norm should be finite. * `tolerance`: Scalar `Tensor` of real dtype. Specifies the gradient tolerance for the procedure. If the supremum norm of the gradient vector is below this number, the algorithm is stopped. * `x_tolerance`: Scalar `Tensor` of real dtype. If the absolute change in the position between one iteration and the next is smaller than this number, the algorithm is stopped. * `f_relative_tolerance`: Scalar `Tensor` of real dtype. If the relative change in the objective value between one iteration and the next is smaller than this value, the algorithm is stopped. * `max_iterations`: Scalar positive int32 `Tensor`. The maximum number of iterations. * `parallel_iterations`: Positive integer. The number of iterations allowed to run in parallel. * `stopping_condition`: (Optional) A Python function that takes as input two Boolean tensors of shape `[...]`, and returns a Boolean scalar tensor. The input tensors are `converged` and `failed`, indicating the current status of each respective batch member; the return value states whether the algorithm should stop. The default is tfp.optimizer.converged_all which only stops when all batch members have either converged or failed. An alternative is tfp.optimizer.converged_any which stops as soon as one batch member has converged, or when all have failed. * `params`: ConjugateGradientParams object with adjustable parameters of the algorithm. If not supplied, default parameters will be used. * `name`: (Optional) Python str. The name prefixed to the ops created by this function. If not supplied, the default name 'minimize' is used. #### Returns: * `optimizer_results`: A namedtuple containing the following items: converged: boolean tensor of shape `[...]` indicating for each batch member whether the minimum was found within tolerance. failed: boolean tensor of shape `[...]` indicating for each batch member whether a line search step failed to find a suitable step size satisfying Wolfe conditions. In the absence of any constraints on the number of objective evaluations permitted, this value will be the complement of `converged`. However, if there is a constraint and the search stopped due to available evaluations being exhausted, both `failed` and `converged` will be simultaneously False. num_objective_evaluations: The total number of objective evaluations performed. position: A tensor of shape `[..., n]` containing the last argument value found during the search from each starting point. If the search converged, then this value is the argmin of the objective function. objective_value: A tensor of shape `[...]` with the value of the objective function at the `position`. If the search converged, then this is the (local) minimum of the objective function. objective_gradient: A tensor of shape `[..., n]` containing the gradient of the objective function at the `position`. If the search converged the max-norm of this tensor should be below the tolerance.