minimize#

Purpose#

Minimizes a function using the L-BFGS-B algorithm for bound-constrained optimization.

Format#

out = minimize(&fct, x0)
out = minimize(&fct, x0, ctl)
out = minimize(&fct, x0, ...)
out = minimize(&fct, x0, ..., ctl)
Parameters:
  • &fct (function pointer) – pointer to a procedure that computes the objective function to be minimized. The procedure receives the parameter vector x as its first argument, plus any additional data arguments passed to minimize().

  • x0 (vector) – Kx1 vector, starting values for the parameters.

  • ... (any) – Optional extra arguments. These arguments are passed, in order and unmodified, to the user-provided objective function.

  • ctl (struct) –

    Optional input. Instance of a minimizeControl structure. Normally an instance is initialized by calling minimizeControlCreate() and members of this instance can be set to other values by the user. For an instance named ctl, the members are:

    ctl.m

    scalar, number of L-BFGS corrections to store. Default = 10.

    ctl.maxIters

    scalar, maximum number of iterations. Default = 1000.

    ctl.factr

    scalar, function convergence tolerance factor. Convergence occurs when (f_k - f_{k+1}) / max(|f_k|, |f_{k+1}|, 1) <= factr * machine_epsilon. Use 1e12 for low accuracy, 1e7 for moderate accuracy (default), 1e1 for high accuracy.

    ctl.pgtol

    scalar, projected gradient tolerance. Default = 1e-5.

    ctl.printLevel

    scalar, output level: 0 = silent (default), 1 = final summary, 2 = each iteration.

    ctl.lb

    scalar or Kx1 vector, lower bounds on parameters. If scalar, applies to all parameters. Default = -1e300 (effectively unbounded).

    ctl.ub

    scalar or Kx1 vector, upper bounds on parameters. If scalar, applies to all parameters. Default = 1e300 (effectively unbounded).
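
    As the formats above indicate, extra data arguments and the control structure may be combined in one call, with ctl always last. A minimal sketch (reusing the ols_objective procedure, data, and starting values from Example 2 below):

    // Pass data arguments and a control structure together,
    // matching the format out = minimize(&fct, x0, ..., ctl)
    struct minimizeControl ctl;
    ctl = minimizeControlCreate();
    ctl.printLevel = 1;

    struct minimizeOut out;
    out = minimize(&ols_objective, x0, Y, X, ctl);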

Returns:

out (struct) –

an instance of a minimizeOut structure. For an instance named out, the members are:

out.par

Kx1 vector, solution parameter values.

out.fct

scalar, objective function value at solution.

out.gradient

Kx1 vector, gradient at solution.

out.retcode

scalar, return code:

0 – Converged successfully.

1 – Maximum iterations exceeded.

2 – Abnormal termination (search direction too small).

3 – Error in problem setup or evaluation.

out.iterations

scalar, number of iterations used.

out.fnEvals

scalar, number of function evaluations.

out.retmsg

string, message describing convergence status.
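
A typical pattern is to check out.retcode before using the solution; a minimal sketch (the objective here is illustrative):

// Simple convex objective for illustration
proc (1) = sq(x);
    retp(sumc(x.^2));
endp;

x0 = { 1, 2 };

struct minimizeOut out;
out = minimize(&sq, x0);

// Verify convergence before trusting out.par
if out.retcode /= 0;
    print "Minimization did not converge: " out.retmsg;
endif;

print "Iterations used: " out.iterations;
print "Function evaluations: " out.fnEvals;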

Examples#

Example 1: Basic unconstrained minimization#

// Rosenbrock function
proc (1) = rosenbrock(x);
    retp((1 - x[1])^2 + 100*(x[2] - x[1]^2)^2);
endp;

// Starting point
x0 = { -1, 1 };

// Minimize
struct minimizeOut out;
out = minimize(&rosenbrock, x0);

print "Solution: " out.par';
print "Objective: " out.fct;

Example 2: With data arguments#

// OLS objective function
proc (1) = ols_objective(beta, Y, X);
    local resid;
    resid = Y - X * beta;
    retp(resid'resid);
endp;

// Generate sample data
X = ones(100, 1) ~ rndn(100, 2);
beta_true = { 1, 2, -1 };
Y = X * beta_true + 0.5*rndn(100, 1);

// Starting values
x0 = zeros(3, 1);

// Minimize - pass Y and X as data arguments
struct minimizeOut out;
out = minimize(&ols_objective, x0, Y, X);

print "Estimated coefficients:";
print out.par';

Example 3: Bound-constrained optimization#

proc (1) = myfunc(x);
    retp(sumc(x.^2));
endp;

x0 = { 5, 5, 5 };

// Set bounds: all parameters in [0, 10]
struct minimizeControl ctl;
ctl = minimizeControlCreate();
ctl.lb = 0;
ctl.ub = 10;

struct minimizeOut out;
out = minimize(&myfunc, x0, ctl);

print "Solution: " out.par';

Example 4: Variable-specific bounds#

proc (1) = myfunc(x);
    retp((x[1] - 2)^2 + (x[2] - 3)^2);
endp;

x0 = { 0, 0 };

// x[1] >= 0, x[2] in [0, 2]
struct minimizeControl ctl;
ctl = minimizeControlCreate();
ctl.lb = { 0, 0 };
ctl.ub = { 1e300, 2 };

struct minimizeOut out;
out = minimize(&myfunc, x0, ctl);

// Unconstrained minimum is (2, 3); the bound x[2] <= 2 gives x = {2, 2}
print "Solution: " out.par';

Remarks#

minimize() uses the L-BFGS-B algorithm, a limited-memory quasi-Newton method for smooth bound-constrained optimization. It is particularly suitable for:

  • Smooth differentiable objective functions

  • Large-scale problems (hundreds to thousands of variables)

  • Simple bound constraints (each parameter has a lower and/or upper bound)

L-BFGS-B approximates the Hessian using a limited number of past gradients (controlled by ctl.m), making it memory-efficient for large problems.
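
For example, increasing ctl.m trades memory for a potentially better Hessian approximation on ill-conditioned problems; a minimal sketch (the objective here is illustrative):

// Diagonally scaled quadratic in 500 variables
proc (1) = bigQuad(x);
    retp(sumc(seqa(1, 1, rows(x)) .* x.^2));
endp;

x0 = ones(500, 1);

// Store 25 corrections instead of the default 10
struct minimizeControl ctl;
ctl = minimizeControlCreate();
ctl.m = 25;

struct minimizeOut out;
out = minimize(&bigQuad, x0, ctl);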

For problems with more complex constraints (linear equality/inequality, nonlinear constraints), use sqpSolveMT() instead.

Gradient computation:

minimize() automatically computes gradients numerically using finite differences. Future versions may support user-provided analytical gradients.
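
One way to gauge the quality of the finite-difference gradient is to compare out.gradient against a hand-coded analytic gradient at the solution; a minimal sketch:

// For f(x) = sumc(x.^2) the analytic gradient is 2*x;
// both should be near zero at the minimum
proc (1) = quad(x);
    retp(sumc(x.^2));
endp;

x0 = { 3, -4 };

struct minimizeOut out;
out = minimize(&quad, x0);

print "Numerical gradient: " out.gradient';
print "Analytic gradient:  " (2*out.par)';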

Convergence criteria:

The algorithm terminates when any of the following conditions is met:

  1. The function value change is small: (f_k - f_{k+1}) / max(|f_k|, |f_{k+1}|, 1) <= factr * machine_epsilon

  2. The projected gradient is small: max_i |pg_i| <= pgtol, where pg is the gradient projected onto the bound constraints

  3. The maximum number of iterations is reached
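
To request a higher-accuracy solution, tighten both tolerances and allow more iterations; a minimal sketch reusing the rosenbrock procedure and starting values from Example 1:

// Tighter convergence settings
struct minimizeControl ctl;
ctl = minimizeControlCreate();
ctl.factr = 1e1;        // high-accuracy function tolerance
ctl.pgtol = 1e-8;       // stricter projected gradient tolerance
ctl.maxIters = 5000;    // allow more iterations

struct minimizeOut out;
out = minimize(&rosenbrock, x0, ctl);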

Starting point:

If the starting point x0 violates any bounds, it is automatically projected into the feasible region before optimization begins.
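
For example, with bounds [0, 3] a start at 8 is moved to 3 before the first iteration; a minimal sketch (the objective here is illustrative):

// Unconstrained minimum is at 5, so the bound-constrained
// solution sits at the upper bound 3
proc (1) = quadShift(x);
    retp(sumc((x - 5).^2));
endp;

// Infeasible start; projected into [0, 3] automatically
x0 = { 8, 8 };

struct minimizeControl ctl;
ctl = minimizeControlCreate();
ctl.lb = 0;
ctl.ub = 3;

struct minimizeOut out;
out = minimize(&quadShift, x0, ctl);

print "Solution: " out.par';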