Linearization of Basic Non-linear Operators

Given a non-linear operator $f$ and a region $R$ , we want to find linear constraints to approximate the non-linear operator, which is in the following form:

$l^{T} x + lb \leq f (x) \leq u^{T} x + ub for any x \in R$

Specifically, given a gbound region $l_x \le x \le u_x, l_y \le y \le u_y, l_d \le y - 𝛽 x \le u_d$, we want to find linear constraints to approximate the non-linear operator $f (x)$ , $f (y)$ and $f (x) - α f (y)$ for any $x, y$ and some $α$ in the gbound region.

Linearization Objective

Different strategies can be used to find the linear constraints. Here we consider the following two objectives:

Minimum $L_{1}$ norm of the bounds: $\int_{R} (u - l)^{T} x + (ub - lb) d x$ .
Minimum $L_{\infty}$ norm of the bounds: $x \in R max (u - l)^{T} x + (ub - lb)$ .

For me, the minimum $L_{1}$ bounds makes more sense because it minimizes the total area between the upper and lower bounds. In the overall algorithm, the bound will be used to restrict different directions, e.g., we may want the minimum and maximum values of $a f (x) + b x$ in bound propagation. The minimum $L_{1}$ bounds have the ability to guarantee the average case.

At the same time, the minimum $L_{1}$ bounds are easier to compute. To do this, we need the convex (concave) envelope of the non-linear operator, which is the tightest convex (concave) function that upper (lower) bounds the non-linear operator, defined as follows:

$f^{\cup} = sup {g ∣ g is convex and g (x) \leq f (x) for any x \in R} f^{\cap} = in f {g ∣ g is concave and g (x) \geq f (x) for any x \in R}$

Then we have the formulas for the minimum $L_{1}$ bounds:

$l = \nabla f^{\cup} (x_{C}) lb = f^{\cup} (x_{C}) - \nabla f^{\cup} (x_{C})^{T} x_{C} u = \nabla f^{\cap} (x_{C}) ub = f^{\cap} (x_{C}) - \nabla f^{\cap} (x_{C})^{T} x_{C}$

where $x_{C}$ is the center of $R$ .

Therefore, in the following sections, we will use the minimum $L_{1}$ bounds as the linearization objective, and we will compute the convex (concave) envelope of the non-linear operator to obtain the linear constraints.

See DeepPoly for more details about the minimum $L_{1}$ bounds.

Linearization Process

Before we go into the details of the linearization of specific non-linear operators, we first need to normalize the gbound to ensure that all bounds are tight. For example, for the gbound of $y$ , we can compute the minimum and maximum values of $y$ in the gbound region, which are $\min\left(u_y, 𝛽 u_x + u_d\right)$ and $\max\left(l_x, 𝛽 l_x + l_d\right)$ , respectively. Then we can update the gbound of $y$ correspondingly.

Even after this, gbound area is still too complex for fast computation, thus we will try to further relax the gbound area to a parallelogram area. Obviously, there are 3 ways to relax the gbound area to a parallelogram area, which are 1) $l_{x} \leq x \leq u_{x}, l_{y} \leq y \leq u_{y}$ , 2) $l_x \le x \le u_x, l_d \le y - 𝛽 x \le u_d$ and 3) $l_y \le y \le u_y, l_d \le y - 𝛽 x \le u_d$.

We choose the parrallelogram area based on these:

For $f (x)$ and $f (y)$ , we will choose the parallelogram area of $l_{x} \leq x \leq u_{x}, l_{y} \leq y \leq u_{y}$ .
For the convex envelope of $f (y) - α f (x)$ , we choose area 2) $l_x \le x \le u_x, l_d \le y - 𝛽 x \le u_d$.
For the concave envelope of $f (y) - α f (x)$ , we choose area 3) $l_y \le y \le u_y, l_d \le y - 𝛽 x \le u_d$.

We do this for simplicity and fast computation, since $α, β > 0$ , case 2 and case 3 are symmetric for computation. It simplifies our algorithm.

Selecting $α$

The selection of $𝛼$ is important for the tightness of the bounds. Normally, we will select $𝛼$ based on the range of $f$ on gbound. We will discuss the selection of 𝛼 for each of the operators.

HexagonDiff Wiki

Linearization of Basic Non-linear Operators

Linearization Objective

Linearization Process

Selecting α

Selecting $α$