Arg max

From HandWiki
Short description: Inputs at which function values are highest
As an example, both unnormalised and normalised sinc functions above have argmax of {0} because both attain their global maximum value of 1 at x = 0.

The unnormalised sinc function (red) has arg min of {−4.49, 4.49}, approximately, because it has 2 global minimum values of approximately −0.217 at x = ±4.49. However, the normalised sinc function (blue) has arg min of {−1.43, 1.43}, approximately, because their global minima occur at x = ±1.43, even though the minimum value is the same.[1]

In mathematics, the arguments of the maxima (abbreviated arg max or argmax) and arguments of the minima (abbreviated arg min or argmin) are the input points at which a function output value is maximized and minimized, respectively.[note 1] While the arguments are defined over the domain of a function, the output is part of its codomain.

Definition

Given an arbitrary set X, a totally ordered set Y, and a function, f:XY, the argmax over some subset S of X is defined by

argmaxSf:=argmaxxSf(x):={xS:f(s)f(x) for all sS}.

If S=X or S is clear from the context, then S is often left out, as in argmaxxf(x):={x:f(s)f(x) for all sX}. In other words, argmax is the set of points x for which f(x) attains the function's largest value (if it exists). Argmax may be the empty set, a singleton, or contain multiple elements.

In the fields of convex analysis and variational analysis, a slightly different definition is used in the special case where Y=[,]={±} are the extended real numbers.[2] In this case, if f is identically equal to on S then argmaxSf:= (that is, argmaxS:=) and otherwise argmaxSf is defined as above, where in this case argmaxSf can also be written as:

argmaxSf:={xS:f(x)=supSf}

where it is emphasized that this equality involving supSf holds only when f is not identically on S.[2]

Arg min

The notion of argmin (or argmin), which stands for argument of the minimum, is defined analogously. For instance,

argminxSf(x):={xS:f(s)f(x) for all sS}

are points x for which f(x) attains its smallest value. It is the complementary operator of argmax.

In the special case where Y=[,]={±} are the extended real numbers, if f is identically equal to on S then argminSf:= (that is, argminS:=) and otherwise argminSf is defined as above and moreover, in this case (of f not identically equal to ) it also satisfies:

argminSf:={xS:f(x)=infSf}.[2]

Examples and properties

For example, if f(x) is 1|x|, then f attains its maximum value of 1 only at the point x=0. Thus

argmaxx(1|x|)={0}.

The argmax operator is different from the max operator. The max operator, when given the same function, returns the maximum value of the function instead of the point or points that cause that function to reach that value; in other words

maxxf(x) is the element in {f(x):f(s)f(x) for all sS}.

Like argmax, max may be the empty set (in which case the maximum is undefined) or a singleton, but unlike argmax, max may not contain multiple elements:[note 2] for example, if f(x) is 4x2x4, then argmaxx(4x2x4)={2,2}, but maxx(4x2x4)={4} because the function attains the same value at every element of argmax.

Equivalently, if M is the maximum of f, then the argmax is the level set of the maximum:

argmaxxf(x)={x:f(x)=M}=:f1(M).

We can rearrange to give the simple identity[note 3]

f(argmaxxf(x))=maxxf(x).

If the maximum is reached at a single point then this point is often referred to as the argmax, and argmax is considered a point, not a set of points. So, for example,

argmaxx(x(10x))=5

(rather than the singleton set {5}), since the maximum value of x(10x) is 25, which occurs for x=5.[note 4] However, in case the maximum is reached at many points, argmax needs to be considered a set of points.

For example

argmaxx[0,4π]cos(x)={0,2π,4π}

because the maximum value of cosx is 1, which occurs on this interval for x=0,2π or 4π. On the whole real line

argmaxxcos(x)={2kπ:k}, so an infinite set.

Functions need not in general attain a maximum value, and hence the argmax is sometimes the empty set; for example, argmaxxx3=, since x3 is unbounded on the real line. As another example, argmaxxarctan(x)=, although arctan is bounded by ±π/2. However, by the extreme value theorem, a continuous real-valued function on a closed interval has a maximum, and thus a nonempty argmax.

See also

Notes

  1. For clarity, we refer to the input (x) as points and the output (y) as values; compare critical point and critical value.
  2. Due to the anti-symmetry of , a function can have at most one maximal value.
  3. This is an identity between sets, more particularly, between subsets of Y.
  4. Note that x(10x)=25(x5)225 with equality if and only if x5=0.

References

  1. "The Unnormalized Sinc Function ", University of Sydney
  2. 2.0 2.1 2.2 Rockafellar & Wets 2009, pp. 1-37.