#jsDisabledContent { display:none; } My Account |  Register |  Help

# Indicator function

Article Id: WHEBN0000240790
Reproduction Date:

 Title: Indicator function Author: World Heritage Encyclopedia Language: English Subject: Collection: Publisher: World Heritage Encyclopedia Publication Date:

### Indicator function

The graph of the indicator function of a two-dimensional subset of a square.

In mathematics, an indicator function or a characteristic function is a function defined on a set X that indicates membership of an element in a subset A of X, having the value 1 for all elements of A and the value 0 for all elements of X not in A. It is usually denoted by a bold or blackboard bold 1 symbol with a subscript describing the event of inclusion.

## Contents

• Definition 1
• Remark on notation and terminology 2
• Basic properties 3
• Mean, variance and covariance 4
• Characteristic function in recursion theory, Gödel's and Kleene's representing function 5
• Characteristic function in fuzzy set theory 6
• Derivatives of the indicator function 7
• Notes 9
• References 10

## Definition

The indicator function of a subset A of a set X is a function

\mathbf{1}_A \colon X \to \{ 0,1 \} \,

defined as

\mathbf{1}_A(x) := \begin{cases} 1 &\text{if } x \in A, \\ 0 &\text{if } x \notin A. \end{cases}

The Iverson bracket allows the equivalent notation, [x\in A], to be used instead of \mathbf{1}_A(x).

The function \mathbf{1}_A is sometimes denoted I_A, \chi_A or even just A. (The Greek letter \chi appears because it is the initial letter of the Greek word characteristic.)

## Remark on notation and terminology

A related concept in statistics is that of a dummy variable. (This must not be confused with "dummy variables" as that term is usually used in mathematics, also called a bound variable.)

The term "characteristic function" has an unrelated meaning in probability theory. For this reason, probabilists use the term indicator function for the function defined here almost exclusively, while mathematicians in other fields are more likely to use the term characteristic function to describe the function that indicates membership in a set.

## Basic properties

The indicator or characteristic function of a subset A of some set X, maps elements of X to the range {0,1}.

This mapping is surjective only when A is a non-empty proper subset of X. If AX, then 1A = 1. By a similar argument, if A ≡ Ø then 1A = 0.

In the following, the dot represents multiplication, 1·1 = 1, 1·0 = 0 etc. "+" and "−" represent addition and subtraction. "\cap " and "\cup " is intersection and union, respectively.

If A and B are two subsets of X, then

\mathbf{1}_{A\cap B} = \min\{\mathbf{1}_A,\mathbf{1}_B\} = \mathbf{1}_A \cdot\mathbf{1}_B,
\mathbf{1}_{A\cup B} = \max\ (-1)^{|F|} \mathbf{1}_{\bigcap_F A_k} = \sum_{\emptyset \neq F \subseteq \{1, 2, \dotsc, n\}} (-1)^{|F|+1} \mathbf{1}_{\bigcap_F A_k}

where |F| is the cardinality of F. This is one form of the principle of inclusion-exclusion.

As suggested by the previous example, the indicator function is a useful notational device in combinatorics. The notation is used in other places as well, for instance in probability theory: if X is a probability space with probability measure \mathbb{P} and A is a measurable set, then \mathbf{1}_A becomes a random variable whose expected value is equal to the probability of A:

\operatorname{E}(\mathbf{1}_A)= \int_{X} \mathbf{1}_A(x)\,d\mathbb{P} = \int_{A} d\mathbb{P} = \operatorname{P}(A).

This identity is used in a simple proof of Markov's inequality.

In many cases, such as order theory, the inverse of the indicator function may be defined. This is commonly called the generalized Möbius function, as a generalization of the inverse of the indicator function in elementary number theory, the Möbius function. (See paragraph below about the use of the inverse in classical recursion theory.)

## Mean, variance and covariance

Given a probability space \textstyle (\Omega, \mathcal F, \mathbb P) with A \in \mathcal F, the indicator random variable \mathbf{1}_A \colon \Omega \rightarrow \Bbb{R} is defined by \mathbf{1}_A (\omega) = 1 if \omega \in A, otherwise \mathbf{1}_A (\omega) = 0.

Mean
\operatorname{E}(\mathbf{1}_A (\omega)) = \operatorname{P}(A)
Variance
\operatorname{Var}(\mathbf{1}_A (\omega)) = \operatorname{P}(A)(1 - \operatorname{P}(A))
Covariance
\operatorname{Cov}(\mathbf{1}_A (\omega), \mathbf{1}_B (\omega)) = \operatorname{P}(A \cap B) - \operatorname{P}(A)\operatorname{P}(B)

## Characteristic function in recursion theory, Gödel's and Kleene's representing function

Kurt Gödel described the representing function in his 1934 paper "On Undecidable Propositions of Formal Mathematical Systems". (The paper appears on pp. 41–74 in Martin Davis ed. The Undecidable):

"There shall correspond to each class or relation R a representing function φ(x1, . . ., xn) = 0 if R(x1, . . ., xn) and φ(x1, . . ., xn) = 1 if ~R(x1, . . ., xn)." (p. 42; the "~" indicates logical inversion i.e. "NOT")

Stephen Kleene (1952) (p. 227) offers up the same definition in the context of the primitive recursive functions as a function φ of a predicate P takes on values 0 if the predicate is true and 1 if the predicate is false.

For example, because the product of characteristic functions φ12* . . . *φn = 0 whenever any one of the functions equals 0, it plays the role of logical OR: IF φ1 = 0 OR φ2 = 0 OR . . . OR φn = 0 THEN their product is 0. What appears to the modern reader as the representing function's logical inversion, i.e. the representing function is 0 when the function R is "true" or satisfied", plays a useful role in Kleene's definition of the logical functions OR, AND, and IMPLY (p. 228), the bounded- (p. 228) and unbounded- (p. 279ff) mu operators (Kleene (1952)) and the CASE function (p. 229).

## Characteristic function in fuzzy set theory

In classical mathematics, characteristic functions of sets only take values 1 (members) or 0 (non-members). In fuzzy set theory, characteristic functions are generalized to take value in the real unit interval [0, 1], or more generally, in some algebra or structure (usually required to be at least a poset or lattice). Such generalized characteristic functions are more usually called membership functions, and the corresponding "sets" are called fuzzy sets. Fuzzy sets model the gradual change in the membership degree seen in many real-world predicates like "tall", "warm", etc.

## Derivatives of the indicator function

A particular indicator function, which is very well known, is the Heaviside step function. The Heaviside step function is the indicator function of the one-dimensional positive half-line, i.e. the domain [0, ∞). It is well known that the distributional derivative of the Heaviside step function, indicated by H(x), is equal to the Dirac delta function, i.e.

\delta(x)=\tfrac{d H(x)}{dx},

with the following property:

\int_{-\infty}^\infty f(x) \, \delta(x) dx = f(0).

The derivative of the Heaviside step function can be seen as the 'inward normal derivative' at the 'boundary' of the domain given by the positive half-line. In higher dimensions, the derivative naturally generalises to the inward normal derivative, while the Heaviside step function naturally generalises to the indicator function of some domain D. The surface of D will be denoted by S. Proceeding, it can be derived that the inward normal derivative of the indicator gives rise to a 'surface delta function', which can be indicated by δS(x):

\delta_S(\mathbf{x})=-\mathbf{n}_x\cdot\nabla_x\mathbf{1}_{\mathbf{x}\in D}

where n is the outward normal of the surface S. This 'surface delta function' has the following property:[1]

-\int_{\mathbf{R}^n}f(\mathbf{x})\,\mathbf{n}_x\cdot\nabla_x\mathbf{1}_{\mathbf{x}\in D}\;d^{n}\mathbf{x}=\oint_{S}\,f(\mathbf{\beta})\;d^{n-1}\mathbf{\beta}.

By setting the function f equal to one, it follows that the inward normal derivative of the indicator integrates to the numerical value of the surface area S.

## Notes

1. ^ Lange, Rutger-Jan (2012), "Potential theory, path integrals and the Laplacian of the indicator", Journal of High Energy Physics (Springer) 2012 (11): 29–30,

## References

• Folland, G.B. (1999). Real Analysis: Modern Techniques and Their Applications (Second ed.). John Wiley & Sons, Inc.
This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and USA.gov, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for USA.gov and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.

Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.