|
Differential entropy (also referred to as continuous entropy) is a concept in information theory that attempts to extend the idea of (Shannon) entropy, a measure of the average surprisal of a random variable, to continuous probability distributions.
Definition
Let X be a random variable with a probability density function f whose support is a set $\mathbb{X}$. The differential entropy h(X) or h(f) is defined as

$$h(X) = -\int_{\mathbb{X}} f(x) \log f(x)\,dx.$$

As with its discrete analog, the units of differential entropy depend on the base of the logarithm, which is usually 2 (i.e., the units are bits). See logarithmic units for logarithms taken in different bases. Related concepts such as joint differential entropy, conditional differential entropy, and relative entropy are defined in a similar fashion. One must take care in trying to apply properties of discrete entropy to differential entropy, since probability density functions can be greater than 1. For example, Uniform(0,1/2) has negative differential entropy: $\int_0^{1/2} -2 \log 2\,dx = -\log 2$.
The definition of differential entropy above can be obtained by partitioning the range of X into bins of length Δ with associated sample points iΔ within the bins, for f Riemann integrable. This gives a quantized version of X, defined by $X_\Delta = i\Delta$ if $i\Delta \le X < (i+1)\Delta$. Approximating the probability of the i-th bin by $f(i\Delta)\Delta$, the entropy of $X_\Delta$ is

$$H(X_\Delta) = -\sum_i f(i\Delta)\Delta \log f(i\Delta) - \sum_i f(i\Delta)\Delta \log \Delta.$$

The first term approximates the differential entropy, while the second term is approximately −log(Δ). Note that this procedure suggests that the differential entropy of a discrete random variable should be $-\infty$.

The continuous mutual information I(X;Y), by contrast, retains its fundamental significance as a measure of discrete information, since it is the limit of the discrete mutual information of partitions of X and Y as these partitions become finer and finer. It is therefore invariant under quite general transformations of X and Y, and it still represents the amount of discrete information that can be transmitted over a channel that admits a continuous space of values.
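The relation between the entropy of the quantized variable and the differential entropy can be illustrated numerically. The sketch below assumes X is exponential with rate 1 (so h(X) = 1 nat) and uses the exact bin probabilities rather than the approximation f(iΔ)Δ; the function name and the truncation point `tail` are illustrative choices.

```python
import math

def quantized_entropy_exponential(delta, lam=1.0, tail=50.0):
    """Entropy in nats of X_delta for X ~ Exponential(lam),
    using bins [i*delta, (i+1)*delta) truncated at x = tail/lam."""
    n_bins = int(tail / (lam * delta))
    # Probability of bin i is exp(-lam*i*delta) * (1 - exp(-lam*delta)).
    bin_factor = -math.expm1(-lam * delta)
    h = 0.0
    for i in range(n_bins):
        p = math.exp(-lam * i * delta) * bin_factor
        if p > 0.0:
            h -= p * math.log(p)
    return h

# H(X_delta) + log(delta) should approach h(X) = 1 nat as delta shrinks.
results = {d: quantized_entropy_exponential(d) + math.log(d)
           for d in (0.5, 0.1, 0.01)}
for d, value in results.items():
    print(d, value)
```

The discrete entropy itself grows without bound as Δ shrinks; only after adding log(Δ) does the quantity settle near the differential entropy.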
Properties of differential entropy
- For two densities f and g, $D(f\|g) \ge 0$, with equality if and only if f = g almost everywhere. Similarly, for two random variables X and Y, $I(X;Y) \ge 0$ and $h(X|Y) \le h(X)$, with equality if and only if X and Y are independent.
- The chain rule for differential entropy holds as in the discrete case: $h(X_1, \ldots, X_n) = \sum_{i=1}^{n} h(X_i | X_1, \ldots, X_{i-1}) \le \sum_{i=1}^{n} h(X_i)$.
- Differential entropy is translation invariant, i.e., h(X + c) = h(X) for a constant c.
- Differential entropy is in general not invariant under arbitrary invertible maps. In particular, for a nonzero constant a, $h(aX) = h(X) + \log|a|$. For a vector-valued random variable X and an invertible square matrix A, $h(AX) = h(X) + \log|\det A|$.
- If a random vector $X \in \mathbb{R}^n$ has mean zero and covariance matrix K, then $h(X) \le \frac{1}{2}\log\left[(2\pi e)^n \det K\right]$, with equality if and only if X is jointly Gaussian.
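Two of these properties can be spot-checked in one dimension with closed-form entropies. The sketch below assumes the standard expressions in nats for the normal distribution, $\ln(\sigma\sqrt{2\pi e})$, and for the Laplace distribution with scale λ, $1 + \ln(2\lambda)$; the function names and the parameter values are illustrative.

```python
import math

def normal_entropy(sigma):
    """Differential entropy in nats of Normal(mu, sigma^2)."""
    return math.log(sigma * math.sqrt(2.0 * math.pi * math.e))

def laplace_entropy(lam):
    """Differential entropy in nats of a Laplace distribution with scale lam."""
    return 1.0 + math.log(2.0 * lam)

def gaussian_bound(variance):
    """Upper bound (1/2) log(2 pi e K) for a scalar with variance K."""
    return 0.5 * math.log(2.0 * math.pi * math.e * variance)

# Scaling: if X ~ Normal(0, sigma^2) then aX ~ Normal(0, (|a| sigma)^2),
# so h(aX) - h(X) - log|a| should vanish.
sigma, a = 1.5, -3.0
scaling_gap = normal_entropy(abs(a) * sigma) - (normal_entropy(sigma) + math.log(abs(a)))

# Maximum entropy: a Laplace variable with scale lam has variance 2 * lam^2
# and must fall strictly below the Gaussian bound for that variance.
lam = 0.7
slack = gaussian_bound(2.0 * lam ** 2) - laplace_entropy(lam)

print(scaling_gap, slack)
```

The slack works out to $\frac{1}{2}(\ln\pi - 1)$ regardless of λ, which is consistent with both sides shifting by the same amount under rescaling.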
Example: Exponential distribution
Let X be an exponentially distributed random variable with parameter λ, that is, with probability density function

$$f(x) = \lambda e^{-\lambda x} \quad \text{for } x \ge 0.$$

Its differential entropy is then

$$h_e(X) = -\int_0^\infty \lambda e^{-\lambda x} \ln\left(\lambda e^{-\lambda x}\right) dx = -\ln\lambda \int_0^\infty \lambda e^{-\lambda x}\,dx + \lambda \int_0^\infty x\,\lambda e^{-\lambda x}\,dx = -\ln\lambda + \lambda \cdot \frac{1}{\lambda} = 1 - \ln\lambda.$$

Here, $h_e(X)$ was used rather than h(X) to make it explicit that the logarithm was taken to base e, which simplifies the calculation.
Differential entropies for various distributions
In the table below, $\Gamma(x) = \int_0^\infty e^{-t} t^{x-1}\,dt$ (the gamma function), $\psi(x) = \frac{d}{dx} \ln \Gamma(x)$ (the digamma function), $B(p,q) = \frac{\Gamma(p)\Gamma(q)}{\Gamma(p+q)}$ (the beta function), and γ is Euler's constant.
Table of differential entropies.

| Distribution name | Probability density function (pdf) | Entropy in nats |
| --- | --- | --- |
| Uniform | $f(x) = \frac{1}{b-a}$ for $a \le x \le b$ | $\ln(b-a)$ |
| Normal | $f(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$ | $\ln\left(\sigma\sqrt{2\pi e}\right)$ |
| Exponential | $f(x) = \lambda \exp(-\lambda x)$ | $1 - \ln\lambda$ |
| Rayleigh | $f(x) = \frac{x}{b^2} \exp\left(-\frac{x^2}{2b^2}\right)$ | $1 + \ln\frac{b}{\sqrt{2}} + \frac{\gamma}{2}$ |
| Beta | $f(x) = \frac{x^{p-1}(1-x)^{q-1}}{B(p,q)}$ for $0 \le x \le 1$ | $\ln B(p,q) - (p-1)[\psi(p) - \psi(p+q)] - (q-1)[\psi(q) - \psi(p+q)]$ |
| Cauchy | $f(x) = \frac{\lambda}{\pi} \frac{1}{\lambda^2 + x^2}$ | $\ln(4\pi\lambda)$ |
| Chi | $f(x) = \frac{2}{2^{n/2} \sigma^n \Gamma(n/2)} x^{n-1} \exp\left(-\frac{x^2}{2\sigma^2}\right)$ | $\ln\frac{\sigma\Gamma(n/2)}{\sqrt{2}} - \frac{n-1}{2}\psi\left(\frac{n}{2}\right) + \frac{n}{2}$ |
| Chi-squared | $f(x) = \frac{1}{2^{n/2} \sigma^n \Gamma(n/2)} x^{\frac{n}{2}-1} \exp\left(-\frac{x}{2\sigma^2}\right)$ | |
| Erlang | $f(x) = \frac{\lambda^n}{(n-1)!} x^{n-1} \exp(-\lambda x)$ | $(1-n)\psi(n) + \ln\frac{\Gamma(n)}{\lambda} + n$ |
| F | $f(x) = \frac{n_1^{n_1/2} n_2^{n_2/2}}{B\left(\frac{n_1}{2},\frac{n_2}{2}\right)} \frac{x^{\frac{n_1}{2}-1}}{(n_2 + n_1 x)^{\frac{n_1+n_2}{2}}}$ | |
| Gamma | $f(x) = \frac{x^{\alpha-1} \exp\left(-\frac{x}{\beta}\right)}{\beta^\alpha \Gamma(\alpha)}$ | $\ln(\beta\Gamma(\alpha)) + (1-\alpha)\psi(\alpha) + \alpha$ |
| Laplace | $f(x) = \frac{1}{2\lambda} \exp\left(-\frac{\lvert x-\theta \rvert}{\lambda}\right)$ | $1 + \ln(2\lambda)$ |
| Logistic | $f(x) = \frac{e^{-x}}{(1+e^{-x})^2}$ | $2$ |
| Lognormal | $f(x) = \frac{1}{\sigma x \sqrt{2\pi}} \exp\left(-\frac{(\ln x - m)^2}{2\sigma^2}\right)$ | $m + \frac{1}{2}\ln(2\pi e \sigma^2)$ |
| Maxwell-Boltzmann | $f(x) = 4\pi^{-\frac{1}{2}} \beta^{\frac{3}{2}} x^2 \exp(-\beta x^2)$ | $\frac{1}{2}\ln\frac{\pi}{\beta} + \gamma - \frac{1}{2}$ |
| Generalized normal | $f(x) = \frac{2\beta^{\frac{\alpha}{2}}}{\Gamma\left(\frac{\alpha}{2}\right)} x^{\alpha-1} \exp(-\beta x^2)$ | $\ln\frac{\Gamma(\alpha/2)}{2\beta^{\frac{1}{2}}} - \frac{\alpha-1}{2}\psi\left(\frac{\alpha}{2}\right) + \frac{\alpha}{2}$ |
| Pareto | $f(x) = \frac{a k^a}{x^{a+1}}$ | $\ln\frac{k}{a} + 1 + \frac{1}{a}$ |
| Student's t | $f(x) = \frac{(1 + x^2/n)^{-\frac{n+1}{2}}}{\sqrt{n}\,B\left(\frac{1}{2},\frac{n}{2}\right)}$ | $\frac{n+1}{2}\left[\psi\left(\frac{n+1}{2}\right) - \psi\left(\frac{n}{2}\right)\right] + \ln\left[\sqrt{n}\,B\left(\frac{1}{2},\frac{n}{2}\right)\right]$ |
| Triangular | $f(x) = \begin{cases} \frac{2x}{a} & 0 \le x \le a \\ \frac{2(1-x)}{1-a} & a \le x \le 1 \end{cases}$ | $\frac{1}{2} - \ln 2$ |
| Weibull | $f(x) = \frac{c}{\alpha} x^{c-1} \exp\left(-\frac{x^c}{\alpha}\right)$ | $\frac{(c-1)\gamma}{c} + \ln\frac{\alpha^{1/c}}{c} + 1$ |
| Multivariate normal | $f_X(x_1, \dots, x_N) = \frac{1}{(2\pi)^{N/2} \lvert\Sigma\rvert^{1/2}} \exp\left(-\frac{1}{2}(x-\mu)^\top \Sigma^{-1} (x-\mu)\right)$ | $\frac{1}{2}\ln\left[(2\pi e)^N \det(\Sigma)\right]$ |
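A few closed forms of this kind can be compared against direct numerical integration of $-\int f \ln f$. The sketch below covers the Rayleigh (b = 1), Laplace (λ = 1, θ = 0) and logistic densities, assuming a midpoint rule on a truncated interval is adequate for these light-tailed cases; all function names and integration limits are illustrative choices.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant, written out by hand

def entropy_nats(pdf, lo, hi, n=200_000):
    """Midpoint-rule estimate of -integral of f ln f over [lo, hi]."""
    dx = (hi - lo) / n
    total = 0.0
    for i in range(n):
        fx = pdf(lo + (i + 0.5) * dx)
        if fx > 0.0:  # skip points where the density underflows to 0
            total -= fx * math.log(fx) * dx
    return total

def rayleigh_pdf(x, b=1.0):
    return x / b ** 2 * math.exp(-x ** 2 / (2.0 * b ** 2))

def laplace_pdf(x, lam=1.0, theta=0.0):
    return math.exp(-abs(x - theta) / lam) / (2.0 * lam)

def logistic_pdf(x):
    return math.exp(-x) / (1.0 + math.exp(-x)) ** 2

# Each entry pairs a numerical estimate with the closed-form entropy in nats.
checks = {
    "Rayleigh": (entropy_nats(rayleigh_pdf, 0.0, 12.0),
                 1.0 + math.log(1.0 / math.sqrt(2.0)) + EULER_GAMMA / 2.0),
    "Laplace": (entropy_nats(laplace_pdf, -40.0, 40.0),
                1.0 + math.log(2.0)),
    "Logistic": (entropy_nats(logistic_pdf, -40.0, 40.0), 2.0),
}
for name, (numeric, closed_form) in checks.items():
    print(name, numeric, closed_form)
```

Heavy-tailed entries such as the Cauchy distribution would need a different truncation strategy, so they are left out of this sketch.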
See also
- Information entropy
- Information theory
- Self-information
- Kullback-Leibler divergence
|