# Integral

In mathematics, an **integral** assigns numbers to functions in a way that describes displacement, area, volume, and other concepts that arise by combining infinitesimal data. The process of finding integrals is called **integration**. Along with differentiation, integration is a fundamental, essential operation of calculus,^{[a]} and serves as a tool to solve problems in mathematics and physics involving the area of an arbitrary shape, the length of a curve, and the volume of a solid, among others.

The integrals enumerated here are those termed **definite integrals**, which can be interpreted formally as the signed area of the region in the plane that is bounded by the graph of a given function between two points in the real line. Conventionally, areas above the horizontal axis of the plane are positive while areas below are negative. Integrals also refer to the concept of an antiderivative, a function whose derivative is the given function. In this case, they are called **indefinite integrals**. The fundamental theorem of calculus relates definite integrals with differentiation and provides a method to compute the definite integral of a function when its antiderivative is known.

Although methods of calculating areas and volumes dated from ancient Greek mathematics, the principles of integration were formulated independently by Isaac Newton and Gottfried Wilhelm Leibniz in the late 17th century, who thought of the area under a curve as an infinite sum of rectangles of infinitesimal width. Bernhard Riemann later gave a rigorous definition of integrals, which is based on a limiting procedure that approximates the area of a curvilinear region by breaking the region into thin vertical slabs.

Integrals may be generalized depending on the type of the function as well as the domain over which the integration is performed. For example, a line integral is defined for functions of two or more variables, and the interval of integration is replaced by a curve connecting the two endpoints of the interval. In a surface integral, the curve is replaced by a piece of a surface in three-dimensional space.

The first documented systematic technique capable of determining integrals is the method of exhaustion of the ancient Greek astronomer Eudoxus (*ca.* 370 BC), which sought to find areas and volumes by breaking them up into an infinite number of divisions for which the area or volume was known.^{[1]} This method was further developed and employed by Archimedes in the 3rd century BC and used to calculate the area of a circle, the surface area and volume of a sphere, area of an ellipse, the area under a parabola, the volume of a segment of a paraboloid of revolution, the volume of a segment of a hyperboloid of revolution, and the area of a spiral.^{[2]}

A similar method was independently developed in China around the 3rd century AD by Liu Hui, who used it to find the area of the circle. This method was later used in the 5th century by Chinese father-and-son mathematicians Zu Chongzhi and Zu Geng to find the volume of a sphere.^{[3]}

In the Middle East, Hasan Ibn al-Haytham, Latinized as Alhazen (c. 965 – c. 1040 AD) derived a formula for the sum of fourth powers.^{[4]} He used the results to carry out what would now be called an integration of this function, where the formulae for the sums of integral squares and fourth powers allowed him to calculate the volume of a paraboloid.^{[5]}

The next significant advances in integral calculus did not begin to appear until the 17th century. At this time, the work of Cavalieri with his method of Indivisibles, and work by Fermat, began to lay the foundations of modern calculus,^{[6]} with Cavalieri computing the integrals of *x*^{n} up to degree *n* = 9 in Cavalieri's quadrature formula.^{[7]} Further steps were made in the early 17th century by Barrow and Torricelli, who provided the first hints of a connection between integration and differentiation. Barrow provided the first proof of the fundamental theorem of calculus.^{[8]} Wallis generalized Cavalieri's method, computing integrals of x to a general power, including negative powers and fractional powers.^{[9]}

The major advance in integration came in the 17th century with the independent discovery of the fundamental theorem of calculus by Leibniz and Newton.^{[10]} The theorem demonstrates a connection between integration and differentiation. This connection, combined with the comparative ease of differentiation, can be exploited to calculate integrals. In particular, the fundamental theorem of calculus allows one to solve a much broader class of problems. Equal in importance is the comprehensive mathematical framework that both Leibniz and Newton developed. Given the name infinitesimal calculus, it allowed for precise analysis of functions within continuous domains. This framework eventually became modern calculus, whose notation for integrals is drawn directly from the work of Leibniz.

While Newton and Leibniz provided a systematic approach to integration, their work lacked a degree of rigour. Bishop Berkeley memorably attacked the vanishing increments used by Newton, calling them "ghosts of departed quantities".^{[11]} Calculus acquired a firmer footing with the development of limits. Integration was first rigorously formalized, using limits, by Riemann.^{[12]} Although all bounded piecewise continuous functions are Riemann-integrable on a bounded interval, subsequently more general functions were considered—particularly in the context of Fourier analysis—to which Riemann's definition does not apply, and Lebesgue formulated a different definition of integral, founded in measure theory (a subfield of real analysis). Other definitions of integral, extending Riemann's and Lebesgue's approaches, were proposed. These approaches based on the real number system are the ones most common today, but alternative approaches exist, such as a definition of integral as the standard part of an infinite Riemann sum, based on the hyperreal number system.

The notation for the indefinite integral was introduced by Gottfried Wilhelm Leibniz in 1675.^{[13]} He adapted the integral symbol, **∫**, from the letter *ſ* (long s), standing for *summa* (written as *ſumma*; Latin for "sum" or "total"). The modern notation for the definite integral, with limits above and below the integral sign, was first used by Joseph Fourier in *Mémoires* of the French Academy around 1819–20, reprinted in his book of 1822.^{[14]}

Isaac Newton used a small vertical bar above a variable to indicate integration, or placed the variable inside a box. The vertical bar was easily confused with **.***x* or *x*′, which are used to indicate differentiation, and the box notation was difficult for printers to reproduce, so these notations were not widely adopted.^{[15]}

The term was first printed in Latin by Jacob Bernoulli in 1690: "Ergo et horum Integralia aequantur".^{[16]}

In general, the integral of a real-valued function *f*(*x*) with respect to a real variable *x* on an interval [*a*, *b*] is written as

The integral sign ∫ represents integration. The symbol *dx*, called the differential of the variable *x*, indicates that the variable of integration is *x*. The function *f*(*x*) is called the integrand, the points *a* and *b* are called the limits of integration, and the integral is said to be over the interval [*a*, *b*], called the interval of integration.^{[17]} A function is said to be *integrable* if its integral over its domain is finite, and when limits are specified, the integral is called a definite integral.

the integral is called an indefinite integral, which represents a class of functions (the antiderivative) whose derivative is the integrand.^{[18]} The fundamental theorem of calculus relates the evaluation of definite integrals to indefinite integrals. There are several extensions of the notation for integrals to encompass integration on unbounded domains and/or in multiple dimensions (see later sections of this article).

Integrals appear in many practical situations. For instance, from the length, width and depth of a swimming pool which is rectangular with a flat bottom, one can determine the volume of water it can contain, the area of its surface, and the length of its edge. But if it is oval with a rounded bottom, integrals are required to find exact and rigorous values for these quantities. In each case, one may divide the sought quantity into infinitely many infinitesimal pieces, then sum the pieces to achieve an accurate approximation.

For example, to find the area of the region bounded by the graph of the function *f*(*x*) = √*x* between *x* = 0 and *x* = 1, one can cross the interval in five steps (0, 1/5, 2/5, ..., 1), then fill a rectangular using the right end height of each piece (thus √0, √1/5, √2/5, ..., √1) and sum their areas to get an approximation of

which is larger than the exact value. Alternatively, when replacing these subintervals by ones with the left end height of each piece, the approximation one gets is too low: with twelve such subintervals the approximated area is only 0.6203. However, when the number of pieces increase to infinity, it will reach a limit which is the exact value of the area sought (in this case, 2/3). One writes

which means 2/3 is the result of a weighted sum of function values, √*x*, multiplied by the infinitesimal step widths, denoted by *dx*, on the interval [0, 1].

There are many ways of formally defining an integral, not all of which are equivalent. The differences exist mostly to deal with differing special cases which may not be integrable under other definitions, but also occasionally for pedagogical reasons. The most commonly used definitions are Riemann integrals and Lebesgue integrals.

The Riemann integral is defined in terms of Riemann sums of functions with respect to *tagged partitions* of an interval.^{[20]} A tagged partition of a closed interval [*a*, *b*] on the real line is a finite sequence

This partitions the interval [*a*, *b*] into n sub-intervals [*x*_{i−1}, *x*_{i}] indexed by i, each of which is "tagged" with a distinguished point *t*_{i} ∈ [*x*_{i−1}, *x*_{i}]. A *Riemann sum* of a function f with respect to such a tagged partition is defined as

thus each term of the sum is the area of a rectangle with height equal to the function value at the distinguished point of the given sub-interval, and width the same as the width of sub-interval, Δ_{i} = *x*_{i}−*x*_{i−1}. The *mesh* of such a tagged partition is the width of the largest sub-interval formed by the partition, max_{i=1...n} Δ_{i}. The *Riemann integral* of a function f over the interval [*a*, *b*] is equal to S if:^{[21]}

When the chosen tags give the maximum (respectively, minimum) value of each interval, the Riemann sum becomes an upper (respectively, lower) Darboux sum, suggesting the close connection between the Riemann integral and the Darboux integral.

It is often of interest, both in theory and applications, to be able to pass to the limit under the integral. For instance, a sequence of functions can frequently be constructed that approximate, in a suitable sense, the solution to a problem. Then the integral of the solution function should be the limit of the integrals of the approximations. However, many functions that can be obtained as limits are not Riemann-integrable, and so such limit theorems do not hold with the Riemann integral. Therefore, it is of great importance to have a definition of the integral that allows a wider class of functions to be integrated.^{[22]}

Such an integral is the Lebesgue integral, that exploits the following fact to enlarge the class of integrable functions: if the values of a function are rearranged over the domain, the integral of a function should remain the same. Thus Henri Lebesgue introduced the integral bearing his name, explaining this integral thus in a letter to Paul Montel:^{[23]}

I have to pay a certain sum, which I have collected in my pocket. I take the bills and coins out of my pocket and give them to the creditor in the order I find them until I have reached the total sum. This is the Riemann integral. But I can proceed differently. After I have taken all the money out of my pocket I order the bills and coins according to identical values and then I pay the several heaps one after the other to the creditor. This is my integral.

As Folland puts it, "To compute the Riemann integral of f, one partitions the domain [*a*, *b*] into subintervals", while in the Lebesgue integral, "one is in effect partitioning the range of f ".^{[24]} The definition of the Lebesgue integral thus begins with a measure, μ. In the simplest case, the Lebesgue measure *μ*(*A*) of an interval *A* = [*a*, *b*] is its width, *b* − *a*, so that the Lebesgue integral agrees with the (proper) Riemann integral when both exist.^{[25]} In more complicated cases, the sets being measured can be highly fragmented, with no continuity and no resemblance to intervals.

where the integral on the right is an ordinary improper Riemann integral (*f*^{∗} is a strictly decreasing positive function, and therefore has a well-defined improper Riemann integral).^{[26]} For a suitable class of functions (the measurable functions) this defines the Lebesgue integral.

A general measurable function f is Lebesgue-integrable if the sum of the absolute values of the areas of the regions between the graph of f and the x-axis is finite:^{[27]}

In that case, the integral is, as in the Riemannian case, the difference between the area above the x-axis and the area below the x-axis:^{[28]}

Although the Riemann and Lebesgue integrals are the most widely used definitions of the integral, a number of others exist, including:

The collection of Riemann-integrable functions on a closed interval [*a*, *b*] forms a vector space under the operations of pointwise addition and multiplication by a scalar, and the operation of integration

is a linear functional on this vector space. Thus, the collection of integrable functions is closed under taking linear combinations, and the integral of a linear combination is the linear combination of the integrals:^{[29]}

Similarly, the set of real-valued Lebesgue-integrable functions on a given measure space E with measure μ is closed under taking linear combinations and hence form a vector space, and the Lebesgue integral

More generally, consider the vector space of all measurable functions on a measure space (*E*,*μ*), taking values in a locally compact complete topological vector space V over a locally compact topological field *K*, *f* : *E* → *V*. Then one may define an abstract integration map assigning to each function f an element of V or the symbol *∞*,

that is compatible with linear combinations.^{[30]} In this situation, the linearity holds for the subspace of functions whose integral is an element of V (i.e. "finite"). The most important special cases arise when K is **R**, **C**, or a finite extension of the field **Q**_{p} of p-adic numbers, and V is a finite-dimensional vector space over K, and when *K* = **C** and V is a complex Hilbert space.

Linearity, together with some natural continuity properties and normalization for a certain class of "simple" functions, may be used to give an alternative definition of the integral. This is the approach of Daniell for the case of real-valued functions on a set X, generalized by Nicolas Bourbaki to functions with values in a locally compact topological vector space. See Hildebrandt 1953 for an axiomatic characterization of the integral.

A number of general inequalities hold for Riemann-integrable functions defined on a closed and bounded interval [*a*, *b*] and can be generalized to other notions of integral (Lebesgue and Daniell).

In this section, f is a real-valued Riemann-integrable function. The integral

over an interval [*a*, *b*] is defined if *a* < *b*. This means that the upper and lower sums of the function f are evaluated on a partition *a* = *x*_{0} ≤ *x*_{1} ≤ . . . ≤ *x*_{n} = *b* whose values *x*_{i} are increasing. Geometrically, this signifies that integration takes place "left to right", evaluating f within intervals [*x*_{ i} , *x*_{ i +1}] where an interval with a higher index lies to the right of one with a lower index. The values a and b, the end-points of the interval, are called the limits of integration of f. Integrals can also be defined if *a* > *b*:^{[17]}

The first convention is necessary in consideration of taking integrals over subintervals of [*a*, *b*]; the second says that an integral taken over a degenerate interval, or a point, should be zero. One reason for the first convention is that the integrability of f on an interval [*a*, *b*] implies that f is integrable on any subinterval [*c*, *d*], but in particular integrals have the property that if c is any element of [*a*, *b*], then:^{[29]}

The *fundamental theorem of calculus* is the statement that differentiation and integration are inverse operations: if a continuous function is first integrated and then differentiated, the original function is retrieved.^{[33]} An important consequence, sometimes called the *second fundamental theorem of calculus*, allows one to compute integrals by using an antiderivative of the function to be integrated.^{[34]}

Let f be a continuous real-valued function defined on a closed interval [*a*, *b*]. Let F be the function defined, for all x in [*a*, *b*], by

Then, F is continuous on [*a*, *b*], differentiable on the open interval (*a*, *b*), and

Let f be a real-valued function defined on a closed interval [*a*, *b*] that admits an antiderivative F on [*a*, *b*]. That is, f and F are functions such that for all x in [*a*, *b*],

A "proper" Riemann integral assumes the integrand is defined and finite on a closed and bounded interval, bracketed by the limits of integration. An improper integral occurs when one or more of these conditions is not satisfied. In some cases such integrals may be defined by considering the limit of a sequence of proper Riemann integrals on progressively larger intervals.

If the interval is unbounded, for instance at its upper end, then the improper integral is the limit as that endpoint goes to infinity:^{[35]}

If the integrand is only defined or finite on a half-open interval, for instance (*a*, *b*], then again a limit may provide a finite result:^{[36]}

That is, the improper integral is the limit of proper integrals as one endpoint of the interval of integration approaches either a specified real number, or ∞, or −∞. In more complicated cases, limits are required at both endpoints, or at interior points.

where the differential *dA* indicates that integration is taken with respect to area. This double integral can be defined using Riemann sums, and represents the (signed) volume under the graph of *z* = *f*(*x*,*y*) over the domain *R*.^{[38]} Under suitable conditions (e.g., if *f* is continuous), Fubini's theorem states that this integral can be expressed as an equivalent iterated integral^{[39]}

This reduces the problem of computing a double integral to computing one-dimensional integrals. Because of this, another notation for the integral over *R* uses a double integral sign:^{[38]}

The concept of an integral can be extended to more general domains of integration, such as curved lines and surfaces inside higher-dimensional spaces. Such integrals are known as line integrals and surface integrals respectively. These have important applications in physics, as when dealing with vector fields.

A *line integral* (sometimes called a *path integral*) is an integral where the function to be integrated is evaluated along a curve.^{[40]} Various different line integrals are in use. In the case of a closed curve it is also called a *contour integral*.

The function to be integrated may be a scalar field or a vector field. The value of the line integral is the sum of values of the field at all points on the curve, weighted by some scalar function on the curve (commonly arc length or, for a vector field, the scalar product of the vector field with a differential vector in the curve).^{[41]} This weighting distinguishes the line integral from simpler integrals defined on intervals. Many simple formulas in physics have natural continuous analogs in terms of line integrals; for example, the fact that work is equal to force, **F**, multiplied by displacement, **s**, may be expressed (in terms of vector quantities) as:^{[42]}

For an object moving along a path *C* in a vector field **F** such as an electric field or gravitational field, the total work done by the field on the object is obtained by summing up the differential work done in moving from **s** to **s** + *d***s**. This gives the line integral^{[43]}

A *surface integral* generalizes double integrals to integration over a surface (which may be a curved set in space); it can be thought of as the double integral analog of the line integral. The function to be integrated may be a scalar field or a vector field. The value of the surface integral is the sum of the field at all points on the surface. This can be achieved by splitting the surface into surface elements, which provide the partitioning for Riemann sums.^{[44]}

For an example of applications of surface integrals, consider a vector field **v** on a surface *S*; that is, for each point *x* in *S*, **v**(*x*) is a vector. Imagine that a fluid flows through *S*, such that **v**(*x*) determines the velocity of the fluid at x. The flux is defined as the quantity of fluid flowing through *S* in unit amount of time. To find the flux, one need to take the dot product of **v** with the unit surface normal to *S* at each point, which will give a scalar field, which is integrated over the surface:^{[45]}

The fluid flux in this example may be from a physical fluid such as water or air, or from electrical or magnetic flux. Thus surface integrals have applications in physics, particularly with the classical theory of electromagnetism.

A differential form is a mathematical concept in the fields of multivariable calculus, differential topology, and tensors. Differential forms are organized by degree. For example, a one-form is a weighted sum of the differentials of the coordinates, such as:

where *E*, *F*, *G* are functions in three dimensions. A differential one-form can be integrated over an oriented path, and the resulting integral is just another way of writing a line integral. Here the basic differentials *dx*, *dy*, *dz* measure infinitesimal oriented lengths parallel to the three coordinate axes.

Unlike the cross product, and the three-dimensional vector calculus, the wedge product and the calculus of differential forms makes sense in arbitrary dimension and on more general manifolds (curves, surfaces, and their higher-dimensional analogs). The exterior derivative plays the role of the gradient and curl of vector calculus, and Stokes' theorem simultaneously generalizes the three theorems of vector calculus: the divergence theorem, Green's theorem, and the Kelvin-Stokes theorem.

The discrete equivalent of integration is summation. Summations and integrals can be put on the same foundations using the theory of Lebesgue integrals or time scale calculus.

Integrals are used extensively in many areas. For example, in probability theory, integrals are used to determine the probability of some random variable falling within a certain range.^{[46]} Moreover, the integral under an entire probability density function must equal 1, which provides a test of whether a function with no negative values could be a density function or not.^{[47]}

Integrals are also used in thermodynamics, where thermodynamic integration is used to calculate the difference in free energy between two given states.

The most basic technique for computing definite integrals of one real variable is based on the fundamental theorem of calculus. Let *f*(*x*) be the function of x to be integrated over a given interval [*a*, *b*]. Then, find an antiderivative of f; that is, a function F such that *F*′ = *f* on the interval. Provided the integrand and integral have no singularities on the path of integration, by the fundamental theorem of calculus,

Sometimes it is necessary to use one of the many techniques that have been developed to evaluate integrals. Most of these techniques rewrite one integral as a different one which is hopefully more tractable. Techniques include integration by substitution, integration by parts, integration by trigonometric substitution, and integration by partial fractions.

Alternative methods exist to compute more complex integrals. Many nonelementary integrals can be expanded in a Taylor series and integrated term by term. Occasionally, the resulting infinite series can be summed analytically. The method of convolution using Meijer G-functions can also be used, assuming that the integrand can be written as a product of Meijer G-functions. There are also many less common ways of calculating definite integrals; for instance, Parseval's identity can be used to transform an integral over a rectangular region into an infinite sum. Occasionally, an integral can be evaluated by a trick; for an example of this, see Gaussian integral.

Computations of volumes of solids of revolution can usually be done with disk integration or shell integration.

Specific results which have been worked out by various techniques are collected in the list of integrals.

Many problems in mathematics, physics, and engineering involve integration where an explicit formula for the integral is desired. Extensive tables of integrals have been compiled and published over the years for this purpose. With the spread of computers, many professionals, educators, and students have turned to computer algebra systems that are specifically designed to perform difficult or tedious tasks, including integration. Symbolic integration has been one of the motivations for the development of the first such systems, like Macsyma and Maple.

A major mathematical difficulty in symbolic integration is that in many cases, a relatively simple function does not have integrals that can be expressed in closed form involving only elementary functions, include rational and exponential functions, logarithm, trigonometric functions and inverse trigonometric functions, and the operations of multiplication and composition. The Risch algorithm provides a general criterion to determine whether the antiderivative of an elementary function is elementary, and to compute it if it is. However, functions with closed expressions of antiderivatives are the exception, and consequently, computerized algebra systems have no hope of being able to find an antiderivative for a randomly constructed elementary function. On the positive side, if the 'building blocks' for antiderivatives are fixed in advance, it may still be possible to decide whether the antiderivative of a given function can be expressed using these blocks and operations of multiplication and composition, and to find the symbolic answer whenever it exists. The Risch algorithm, implemented in Mathematica, Maple and other computer algebra systems, does just that for functions and antiderivatives built from rational functions, radicals, logarithm, and exponential functions.

Some special integrands occur often enough to warrant special study. In particular, it may be useful to have, in the set of antiderivatives, the special functions (like the Legendre functions, the hypergeometric function, the gamma function, the incomplete gamma function and so on). Extending the Risch's algorithm to include such functions is possible but challenging and has been an active research subject.

More recently a new approach has emerged, using *D*-finite functions, which are the solutions of linear differential equations with polynomial coefficients. Most of the elementary and special functions are *D*-finite, and the integral of a *D*-finite function is also a *D*-finite function. This provides an algorithm to express the antiderivative of a *D*-finite function as the solution of a differential equation. This theory also allows one to compute the definite integral of a *D*-function as the sum of a series given by the first coefficients, and provides an algorithm to compute any coefficient.

Definite integrals may be approximated using several methods of numerical integration. The rectangle method relies on dividing the region under the function into a series of rectangles corresponding to function values and multiplies by the step width to find the sum. A better approach, the trapezoidal rule, replaces the rectangles used in a Riemann sum with trapezoids. The trapezoidal rule weights the first and last values by one half, then multiplies by the step width to obtain a better approximation.^{[52]} The idea behind the trapezoidal rule, that more accurate approximations to the function yield better approximations to the integral, can be carried further: Simpson's rule approximates the integrand by a piecewise quadratic function.^{[53]}

Riemann sums, the trapezoidal rule, and Simpson's rule are examples of a family of quadrature rules called the Newton–Cotes formulas. The degree n Newton–Cotes quadrature rule approximates the polynomial on each subinterval by a degree *n* polynomial. This polynomial is chosen to interpolate the values of the function on the interval.^{[54]} Higher degree Newton–Cotes approximations can be more accurate, but they require more function evaluations, and they can suffer from numerical inaccuracy due to Runge's phenomenon. One solution to this problem is Clenshaw–Curtis quadrature, in which the integrand is approximated by expanding it in terms of Chebyshev polynomials.

Romberg's method halves the step widths incrementally, giving trapezoid approximations denoted by *T*(*h*_{0}), *T*(*h*_{1}), and so on, where *h*_{k+1} is half of *h*_{k}. For each new step size, only half the new function values need to be computed; the others carry over from the previous size. It then interpolate a polynomial through the approximations, and extrapolate to *T*(0). Gaussian quadrature evaluates the function at the roots of a set of orthogonal polynomials.^{[55]} An n-point Gaussian method is exact for polynomials of degree up to 2*n* − 1.

The computation of higher-dimensional integrals (for example, volume calculations) makes important use of such alternatives as Monte Carlo integration.^{[56]}

The area of an arbitrary two-dimensional shape can be determined using a measuring instrument called planimeter. The volume of irregular objects can be measured with precision by the fluid displaced as the object is submerged.

Area can sometimes be found via geometrical compass-and-straightedge constructions of an equivalent square.