# Categorical proposition

In logic, a **categorical proposition**, or **categorical statement**, is a proposition that asserts or denies that all or some of the members of one category (the *subject term*) are included in another (the *predicate term*).^{[1]} The study of arguments using categorical statements (i.e., syllogisms) forms an important branch of deductive reasoning that began with the Ancient Greeks.

Ancient Greek logicians such as Aristotle identified four primary types of categorical proposition and gave them standard forms (now often called *A*, *E*, *I*, and *O*). If, abstractly, the subject category is named *S* and the predicate category is named *P*, the four standard forms are:

- All *S* are *P*. (*A* form)
- No *S* are *P*. (*E* form)
- Some *S* are *P*. (*I* form)
- Some *S* are not *P*. (*O* form)

Surprisingly, a large number of sentences may be translated into one of these canonical forms while retaining all or most of the original meaning of the sentence. Greek investigations resulted in the so-called square of opposition, which codifies the logical relations among the different forms. For example, an *A*-statement is contradictory to an *O*-statement: if one believes "All apples are red fruits," one cannot simultaneously believe that "Some apples are not red fruits." Thus the relationships of the square of opposition allow immediate inference, whereby the truth or falsity of one of the forms may follow directly from the truth or falsity of a statement in another form.
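The four forms and the contradictory pairs can be illustrated with a minimal sketch that models classes as Python sets. The function names and the sample classes are assumptions made for illustration, and the universal forms are read without any requirement that the subject class be non-empty.

```python
# Classes are modeled as Python sets; all names here are illustrative.

def form_a(s, p):
    """A: All S are P (every member of S is in P)."""
    return s <= p

def form_e(s, p):
    """E: No S are P (S and P share no members)."""
    return s.isdisjoint(p)

def form_i(s, p):
    """I: Some S are P (S and P share at least one member)."""
    return not s.isdisjoint(p)

def form_o(s, p):
    """O: Some S are not P (some member of S lies outside P)."""
    return not s <= p

# Hypothetical sample classes.
apples = {"gala", "fuji", "granny smith"}
red_fruits = {"gala", "fuji", "cherry"}

# Contradictories always take opposite truth values: A vs. O, and E vs. I.
print(form_a(apples, red_fruits), form_o(apples, red_fruits))  # False True
print(form_e(apples, red_fruits), form_i(apples, red_fruits))  # False True
```

Because `form_o` is defined as the negation of `form_a`, and `form_i` as the negation of `form_e`, the contradictory relation holds for any pair of sets, not only for this example.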

Modern understanding of categorical propositions (originating with the mid-19th century work of George Boole) requires one to consider whether the subject category may be empty. Allowing an empty subject category is called the *hypothetical viewpoint*, in contrast to the *existential viewpoint*, which requires the subject category to have at least one member. The existential viewpoint is a stronger stance than the hypothetical and, when it is appropriate to take, it allows one to deduce more results than would otherwise be possible. The hypothetical viewpoint, being the weaker view, has the effect of removing some of the relations present in the traditional square of opposition.
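One relation that depends on the viewpoint is the inference from "All *S* are *P*" to "Some *S* are *P*" (traditionally called subalternation). The following sketch checks it by brute force over every subset of a small universe; the set-based reading of the forms, the helper names, and the three-element universe are assumptions chosen for illustration.

```python
from itertools import combinations

def subsets(universe):
    """Return every subset of `universe` as a frozenset."""
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def form_a(s, p):
    """A: All S are P."""
    return s <= p

def form_i(s, p):
    """I: Some S are P."""
    return bool(s & p)

def subalternation_holds(subject_classes, predicate_classes):
    """True if 'All S are P' implies 'Some S are P' for every pair."""
    return all(form_i(s, p)
               for s in subject_classes
               for p in predicate_classes
               if form_a(s, p))

all_classes = subsets({1, 2, 3})
nonempty = [c for c in all_classes if c]

# Existential viewpoint: subject classes are required to be non-empty.
print(subalternation_holds(nonempty, all_classes))     # True
# Hypothetical viewpoint: the empty subject class is allowed.
print(subalternation_holds(all_classes, all_classes))  # False
```

The inference fails in the second case only because an empty subject class makes the *A* form vacuously true while the *I* form is false.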

Arguments consisting of three categorical propositions — two as premises and one as conclusion — are known as categorical syllogisms and were of paramount importance from the times of ancient Greek logicians through the Middle Ages. Although formal arguments using categorical syllogisms have largely given way to the increased expressive power of modern logic systems like the first-order predicate calculus, they still retain practical value in addition to their historic and pedagogical significance.

Sentences in natural language may be translated into standard forms. In each row of the following chart, *S* corresponds to the subject of the example sentence, and *P* corresponds to the predicate.

Note that "All *S* is not *P*" (e.g., "All cats do not have eight legs") is not classified as an example of the standard forms. This is because the translation to natural language is ambiguous. In common speech, the sentence "All cats do not have eight legs" could be used informally to indicate either (1) "At least some, and perhaps all, cats do not have eight legs" or (2) "No cats have eight legs".

Categorical propositions can be categorized into four types on the basis of their "quality" and "quantity", or their "distribution of terms". These four types have long been named *A*, *E*, *I*, and *O*. This is based on the Latin ***a**ff**i**rmo* (I affirm), referring to the affirmative propositions *A* and *I*, and *n**e**g**o*** (I deny), referring to the negative propositions *E* and *O*.^{[2]}

**Quantity** refers to the number of members of the subject class that are used in the proposition. (A *class* is a collection or group of things designated by a term that is either subject or predicate in a categorical proposition.^{[3]}) If the proposition refers to all members of the subject class, it is *universal*. If the proposition does not employ all members of the subject class, it is *particular*. For instance, an *I*-proposition ("Some *S* is *P*") is particular since it only refers to some of the members of the subject class.

**Quality** refers to whether the proposition affirms or denies the inclusion of a subject within the class of the predicate. The two possible qualities are called *affirmative* and *negative*.^{[4]} For instance, an *A*-proposition ("All *S* is *P*") is affirmative since it states that the subject is contained within the predicate. On the other hand, an *O*-proposition ("Some *S* is not *P*") is negative since it excludes the subject from the predicate.

An important consideration is the definition of the word *some*. In logic, *some* refers to "one or more", which is consistent with "all". Therefore, the statement "Some S is P" does not guarantee that the statement "Some S is not P" is also true.
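A short sketch of this point, again modeling classes as sets (the function names and sample classes are illustrative assumptions): when every *S* is a *P*, "Some *S* is *P*" holds while "Some *S* is not *P*" does not.

```python
def some_are(s, p):
    """'Some S is P': at least one member of S is in P."""
    return bool(s & p)

def some_are_not(s, p):
    """'Some S is not P': at least one member of S is outside P."""
    return bool(s - p)

# Hypothetical sample classes in which every dog is a mammal.
dogs = {"beagle", "poodle"}
mammals = {"beagle", "poodle", "whale"}

print(some_are(dogs, mammals))      # True
print(some_are_not(dogs, mammals))  # False
```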

The two terms (subject and predicate) in a categorical proposition may each be classified as **distributed** or **undistributed**. If all members of the term's class are affected by the proposition, that class is *distributed*; otherwise it is *undistributed*. Every proposition therefore has one of four possible *distributions of terms*.

Each of the four canonical forms will be examined in turn regarding its distribution of terms. Although not developed here, Venn diagrams are sometimes helpful when trying to understand the distribution of terms for the four forms.

An *A*-proposition distributes the subject to the predicate, but not the reverse. Consider the following categorical proposition: "All dogs are mammals". All dogs are indeed mammals, but it would be false to say all mammals are dogs. Since all dogs are included in the class of mammals, "dogs" is said to be distributed to "mammals". Since not all mammals are necessarily dogs, "mammals" is undistributed to "dogs".

An *E*-proposition distributes bidirectionally between the subject and predicate. From the categorical proposition "No beetles are mammals", we can infer that no mammals are beetles. Since all beetles are defined not to be mammals, and all mammals are defined not to be beetles, both classes are distributed.

Both terms in an *I*-proposition are undistributed. Consider, for example, "Some Americans are conservatives". Neither term can be entirely distributed to the other. From this proposition, it is not possible to say that all Americans are conservatives or that all conservatives are Americans. Note the ambiguity in the statement: it could mean either that "Some Americans or other are conservatives" (*de dicto*), or that "Some Americans (in particular, Albert and Bob) are conservatives" (*de re*).

In an *O*-proposition, only the predicate is distributed. Consider the following: "Some politicians are not corrupt". Since not all politicians are defined by this rule, the subject is undistributed. The predicate, though, is distributed because all the members of "corrupt people" will not match the group of people defined as "some politicians". Since the rule applies to every member of the corrupt people group, namely, "All corrupt people are not some politicians", the predicate is distributed.

In short, for the subject to be distributed, the statement must be universal (e.g., "all", "no"). For the predicate to be distributed, the statement must be negative (e.g., "no", "not").^{[5]}
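This rule can be written down as a small lookup, sketched below. The table and function names are illustrative assumptions; the quantity and quality assigned to each form follow the definitions given earlier.

```python
# Quantity and quality of each standard form.
FORMS = {
    "A": ("universal", "affirmative"),
    "E": ("universal", "negative"),
    "I": ("particular", "affirmative"),
    "O": ("particular", "negative"),
}

def distribution(form):
    """Return which terms a form distributes, using the rule of thumb:
    subject is distributed iff universal, predicate iff negative."""
    quantity, quality = FORMS[form]
    return {
        "subject": quantity == "universal",
        "predicate": quality == "negative",
    }

for form in "AEIO":
    print(form, distribution(form))
```

The result agrees with the four cases examined above: *A* distributes only the subject, *E* distributes both terms, *I* distributes neither, and *O* distributes only the predicate.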

Peter Geach and others have criticized the use of distribution to determine the validity of an argument.^{[6]}^{[7]}

It has been suggested that statements of the form "Some A are not B" would be less problematic if stated as "Not every A is B,"^{[8]} which is perhaps a closer translation to Aristotle's original form for this type of statement.^{[9]}

There are several operations (e.g., conversion, obversion, and contraposition) that can be performed on a categorical statement to change it into another. The new statement may or may not be equivalent to the original. [In the following tables that illustrate such operations, in each row, boxes are green if the statements in them are equivalent to one another, and red if the statements in them are not equivalent to one another. Statements in a yellow box are implied by the statement in the left-most box when the condition stated in the same yellow box is satisfied.]

Some operations require the notion of the *class complement*. This refers to every element under consideration which is *not* an element of the class. Class complements are very similar to set complements. The class complement of a class *P* will be called "non-*P*".

From a statement in *E* or *I* form, it is valid to conclude its converse (as they are equivalent). This is not the case for the *A* and *O* forms.
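The claim about conversion (swapping subject and predicate) can be checked by brute force over every pair of subsets of a small universe. This is a sketch under the assumption that classes are modeled as sets; the helper names and the three-element universe are illustrative.

```python
from itertools import combinations, product

def subsets(universe):
    """Return every subset of `universe` as a frozenset."""
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

# Set-based truth conditions for the four standard forms.
FORMS = {
    "A": lambda s, p: s <= p,        # All S are P
    "E": lambda s, p: not (s & p),   # No S are P
    "I": lambda s, p: bool(s & p),   # Some S are P
    "O": lambda s, p: bool(s - p),   # Some S are not P
}

def converse_is_equivalent(form, classes):
    """True if swapping subject and predicate never changes the truth value."""
    check = FORMS[form]
    return all(check(s, p) == check(p, s)
               for s, p in product(classes, repeat=2))

classes = subsets({1, 2, 3})
for name in "AEIO":
    print(name, converse_is_equivalent(name, classes))
# A False
# E True
# I True
# O False
```

A single counterexample is enough to break the equivalence for *A* and *O*: with *S* = {1} and *P* = {1, 2}, "All *S* are *P*" is true but "All *P* are *S*" is false.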

Categorical statements are logically equivalent to their obverse. As such, a Venn diagram illustrating any one of the forms would be identical to the Venn diagram illustrating its obverse.
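Obversion changes the quality of a statement and replaces the predicate with its class complement, so that, for example, "All *S* are *P*" becomes "No *S* are non-*P*". The sketch below checks the equivalence by brute force, taking the complement relative to a small, arbitrarily chosen universe; the names and the universe are assumptions made for illustration.

```python
from itertools import combinations, product

def subsets(universe):
    """Return every subset of `universe` as a frozenset."""
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

# Set-based truth conditions for the four standard forms.
FORMS = {
    "A": lambda s, p: s <= p,        # All S are P
    "E": lambda s, p: not (s & p),   # No S are P
    "I": lambda s, p: bool(s & p),   # Some S are P
    "O": lambda s, p: bool(s - p),   # Some S are not P
}

# Obversion keeps the quantity and flips the quality.
OBVERSE_FORM = {"A": "E", "E": "A", "I": "O", "O": "I"}

def obverse_is_equivalent(form, universe):
    """True if a form always agrees with its obverse, where non-P is
    the complement of P relative to `universe`."""
    original = FORMS[form]
    obverted = FORMS[OBVERSE_FORM[form]]
    classes = subsets(universe)
    return all(original(s, p) == obverted(s, universe - p)
               for s, p in product(classes, repeat=2))

universe = frozenset({1, 2, 3})
for name in "AEIO":
    print(name, obverse_is_equivalent(name, universe))
# A True
# E True
# I True
# O True
```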