Axioms
This chapter contains notes on the foundational underpinnings of set theory.
There are two versions of set theory: (1) naive set theory, and (2) axiomatic set theory. We explore aspects of both theories in the materials that follow. What are the differences? Naive set theory is the version of set theory as originally constructed by the German mathematician Georg Cantor.1 Axiomatic set theory is the version of set theory constructed in response to several paradoxes, the most famous of which is Russell's Paradox. In this chapter, we focus only on naive set theory.
Foundational Axioms
In mathematics, a set is a collection of objects. Those objects might be dogs, people, trees, numbers, values, variables, or more abstractly, propositions. For example, the set of all coins includes nickels, sterling, and yen. The set of all countries includes Japan, Palau, and Uruguay. The set of all integers includes 1 and -117. We can visualize a set as:
Sets can be finite (there are only so many countries), or infinite (there are infinitely many of numbers). Each object in a set is called a member of that set.
Axiom of Extension
The set is the same as the set , and the same set as . We can draw this conclusion because of the axiom of extension.
axiom of extension. Two sets are equal if and only if they have the same elements.
In short, the axiom of extension provides that a set is completely determined by what its elements areβit is irrelevant whether the elements are in order, or whether an element is listed more than once.
Axiom of Specification
The axiom of specification, also referred to as Aussonderungsaxiom, provides the following:
axiom of specification. To every set and to every condition there exists a set whose elements are exactly the elements of for which holds.
Set theorists introduced the axiom of specification to get around some awkward gaps philosophers noticed in the early 1900s. One such gap is Russell's Paradox, discussed in a separate chapter. In essence, without the axiom of specification, we get the conclusion that nothing contains everything.
While we could simply accept this as a KΕan, it causes enough discomfort to warrant an axiom. The axiom of specification, in short, establishes that there exists a set we can specify (e.g., tailor) to a set we desire.
A set that's "tailored" is said to be well-defined. We say that a set is well-defined if, for any given we can determine whether is a member of the set. For example, the set is a well-defined set. Given any we can determine whether is a member of must be a real number, and it must be greater than 4. The set however, is not a well-defined set. Is a member of We do not have enough conditions to make a determination.
Axiom of Infinity
We state the axiom of infinity as follows:
axiom of infinity. There exists a set containing and containing the successor of each of its elements.
The axiom of infinity provides that there is at least one infinite set. This may not seem important or interesting at this point, but it proves to be extremely useful at a later juncture. Of note, the axiom of infinity is an issue in mathematics subject to debate. The hot button is whether the axiom of infinity is necessary.
Set Notation
To help make our future discussions more efficient, let's agree on some basic notation and terminology. First, the notion of logical identity.
logic identity. The expression
means: and are symbols for the same object. If and are different objects, we write:
Sets can be expressed in a variety of ways:
- "The set of all integers whose square is between, and includes, one and nine."
Let's go over these approaches. We denote the notion of set membership with the symbol This is a variant of the lowercase epsilon (), which is the first letter of the Greek word meaning "is."
set membership. Let be a set. The notation
means: is an element of If is not an element of we write:
A set may be made more explicit through set roster notation. For instance, is a set containing the elements 0, 1, and 2. If the set has too many elements to list, ellipses may be used. For example, the set of all integers from 0 to 100 may be written as: . Likewise, the set of all positive integers may be written as .
We can also specify sets in set builder notation. Suppose that is a set, and is a property that elements of may or may not satisfy. We can define a new set that contains all the elements of such that is true:
The pipe character, |, is shorthand for "such that." Thus, above notation refers to all the elements of with the property . Note that in some mathematical contexts, the phrase "such that" is often abbreviated "s.t." or . Thus, the above set may appear in some texts as:
Subsets
Some sets are themselves elements of another set. These sets are called subsets.2 For example, the set of all Herefords is a subset of the set of all cows, and the set of all bonobos is a subset of the set of all great apes. More formally:
subset. Suppose and are sets. The set is a subset of if, and only if, every element of is also an element of To denote this relationship, we write:
If is not a subset of , then we write:
If is true, then we can infer: For every element if then We describe and relationship with statements such as " is contained in " and " contains "
If is true, then we can infer: There is at least one element such that and . In other words, if is not a subset of , then there is at least one element in that is not an element of
From the definition above, if we are asked to prove that is a subset of we must show that if belongs to then also belongs to If we are asked to prove that is not a subset of we must find a single that belongs to but does not belong to
The definition above implies that the empty set is a subset of all sets, and a given set is a subset of itself. In other words, for the set the sets and are subsets of We say that and are trivial subsets of
This follows directly from the definition. The proposition if, for all Because the sufficient condition, is false, it follows that is true. This is an example of a vacuous proof.
The case of straightforward. Given that and are the same sets, every implies that Because the sufficient condition is true and the necessary condition is true, it follows that by definition.
Proper Subsets
If a subset of is such that and then we say that is called a proper subset of For example, the set of all potatoes is a subset of all root vegetables. It certainly isn't emptyβyou've got Yukon Golds, Russet potatoes, Japanese sweet potatoes, LaRette potatoes, and so on. But it also doesn't comprise the entire set of all root vegetables either, because there are other root vegetables: taro, yams, tapioca, turmeric, ginseng, ginger, beets, and so on. Thus, we say that, "Potatoes are a proper subset of root vegetables," and write:
A subset is a general concept. When we say that is a "subset" of we allow the possibility of and being equal. For example, the set of all aardvarks is a subset of all Orycteropodidae, and the set of all Orycteropodidae is a subset of all Tubulidentata. However, aardvarks comprise all the members of Orycteropodidae, and Orycteropodidae comprise all the members of Tubulidentata. Thus, these sets are all equal, but are also subsets.3
Now, if some extinct species of Orycteropodidae comes back to life or some new species is discovered, then the aardvarks would not comprise all of Orycteropodidae. In that situation, aardvarks are no longer just a subset of Orycteropodidae. We can still refer to aardvarks as a subset of Orycteropodidae, but we are now also permitted to more specifically say that aardvarks are a strict subset (or proper subset) of Orycteropodidae.
strict subset. Suppose and are sets. is a proper subset of if, and only if, every element of is in but there is at least one element in that is not in . If is a proper subset of , we write:
If is true, we can infer: All of the elements of are elements of , but there is at least one element of that is not an element of We can also infer that all the elements of have some property in common that separate it from other subsets of
A helpful way to distinguish between the two symbols is noticing how they share similarities to the symbols and means "subset or equal" and means "subset but not equal;" the former is more general than the latter.
If is false (i.e., is not a proper subset of ), we write:
If is true, then either is true, or is true, but no elements in are not in .
The Empty Set
Some sets have no members. For example, the set of all positive integers greater than their squares is the empty set. If there is even a single element in the set, then the set is not the empty set. Such a set (a set with a single member) is called a singleton.
Importantly, we shouldn't confuse the lack of an element with the lack of a subset. The empty set does have a subsetβthe empty set. But the fact that it has a subset does not imply that it has an element. We will see why this is the case shortly.
The empty set is both an interesting and useful concept to have. Proferred by George Boole, the empty set embodies the concept of nothingness. Nothingness is a difficult concept to think about, but as with many things in mathematics, the idea is encapsulatedβabstracted awayβinto a single symbol: The abstraction allows us to reason about nothingness differently. Usually, when we think of filling an empty pint with beer, we tend to think that we are somehow negating the glass's emptiness; sort of "removing" the nothingness inside the pint. however, is a member of every set except itself. We cannot remove nothingness from the set of all natural numbersβit's always there.
The empty set is a very useful notion, but it can also be a footgun if we don't use it carefully. To prevent fiascos, we will always denote the empty set with the null symbol: At no point do we ever write:
This is equivalent to a spelling mistake in mathematics. Second, the set containing an empty set is not the same as the empty set:
Once more, these are two very different things. One is an actual object (), the other is a box containing a single object (). The empty set can be thought of as representing an empty folder. If we place an empty folder inside an empty folder then the folder is no longer empty: It contains a folder. The distinction between elements and subsets is a critical distinction.
Vacuous Truth
The empty set offers an opportunity to revisit the notion of vacuous truth. Here's the question: Suppose we had a set Is the empty set a subset of The tempting answer is "It depends. What's in " Unfortunately, the temptation is wrongβit does not depend. Recall the definition of a subset:
The key point to consider is the condition Now substitute with Is the statement true? Of course not. The empty set is emptyβthere's nothing in it. This means that the sufficient condition for the definition of a subset is always false when we consider the empty set as a subset. And what happens when a sufficient condition is false? The conditional proposition is always true! This is a vacuous truth: The empty set is a subset of In fact, the empty set is a subset of every set, including itself (since the condition is still false if we substitute with ).
The notion of vacuous truth may seem strange, but it follows directly from logic. Consider the negation of the definition for a subset:
In other words, is not a subset of if there's an in that's not in We know there's nothing in the empty set, so there can never be an in that's not in There are no to begin with.
This discussion leads to some helpful points regarding proofs concerning sets:
-
To prove that show that and are just trenchcoats: If is inside then is also inside
-
To prove that find a culprit: There's at least one inside that's not inside
If we think a little more carefully about the definition for subsets, we'd realize that this whole idea of the empty set being a subset of itself extends to a more general conclusion:
theorem. For every set
Why is this true? Well, we know that since has no members, the sufficient condition for the definition of a subset is always false, so it follows that is always true. Next, by the law of identity, every statement implies itself. Thus, This implies that every element of is an element of And since every element of is an element of it follows that
Set Identity
Consider the following sets:
By the axiom of extension, and all have the same elements. Observing this, we have the following definition:
set identity. Two sets and are identical if, and only if, every element of an element of and every element of is an element of
From the definition above, given to sets and to prove that is equal to we prove that and
Cardinality
Suppose we have the following set, called
How many elements are in this set? There are 4. The number of elements in a set is called the set's cardinality. The term cardinality is derived from the term cardinal number (a number used for counting, rather than ordering). Cardinality plays a special role in both mathematics and programming. For example, in computer science, an uncountable function is a function where no computer program can be written to find all its values. In other words, there are some functions that we, as programmers, cannot write simply because the cardinality of the function's set of outputs is either too large to count or cannot be counted at all.
cardinality. Given a set if there are exactly elements in where then is a finite set with a cardinality of We denote this fact with:
The notation is not the only one way to denote cardinality. Some texts denote cardinality with and We use but will switch notations as necessary for clarity. Note that when analyzing cardinality, the axiom of extension applies. The set and are the same sets. Thus, both and have a cardinality of 2 (i.e., )
Another useful fact when handling cardinality: The empty set is the only set with a cardinality of 0:
cardinality of the null. Given a set if then is the empty set,
Further note that because sets denote the number of elements in a set, for our purposes, we follow the axiom that cardinality can never be negative.4 Because we're discussing set algebra, we are careful to state this axiom explicitly:
positivity of cardinality. Given a set
One-to-One Correspondence
When Oliver Twist asked for more gruel, the master grew pale and aimed a blow at Oliver's head. Why? Because at meal time, each child was to receive one, and only one, serving of gruel. Suppose each boy is a member of the set and each serving of gruel is a member of the set Then, each element of gets one, and only one, element of In mathematics, we call this one-to-one correspondence.
one-to-one correspondence. Let and be sets.
and are in one-to-one correspondence if there exists a pairing of the elements of to the elements of such that each corresponds to one, and only one, and each corresponds to one, and only one,
Equinumerosity
When a set and have the same cardinality, we say that and are equinumerous. As we'll see in later chapters, know whether sets are equinumerous is a powerful fact utilized in not just set theory, but many other areas of mathematics.
equinumerosity. Let and be sets. and have the same cardinality if, and only if, there is a one-to-one correspondence from to If and have the same cardinality, we say that and are equinumerous, and write
Without going into the details now, we can compare the cardinalities of sets with the classical comparison operators. For example, when we write , we are stating, "Set is smaller than set " When we write we are stating, "Set is smaller or the same size as set "
Countability
Careful thought into sets reveals that there are some sets we can't count. For example, intuitively, we might think that because there are infinitely many integers, there are infinitely many odd positive integers, so we simply can't count all of the odd positive integers. This intuition, however, is wrong. Why? Because mathematics has a very strict definition of "countable":
countability. A set is countable if, and only if:
- is finite, or
Otherwise, is uncountable.
If is an infinite set and countable, we denote the cardinality of by writing and say " has cardinality aleph null."
Based on this definition, we can see that the set of odd positive integers, is a countable set. Why? Consider the function where This function maps the set of positive integers to the set of odd numbers. Now, recall the definition for when two sets are considered equal. They are equal iff there is a one-to-one correspondence between and Two sets have one-to-one correspondence if they are both (a) one-to-one and (b) onto.
The first requirement, (a), is satisfied. Suppose that From this assumption, it follows that It then follows that This proves that the set is onto.
The second requirement, (b), is satisfied. Let's say there's some odd positive integer Since is an odd positive integer, it is a positive integer of the form where This in turn means that will always be less than an even positive integer As such, it follows that And given this fact, no an odd positive integer, can map to even integer. It follows then that the odd positive integers will always map to the positive integers.
Satisfying both (a) and (b), the function is a one-to-one correspondence from to And because it is of one-to-one correspondence, it follows that This satisfies the definition of countability aboveβthe set of odd positive integers is countable.
Set Union
Suppose there are two sets, and . The union of and , denoted is the set of all elements of both and . Visually:
Venn diagrams are named after the English mathematician John Venn, who introduced their usage in 1881. More formally:
set union. Given sets and is the set of all elements in either or both and
Big Union Operator
In some cases, we want to express the notion of a union of many sets. For example, suppose we had the sets:
We could express the union of all three of these sets by writing:
However, we can also communicate the same idea by writing:
The big "U" is is colloquially called the "big union" or "big cup" (from its
corresponding LaTeX command, \bigcup
).
big union. Let be sets. The union of these sets is denoted:
Cardinality of Unions
Working with set theory, knowing the size of a union is tremendously useful. However, we must be careful when counting the elements. Because consists of all the elements in and the elements in both and the elements in both and are counted twice. Accordingly, we must subtract the intersection of and to retrieve the correct cardinality. This is called the principle of inclusion-exclusion:
principle of inclusion-exclusion. Given
Set Intersection
In the Venn diagram below, the grey region is called the set intersection.
Here is the definition of intersection:
intersection. Let and be sets. The intersection of and denoted
is set of elements that are members of and members of That is, if
then and
This is a deceptively simple definition. For example, very clearly yields the intersection But here's a slighly more devilish one:
It's very tempting to say the intersection here is 3. The correct answer, however, is
The slip is forgetting the fact that the intersection operation always returns
a set. As an aside, because of how long the word "intersection" is, the symbol
is often read "cap" (again from its corresponding LaTeX command,
\cap
). For example, the expression
is often read " cap "
Big Intersection
Like the union operator, we have some shortcut notation for taking many intersections (the "big cap" operator).
big intersection. Let be sets. The intersection of these sets may be denoted as:
Cardinalities of Intersections and Unions
Suppose we had the following sets and
As a Venn diagram, sets and are visualized as such:
We see that the union of and is:
The cardinalities of the sets and the union:
Notice that the cardinality of the union is not simply This is because simply adding and results in counting the elements in the intersection twice. Accordingly, to compute the cardinality of we must subtract This yields the following formula:
lemma. Let and be non-empty, finite sets. The cardinality of is given by the formula:
From the above formula, we can derive a second formula for the cardinality of the intersection:
lemma. Let and be non-empty, finite sets. The cardinality of is given by the formula:
Disjoint Sets
Some sets have no elements in common, and we call them disjoint. For example, the two sets below are disjoint:
Here's an explicit definition:
disjoint sets. Let and be sets. If
then and are said to be disjoint. That is, and have no elements in common.
Disjoint Union
Say we had the following sets:
The sets and have nothing in common. I.e.,
What if we want to express the set
while maintaining the fact that and are disjoint? The notation doesn't tell us anything about whether and are disjoint. Fortunately, we have a special symbol for this, called the disjoint union:
This is simply the union operator, but with the added restriction that its operands and must be disjoint.
Set Difference
Let and both be sets. The difference of and , denoted (read " set minus ) is the set of all elements in and not in In other words,
More formally:
definition: set difference. Given sets and > is the set containing all the elements in that are not contained in
We say that is the complement of with respect to
Complements
The complement of a set more generally refers to the set of all elements not contained in If we have a defined universal set and then the complement of is the set of all elements in that are not inside
definition: complement. Suppose is the universal set. The complement of the set denoted is the complement of with respect to Hence:
Or, alternatively:
Set Identities
Knowing the set operations, it's often helpful to know several set identities by heart to quickly perform deductions. These identities are presented below:
Name | Identity |
---|---|
Identity Law 1 | |
Identity Law 2 | |
Domination Law 1 | |
Domination Law 2 | |
Indempotent Law 1 | |
Indempotent Law 1 | |
Complementation Law | |
Commutative Law 1 | |
Commutative Law 2 | |
Associative Law 1 | |
Associative Law 2 | |
Distributive Law 1 | |
Distributive Law 2 | |
De Morgan's Law 1 | |
De Morgan's Law 2 | |
Absorption Law 1 | |
Absorption Law 2 | |
Complement Law 1 | |
Complement Law 2 |
Symmetric Difference
The symmetric difference between two sets and is defined as:
That is to say, the symmetric difference of sets and is the set made up of all the elements which are in or but not both. We can think of this as akin to the "exclusive or."
The symmetric difference has the following properties:
property i | |
property ii | |
property iii | |
property iv | |
property v |
Power Sets
A mathematician requested a VISA to enter the United States with his family (him, his wife, and his son). The embassy replied, "Please submit passport photos of your family." Unsure whether multiple persons were supposed to be in each photo, the mathematician took one for every combination. Each on their own, him and his wife, him and his son, his wife and his son, etc. Photos taken, the mathematician returned to the embassy.
The clerk was dumbfounded. After several minutes she said, "There's no one in this photo."
"Yes. That is a photo of the empty set."
This amusing exchange illustrates the notion of a power set. Consider the set the set of binary digits. Using our definition of subsets, what are all the different possible subsets of We can lay them out:
There are four different possible subsets. Suppose we place all these possible subsets in an entirely new set:
We call the set above the power set of and denote as:
The symbol is used in these notes, but other symbols are used throughout mathematics: and a few others. Defining the power set more formally:
power set. Given a set the power set of is the set of all subsets of the set The power set of is denoted
One peculiar application of the definition above is with the empty set. What is the size of the power set of It is 1, since the only subset of is Next, what is the size of the power set of It is 2, since If we continue this pattern:
The pattern above yields to a direct corollary (as well as several other ideas we will explore in the sections on number theory):
corollary. If as set has elements, where then
The proof of this corollary: Let be a set. For every subset of there are two possibilities for each element Either (1) is in the subset, or is not in the subset. Therefore, it follows that for elements, there are possibilities in making a subset of From the definition of the powerset, it follows that the number of subsets of is
Big Unions & Intersections
At this point, we have a much better understanding of basic set theory. We can now introduce the generalized forms of unions and intersections. Because sets comply with the laws of associativity, the sets and are unambiguous. In other words, we don't need parentheses to communicate what we mean. is the same set as which is the same set as The same goes for
For example, suppose and Then and The set is the set containing all the elements that occur in at least one of and The set contains all the elements in all three of and We can extend this idea to any number of sets:
From the definition above, it follows that the notation represents a set. Accordingly, when we write we imply that is a member of at least one of This idea also applies to intersections:
Special Number Sets
Certain number sets are used so frequently in mathematics that we denote them with symbols.
Symbol | Set |
---|---|
The set of all real numbers. E.g., | |
The set of all negative real numbers. E.g., . This set does not include 0. | |
The set of all nonpositive real numbers. This is the set of negative real numbers including 0. | |
The set of all positive real numbers. E.g., . This set does not include 0. | |
The set of all nonnegative positive real numbers. This is the set of all positive real numbers including 0. | |
The set of all rational numbers. These are numbers that can be expressed as , where and are integers. E.g., . | |
The set of all irrational numbers. This is the set of all numbers that are not rational numbers. E.g., . | |
The set of all integers. This is the set of what we refer to as the counting numbers, and their negative counterparts. E.g., . | |
The set of all negative integers. This is a subset of . It consists of all the negative integers. E.g., . This set does not include 0. | |
The set of all positive integers. Another subset of . It consists of all the positive integers. E.g., . This set does not include 0. | |
The set of all even numbers. This is the set of all integers that can be written in the form , where is an integer. E.g., . | |
The set of all odd numbers. This is the set of all integers that can be written in the form , where is an integer. E.g., | |
The set of all natural numbers. This is the set of all counting numbers (equivalent to the set of positive integers). E.g., . | |
The set of all prime numbers. E.g., . |
Footnotes
-
When Cantor first introduced set theory, the math community largely ridiculed him. Over time, however, the theory gained big-name mathematicians like David Hilbert. Nevertheless, the ridicule remained and affected Cantor deeply. Comments at the time captured the split: David Hilbert famously said, "No one will drive us from the paradise which Cantor created for us." To which Wittgensteinβa philosopher and one of Hilbert's most formidable opponentsβresponded, "If one person can see it as a paradise of mathematicians, why should not another see it as a joke?" β©
-
A set consisting of only one member is called a singleton. β©
-
In biology, a taxon (an instance of a taxonomy level, e.g., kingdom, phylum, class, etc.) with only one superset is called monotypic. Aardvarks, the Albany pitcher plant, the hyacinth macaw, among others are all monotypic. β©
-
Negative cardinality is a concept explored in modern mathematics, but it is beyond the scope of these materials. β©