Philosophy of Science: Scientific Theories As Models

Samuel Mitchell,

Philosophy

Donal O'Shea,

Mathematics

Index | Intro to Scientific Theory

What is a scientific theory? An Introduction.


Statement of purpose and preliminaries.
Objectives.
This is intended to give a readable introduction to recent developments surrounding the question "what is a scientific theory?" Its audience is students who are interested in philosophy, but beyond that very little will be assumed. I don't even presuppose that students have had a course in logic. It will be helpful to have some interest in the natural sciences, but an elementary college level knowledge is all that is needed. A good high school level of exposure will probably be enough.
A second objective, more ambitious, is to make the topic at least slightly interesting. The material is somewhat technical, and for a good reason. Philosophers are interested in what a scientific theory is because they want to use to answer to go on to look at further questions. Intuitively interesting questions, such as whether there is a difference between science and religion, or how the sciences explain things, turn out to need an answer to the question of what a theory is before they can be adequately discussed and answers proposed. But our best examples of scientific theories are often explicitly mathematical and always are carefully phrased. We wouldn't dignify a topic with the name of science unless it was systematic and attempted to avoid ambiguity.
So I'll proceed slowly, systematically explaining the various views that have been proposed, I hope in a comprehensible way. I'll try to constantly relate the issues to examples, and I'll try to propose things to do to illustrate some of the points I'll be trying to make. For example, some people (some of my students for example), think that the emphasis on systematic theories and the use of mathematics is constricting in the sciences. You might usefully try to think of theories that have been influential that lack this feature. (Does Darwin's original statement of the theory of evolution count?) Perhaps the emphasis on formal and mathematical theories in the sciences is misplaced, but it can hardly be denied that it exists. Things are unlikely to change unless paradigmatic examples of scientific theories can be proposed that lack this feature.
I do hold a position on these topics, and will state it as it arises. I will do my best, though, to present the issues in a balanced way.


Preliminary considerations.
The first point to make is that the word 'theory' in this context does not mean 'conjecture' or 'speculation'. In ordinary language, a discussion of why it is that the President of the University unexpectedly resigned can often include the proposal of various 'theories' with virtually no evidence in their favor. (Alcoholism, affairs with students etc.) We sometimes say we have a theory about something when we mean we have a proposed explanation, or a link to something else we know. But every explanation or interconnected view that scientists use is called a theory and this need not indicate that anyone is uncertain about it or lacks good evidence in its favor. Things that we now accept without question, such as the fact that illnesses are caused by specific kinds of diseases, were once called "theories", and some still are. (When you think about it, illnesses might form a continuous spectrum, rather than being classifiable into different discontinuous kinds.) We still talk about the theory of relativity, although no one is in any doubt about it (well, almost no one). It's a theory that the earth goes around the sun, and a theory that there are organisms too small to see. Many contemporary philosophers believe that our ordinary view of the world in everyday life is a theory, because there is an interconnected set of expectations about the things we see around us. It does sound odd to think about a 'supermarket theory', but when you try to program a computer to answer simple questions about supermarkets, its surprising how much you have to tell it.
A second simple point concerns the practice of working scientists. It is not unusual to find scientists distinguishing between a hypothesis and a theory. A hypothesis, as they understand it, is a claim or conjecture that has no evidence in its favor. When we get evidence in its favor, it's called a theory. Once again, this isn't what the word 'theory' means in the philosophy of science.
Very loosely defined, in the philosophy of science a scientific theory is an interconnected group of claims that are capable of being supported or refuted by possible observations and which is capable of providing scientific explanations. There need not be any actual evidence in support or contrary to it, so long as there could be, and similarly, no-one need actually appeal to it to explain anything, as long as they could do so.
It might seem that just any set of claims will satisfy this loose definition, so that we'll have to count political speeches and children's stories as scientific theories. Or it might seem that the definition is completely uninformative until we get a theory of what it is to give a scientific explanation and how scientific confirmation by evidence works. It turns out, though, that disputes about what it is to confirm or explain in science depend upon a view of what scientific theories are. We have two rival theories of what it is to be a connected series of claims that are capable of explaining and facing the evidence, which are independent of other rival theories of what it is to explain or confirm scientific theories. We'll be looking at these two rivals in detail.


A brief history of the problem.

There are two rival views of what a scientific theory is, the syntactic, or received view, and the semantic view. The syntactic view developed first, and views a scientific theory as a set of sentences. The semantic view of theories views them as a class of models. In the next two sections we'll look at these in detail, and then at the various points that might be made in favor of one or the other.

The syntactic view has origins in antiquity. Euclid's Elements of Geometry begins with a few simple definitions and a set of five axioms. These axioms supposedly represented a set of basic truths about geometry, for example, "we can draw a straight line between any two points". From these axioms a sequence of indubitable inferences or rules of inference enabled the construction of proofs of various results. Each result is called a theorem of Euclidian geometry.

Euclid's great achievement was greatly admired and often emulated. Although originally a mathematical theory, it could easily be adapted to a scientific theory, and is clearly Aristotle's ideal of a scientific theory in the Posterior Analytics. Probably the most famous example of it is Newton's mechanics. Newton's three laws of motion are probably familiar to those of you with some high school physics. Together with some initial observations, usually called initial conditions and Newton's law of gravitation, Newton was able to prove theorems that predicted when planets would occupy positions in the sky, just as Euclid proved his theorems.

Around the turn of the century the great philosophical and mathematical genius Gottlob Frege succeeded in formulating a set of rigorously defined rules of inference that are usually called predicate logic and which are taught in every basic logic course. Instead of this term, we will use the term first order logic for any system of inference rules like these that always preserve truth, and can be rigorously defined in a way we'll look at shortly. When the laws of a scientific theory are used as axioms, and the theorems are the result of operating on these laws and initial conditions, we will say that this is a syntactic view of the scientific theory. The syntactic view was extremely popular, and virtually everyone used it for several decades.

Popular and powerful though it was, the syntactic view suffered from several disadvantages. First, although the laws were rigorously defined, they were extremely cumbersome to use, and often difficult to apply to the use of actual scientific theories by actual scientists, who tended to use less well defined, but more convenient, forms of inference. Second, because the rules of inference operated upon sentences, the syntactic view could not avoid becoming embroiled in debates about language, which were often difficult, and tended to distract philosophers of science from the practice of science itself. (For example, Newtonian mechanics in French seems to be a different theory from Newtonian mechanics in English according to the syntactic view, because the sentences are different. This is absurd. But to say they are really the same theory means giving an account of when French words mean the same as English words - which is very difficult to do, and nothing to do with Newtonian mechanics.)
Another criticism was rather technical. We ordinarily assume that points in space can be any distance from a given location. That is, given a straight line in space we normally assume that whatever is true of any number we can think of will be true of the number of meters along that line. This is part of what we intend to do when we measure space by numbers of meters. Well, it turns out that whatever set of axioms you use for your scientific theory, first order logic will always fail to give you some truth about numbers, and hence will always fail to give you some truth about some number of meters from a fixed point. But, the objection goes, scientists always intend all the truths about numbers to be truths about numbers of meters. So first order logic and the syntactic view of theories will always omit to say something that scientists intend to be true when they use numbers of meters to describe space. This is a consequence of Gödel's proof, which is one of the great achievements of twentieth century mathematical logic. The proof show that no axiom system and system of proof can prove all and only the truths of arithmetic. Kurt Gödel first proved it in 1931.

Because of all this a new view, the semantic view of theories came to prominence in the nineteen seventies and eighties. In order to get an intuitive grasp on this, it is necessary first to understand the rather odd idea of a mathematical structure or model.

Think of a chessboard at the start of a game. All games of chess begin with the same arrangement of pieces, and so every game looks the same at the beginning. Imagine this 'start point' of the chessboard as a location, perhaps as a photograph pinned to the top of a billboard. There are exactly twenty legal first moves in chess, so take a photograph of the way the board looks after each of these and line them up as twenty 'first move' photographs each connected by a piece of string to the 'start point' photograph. To each of these there are twenty legal responses, so take four hundred photographs of the chess board after two moves and connect each legal move to the photograph of the board that it precedes by a piece of string. After two moves, there are four hundred possible states of legal chess.

Obviously we could continue the exercise. The billboard would rapidly become unmanageable, but that need not stop us. We could store the legal chess moves in a computer and program it to generate all the legal moves for us. Or we could store them in a vast set of index cards, or on a CD (actually, we'd soon need a lot of CDs). If we kept going, we'd be constructing a list of every possible game of chess, most of them very silly.
If you're familiar with the rules of chess, you'll know that if we keep going, eventually every game has to end. Obviously, storing this huge amount of information in a physical object like a computer is impossible, but that doesn't stop us talking about it or thinking about it. We can, for example, get very rough estimates of the number of possible states of the game, and we can sometimes show that a particular photograph of a chessboard could not be the result of any sequence of legal moves.

What we're thinking about is really an abstract mathematical object. It's actually called a 'tree' because new states of the chessboard arise out of old ones, but there are no circular paths along the pieces of string. The tree of legal chess games doesn't exist physically, but it's an abstract object. There are facts about it; some sentences about it are true and some false, so it's not quite an imaginary object. Its more like a set, or a direction, it isn't a physical thing, but we can reason about it and use it.
In the same way, we can conceive of future courses of events for physical objects in the universe. When we state a scientific theory we are forbidding some courses of events from a given starting point and allowing others. We can think of the path of a billiard ball along a straight line, for example, as a line sloping upwards, with the time dimension as the vertical one. If the ball is slowing or speeding up appreciably, the line will be a curve. We can simply think, intuitively, as a scientific theory as a tree of possible future states of the universe. Actually, there is a forest. A tree grows from any possible initial state of the universe.

If you like, you can identify these courses of events with sets of numbers. These sets will have a rather complicated nested structure. The various set-theoretic representations of models are a mathematical way of representing models, and presumably different scientific theories will be more conveniently represented as different mathematical representations. Included in these complicated pieces of set theory will be, for example, ordered pairs of numbers that scientists regard as the energy of a photon in electron volts followed by its frequency per second. Scientists can then compare these numbers to those discovered by experiment to discover whether the theory accurately matches reality.
(An important word of caution is in order here for students who have had logic. The word 'model' is also used for an interpretation that makes all of a specified set of sentences true. This is not the kind of model we are talking about here. The consistency and completeness proofs of first order logic demonstrate that, given a set of sentences, the interpretations that make that set true are just those in which the sentences derived from that set by the classical proofs are true. That is, if we can prove a sentence from members of this set using classical logic, that sentence is true if and only if the premises of the proof are true. But the interpretations that make, for example, the sentences of Newtonian mechanics true will include non-standard models of the real numbers, which are not included in the models of Newtonian mechanics that the semantic view of theories allows. And if you didn't understand this paragraph, you can safely ignore it.)

Now let's look at the two views in more detail.

The syntactic view of theories.

1. What is a proof anyway?


Since antiquity, mathematicians have constructed proofs. These proofs started with certain apparently undeniable truths, and explicitly drew out their consequences. These consequences were as explicit and obvious as possible, so that anyone who doubted the theorem would be led by indisputable truths from the indisputable premises to acknowledge, eventually, the truth of the conclusion. But what made the original truths undeniable, and what made the inferences so certain? Historically, philosophers and mathematicians have appealed to the light of human reason, instilled by God or simply stated to be a fact about human nature. The German philosopher Immanuel Kant proposed a particularly brilliant and sophisticated explanation that held sway for over a hundred years, until the early twentieth century.

By then, mathematical developments had thrown previous accounts of proof into disarray. For one thing, the most conspicuous success of a proof system, Euclid's Elements of Geometry was under challenge from mathematicians. Euclid's apparently undeniable axioms had been challenged, and various very odd new geometries had been suggested. These were certainly unfamiliar, and unexpected, but if the traditional account of proof were correct, they had to be more than that. For if the traditional account were correct, Euclid's axioms were undeniable truths, so the alternatives had to have something profoundly wrong with them; they couldn't be true. The new geometries were peculiar, but not that peculiar. Even if they weren't true, it looked as if they could have been if the universe had been different. (As it happens, one of them actually is true, but even before that fact was known, the traditional view of proof was abandoned.)

First order logic provided a nice clear account of what it was for a set of sentences to be impossible. Let p be any sentence (e.g. 'the sky is blue' or 'sugar is poisonous' or 'all cats are mammals' - each of these could be p). Now let not p be the denial of p (e.g. 'the sky is not blue', etc.). A set of sentences is impossible if, were it true, both p and not p would have to be true too. In such a case, we say the set is inconsistent.
But what does it mean to say that if a set of sentences were true, other sentences would have to be true too? Again, first order logic had an answer. In elementary logic, students learn a set of inference rules. If the course is sophisticated, students also get to prove that, if some other sentence must be true if a set of sentences is, then we can get that sentence as a conclusion if we apply these inference rules in the right way.
And hence, we get a very nice account of a proof. Something is a proof if and only if it is a string of sentences each of which we get from the previous ones by the application of the inference rules. A set of sentences is possibly true if and only if you cannot prove both a sentence and its negation from that set using the inference rules. (A sentence is necessarily true if and only if it can be proved using no assumptions.)

The rules of inference had a very nice property. Intuitively, we can say that once a very simple set of words are defined, we can tell whether a rule can be applied to some sentences just by looking at whether they contain these simple words in the right order. For example, if we only define 'all … are…' then we can tell that from 'all cats are mammals' we can infer 'if Winnie is a cat, then Winnie is a mammal'. The words '… is a cat' and '…is a mammal' could mean anything, and 'Winnie' could refer to anything, as long as it is a name.

Put another way, whether a sentence can be inferred by the rules from another depends only on the syntax of the premise. That is, it depends only upon which words occur in which order. A computer that knew nothing of the world, and was programmed only to recognize certain words, could be programmed to apply the rules of inference to sentences fed as input. The 'shape' of the sentences alone determines whether the rules of inference apply, not what they mean. Of course, what we're trying to capture does depend upon meaning, for if an inference rule applies to a sentence, then the conclusion we draw using that inference rule must be true if the original premise was, and whether sentences are true or not depends upon what they mean. The useful thing about first order logic is that we can tell something about the relations between sentences so far as truth is concerned purely by looking at their syntax.

The syntactic view of theories uses the inference rules of first order logic to account for proofs from statements of scientific laws and initial conditions. That is where the name 'syntactic view' comes from.

Now lets look at a simple example of the syntactic view in action.

2. A simple example


Start with a simple statement of two laws: 'The density of an object is equal to its mass divided by its volume' and 'An object floats on water if and only if its density is less that that of water'.
Then add some initial conditions: 'Fred is an object and Fred's volume is 8 cm3', 'Fred weighs 4 g', 'the density of water is 1 g/cm3'.
It seems to follow immediately that 'Fred floats on water'. However, if you've done some logic and try to prove it using your inference rules, you'll find you need an additional premise or two: '4 divided by 8 is 0.5' and '0.5 is less than 1'. It's an important point that these rules need to be explicitly added. The advantage of first order logic is its systematic, explicit, and syntactical nature. Its disadvantage is that it is unbelievably cumbersome and tedious to actually apply. Although the overwhelming majority of inferences that mathematicians draw can be spelled out in terms of first order logic, mathematicians almost never go through each tedious step. Rather, their familiarity with mathematics allows them to jump directly to the mathematically significant conclusion without spelling everything out in the tortuous detail that first order logic would require.
The conclusion we proved from this trivial theory, 'Fred floats on water', is something we can check to be true or false. Hence, according to the syntactic view, predications issue from a theory simply by being proved from its axioms, which are the laws of the scientific theory and the initial conditions.

3. Empiricism


There is an important historical note to add to the story as I have told it so far. In the heyday of the syntactic view of theories, a position known as empiricism was very popular. Consequently, the syntactic view of theories was often associated, in its statements, with the attempt to articulate empiricism. Empiricism is the view that observation is the sole source of knowledge of the physical world about us, and that its range is very limited. So while we could observe whether something was water or not, and whether something else floated in it, we could not directly observe things like densities or masses of objects. These quantities had to be inferred from things that we could directly observe, such as whether a scale balanced, or which parts of an object were in contact with gradations on a ruler. For empiricists therefore, the initial conditions stating Fred's mass and volume were themselves conclusions drawn by a theory from other pieces of data. The importance of this was that virtually no significant sentence of a scientific theory could be tested by itself. Some link was required between the data from which Fred's length and mass were inferred, and the length and mass. An example would be: 'if a scale with a body labeled "8 grams" balances with a body, then that body weighs 8 grams'. Coupled with the sentence 'Fred is a body, which balances on a scale with a body that is labeled "8 grams"' we can infer using first order logic again that Fred weighs 8 grams.
But obviously even this is too simple. Not every body labeled "8 grams" actually weighs that much and we have to know what a scale is, and when one is accurate. Hence, more scientific theories entered the picture, and these in turn introduced yet others. Only the entirety of scientific knowledge at a time, it appeared, actually faced the data. We almost invariably needed to introduce a sentence that linked directly observable and indirectly observable objects and quantities whenever we tested a sentence of a scientific theory. Such bridge principles, as they were often called, then needed further evidence to establish them, and an infinite regress started.

Not only that, but the empiricists of the day argued that only words that referred to directly observable objects and properties had any immediately ascertainable meaning. The meanings of other words, like 'density' and 'mass' were inferred from our understanding of words that referred to directly observable things, like 'this floats' or 'this pointer and the numeral "8" are touching'. What our words mean is, after all, a matter of the way in which we use them, and constitutes a theory. So that theory, the theory of meaning, is known only because of the connections between observed utterances of words and the observable circumstances that obtain when the utterances are made. It seemed to follow that we must learn words for directly observable objects and properties first and then infer from these the meanings of words for other things.

4. Pragmatism.


This emphasis on language as a central concern for philosophy waned with the work of Willard van Orman Quine which was very influential in the nineteen sixties and seventies. Philosophers in general, and philosophers of science in particular, began to see the concerns of their subject as much more intimately connected with the study of the way scientists actually worked and what they discovered about the way humans think. Consequently, the issues in the philosophy of language that the earlier protagonists of the syntactic view had introduced came to seem irrelevant and distracting. Moreover, Quine's work also shifted philosophy away from empiricism and towards pragmatism. It is difficult to state exactly what pragmatism is in a brief way, but very roughly it includes the view that the only way in which a true theory may be recognized is through its practical success, especially success with the data. It is possible to be both a pragmatist and an empiricist simultaneously, but those pragmatists who were not also empiricists usually emphasized that various incompatible theories might be equally practically successful. Because the data did not allow us to distinguish between these rivals, and because we had no other means of identifying which was true, we selected from the theories by simply using those that were most convenient for us. We prefer simple and powerful theories, for example, because they are easy to use in a variety of circumstances. Unlike empiricists, pragmatists were generally not worried about believing in the existence of objects that were indirectly observable. So long as these objects were helpful to us, they earned their keep and were perfectly acceptable.

The rise of pragmatism meant that the syntactic view of scientific theories dropped many of its connections with specifically empiricist concerns. The desire to avoid commitments to anything but directly observable entities ceased to be of central interest, and the desire to explain the meanings of all words in terms of those that referred only to directly observable entities vanished almost entirely.

But the syntactic view itself continued to be the dominant view of scientific theories. This is an important point to note, because some of the criticisms of the syntactic view have focused upon those aspects of it that were closely linked to the earlier empiricism. It is true that the original proponents of the syntactic view were virtually universally empiricists, and that their views on meaning and observation have come under such sustained attack that no-one now can embrace them without a lengthy and detailed defense. Nonetheless, Quine himself continues to hold the syntactic view, and the collapse of early twentieth century empiricism occurred prior to the widespread acceptance of the semantic view. The view that there is no genuine distinction between directly and indirectly observable entities and that problems of meaning can be safely ignored in philosophy is quite consistent with the syntactic view of scientific theories.

The Semantic View.


What do real scientists do with actual theories?
One of the most attractive features of the semantic view of scientific theories is that it seems to focus directly on the things that actual scientists say and do in practice.

This gets more obvious as the theories get more sophisticated. Even for simple theories, though, it can be made plausible. Consider the example just given of calculating the density of something. One of the problems we encountered was that under the syntactic view, all the mathematical premises had to be made explicit. We had to add the sentences '1/2 is less than 1' for example. This seems to be a very unnatural interpretation of what we actually do in practice. What we seem to do is identify masses with numbers, volumes with numbers, and densities with numbers. We then use the properties of numbers and their relations to each other to get the number we've identified with the density from the numbers we've identified with the masses and volumes.

The important point here is that the sentences, just by themselves, need not mean anything specific. This symbol: '1', for example, need not refer to the number one. Instead, it could be the name of the Eiffel tower or the Pope, or Mahler's fifth symphony. The big advantage of first order logic, as an account of proof, was that we did not have to worry about what symbols meant, by their positions relative to one another alone we could discover whether something was a proof or not. But the big disadvantage of this is that some symbols that are not simply matters of logic, but of mathematics, seem to be regarded as having specific mathematical meanings by scientists in every case.

The semantic view acknowledges from the start that mathematics will be regarded as privileged. What scientists actually do when they propose or apply a scientific theory, the semantic theorists say, is to create a mathematical object. They then identify parts of this object with parts of the world. This enables them to design experiments and make predictions. The 'proofs' that scientists perform in this process are simply a means of developing predictions from a mathematical structure that they have postulated beforehand.

There are a variety of accounts of how scientists do this. Instead of investigating them directly, I will try to develop the simple example already given.

The Semantic View: A simple example.


I'll begin with some elementary properties of the real numbers. The first order of business is to say what 'real number' means. Children learn early how to count things ('one, two, three…'etc.) There are obviously things that they count (ducks, blocks etc.), but the numbers themselves are not these things. (What thing corresponds to zero? How does the same card manage to be both three when we're counting aces and five when we're counting hearts, at the same time?) Some people have concluded that there are no such things as numbers, but this view turns out to be surprisingly difficult to sustain. (If there are no such things as numbers, how come there's a right and a wrong way to balance a checkbook? How come there's a right and a wrong way to solve equations and differentiate?) So lets suppose there are such things as numbers. Obviously they don't have a location; it would be absurd to look for the number three. We can't see them, at least not without considerable chemical assistance. They don't get any older; there is no birthday for the cube root of two. Still, that need not prevent them from existing. One might say the same things about space or the constitution of the Iroquois (or God, or singularities of space-time).

Logicians call the numbers 0, 1, 2, 3, 4, … the natural numbers. (Mathematicians tend to omit zero in their use of this term.) If we add the negative numbers … -3, -2, -1, 0, 1, 2, 3 … we get the integers. Obviously both are infinite in number, but then so might be points in space or time. Now add all the numbers that are fractions, e.g. ½, 4/5, 1289764/-1 etc. This set is called the rational numbers.
You might think that the rational numbers are all the numbers there are, but in fact numbers like pi and the square root of two cannot be represented as a fraction. If you find this surprising, you're not the first person to do so. The numbers were once called 'surds', and it is possible we get our word 'absurd' from this root ('ab' means 'from'). Today, we call these numbers irrational numbers. If you add the irrational numbers to the rational numbers you get the real numbers which are all the numbers of any unit of length (meters, for example) in a straight line of infinite length and arbitrary zero point.
(Proof that there are irrational numbers. I will prove that the square root of two is irrational. Suppose it is rational, that is, suppose it can be expressed as a fraction. I will now reduce this assumption to absurdity. If it's a fraction, it must have a lowest form, let m/n be that form. (m/n)2 = 2, because it's the square root of two. So m2 = 2n2, so m2 is even. Therefore, m must be even, because all odd numbers have odd squares. So m2/2 must also be even, because half of the square of an even number is always even. But m2/2 = n2, so n2 is even. So n must be even. But then both n and m are even and so n/m cannot be the square root of two in its lowest form. This contradicts the stipulation that it is. Hence, the assumption upon which that stipulation is based, that the square root of two is a fraction, must be false. Neat huh? The Pythagoreans proved this about five hundred years before Christ was born.)


The semantic view of theories allows considerably greater mathematical sophistication than just the real numbers, but for simplicity's sake I'll stick to them. The major issues arise with respect to these numbers, and I think they are evil sufficient unto the day.


Now think about how we actually calculate the density of something. We identify non-negative real numbers with the mass and volume of the object and divide them. The operation of division is defined for the real numbers, so this is quite straightforward. If you like, you can think of mass as the x-axis of a graph and volume as the y-axis, then imagine a third axis at right angles to the other two, the z-axis, as the volume. Then the simple theory states that any object will fall somewhere on the surface defined by z = x/y. Not only that, but different samples of the same pure substance will always fall on the same point on the surface, and any sample of any solid will float on any liquid that has a lower density. If necessary, this simple structure could be elaborated, for example by adding another dimension for temperature and relating changes of density to temperature.


This is what the semantic view means by saying that we construct mathematical models when we construct theories. We simply identify quantities like mass and volume with real numbers and use relationships between those numbers (for example, multiplication and division) to get numbers which we then identify with other quantities. There are different views about how this is done, proposed by different thinkers, but this is the core idea.


The semantic view of theories really does well with mathematically sophisticated theories such as space-time theories and quantum mechanics. Here we need to integrate and differentiate vectors in spaces, and the mathematical structure of the theory is quite sophisticated. One shouldn't think it is only in these areas that the semantic view is useful. Models of genetics can also be constructed. Games like chess, and relationships like those found in the periodic table are also amenable to this treatment.


The important distinction between the semantic and syntactic views of scientific theories is that the semantic view proposes a specific, privileged interpretation for scientific theories. Usually, though not invariably, this is proposed to be the one that the scientists themselves are using. For example, in the case of density, we could use only the rational numbers to represent the densities of objects, because every measurement we actually make can be represented as a rational number. (To measure something to possess an irrational value for a quantity we'd need to have infinitely precise measuring instruments, and this we obviously don't have.) But scientists themselves speak, calculate and write as though the real numbers are what they mean, and so this is preferable to represent the model. We take the scientists at their word, and suppose that the models they are talking about are the ones that they say they are.


I have said that we "identify quantities with real numbers" but how do we do this? The literature on the semantic view is not always clear here, but I think the answer is something like this. It is just a fact that certain operations we can perform with objects can successfully be predicted based on relationships between numbers. An example will help demonstrate this. Suppose you have a simple measuring device such as the bathroom scales. Now suppose you put a couple of objects on it, yourself holding the dog for example. Note the numeral that appears on the scales. Now put each object singly on the scales and note the numeral that appears. The relationship between the total weight of objects on the scales at a time and the numeral indicated at that time can be predicted by the following procedure. Take the number that the numeral names according to our ordinary conventions and suppose that that is the total weight of the objects on the scales at that time. We can predict, for example that you and the dog together will indicate the numeral that names the sum of the two numbers indicated by the numerals that appear when you and the dog are on the scale separately.


Put another way, it's just a fact that mathematical objects and relations are 'like' physical objects and relations. We discover general relations between physical objects that obey this 'likeness' relation, and then we can use mathematics to predict their future courses of events, or design experiments. We can even give an account of what the 'likeness' relation is in terms of mathematical models, and hence model the thing we are doing when we use mathematics to make predictions.


A final point, which I find interesting. Whether one is a syntactic or a semantic theorist, one faces a difficulty if one is a realist. At even moderately advanced levels of physics these days, there is a very deep involvement of physical theory with mathematical objects. Obviously, this is something that the semantic theorists emphasize. But according to any sort of realist, there ought to be a distinction between a physical object - one that is physically real - and a mathematical object - one that is part of the mathematical structure of the theory.


Bas van Fraassen has pointed out that it is very difficult to draw this distinction. It's difficult to say whether the wave function in quantum mechanics is a physical or a mathematical object. It's even quite difficult to say whether its square is a physical object, although it has a direct physical interpretation as a probability. What properties do physical objects and properties have that make them different from mathematical objects and properties? The question is quite difficult even for some 'easy' cases. For example, suppose we propose that a sufficient condition for being a physical object is that an object transmits energy. That makes photons physical objects. But what about virtual photons? These odd beings have been proposed as carriers of electrical energy. It's difficult now to see whether they are a mathematical device or a real carrier of energy. A realist is forced to draw a distinction between the mathematical and physically real parts of a theory when neither can be directly observed. If the proposal is that physically real things can be indirectly observed, then the question immediately arises whether we can indirectly observe virtual photons, wave functions etc. and the answer appears to be that we can do so if these are real. Clearly, that will not do as an explanation just by itself. The problem strikes me as very difficult and very interesting, and seems to me to have received too little attention.

Syntactic or Semantic view?
A dialog.


Syn: Let's begin by stating the advantages and disadvantages we see in each other's positions.
Sem: O.K
The main advantages I see in my position fall under three headings:
First, it characterizes scientific theories directly in terms of what scientists seem to be talking about when they discuss and use their theories. It easily handles their statements directly, in terms of what they seem to be talking about. Consequently, it is very easy to apply to real examples.
Second, it avoids problems in the philosophy of language, which aren't helpful to philosophy of science, where we examine the activities of scientists themselves, (which is what science is, after all).
Third, your position uses axioms and first order logic to derive results from them, but this isn't how we reason, and your position ignores the sophisticated mathematical structures we require to formulate scientific theories.
Syn: Well, that seems to me to cover both your view of the disadvantages of my position as well as the advantages of your own!
Let me begin by replying to the claim that I worry too much about language. It seems to me that the only way to propose a scientific theory is to say something, and the only way for someone else to know what your theory is is to understand what it is you say. We're not gods, we can only smear dyes and shake the air to communicate, and so we shouldn't ignore language. In fact, that is precisely what is wrong with your approach; it ignores language.
Sem: Well my reply to that is that the philosophy of science is not about language. We're trying to examine what it is that scientists actually do, to solve the problems that arise for scientists themselves, or arise naturally from their disciplines. Can you name a single debate scientists have ever engaged in that depended upon the solution to some problem in the philosophy of language?
Syn: I suppose not, not directly. Maybe some of their problems have actually turned on the philosophy of language though, and they didn't realize it. Newton and Leibniz on space and time, for example.
But I don't agree with your view of philosophy of science. Its true that part of philosophy of science is examining the problems you state, but the subject isn't just a chambermaid in the great palace of science, it has problems and interests of its own, and one of them is how scientific theories are communicated.
Sem: That seems to me to be exactly the reverse of the way things really are. How we communicate is a subject for empirical investigation, not philosophical speculation. It's a matter for psychology, not philosophy, and psychology is a science, or ought to be anyway. We need to presume the scientific method works in order to get anywhere understanding language, so we're stuck with the assumption that scientists know what they're talking about, and the semantic view is the embodiment of that assumption.
Syn: I think I could agree with the view that empirical investigations are useful in understanding human use of language, but still disagree that philosophy of language is irrelevant to the philosophy of science.
It seems to me that there are philosophical problems that arise concerning science that are not scientific problems. For example, the debate between realists and empiricists. Find out as much as you like about the physical world, you'll never be able to find out whether the things you can't see are really there or not.
Sem: I don't believe that. There are arguments on that topic that stem from the actual details of what scientists do.
But I'll accept your point, at least for the sake of the argument. Suppose that there are independent philosophical problems concerning science. How would it follow that your view of what scientific theories are is any better off?
Syn: Well some of those problems might be helpfully addressed in terms of language. The philosophy of language might be helpful in solving them, or they might just involve linguistic issues.
Sem: The trouble is, there aren't any examples of such problems.
Syn: Mainstream philosophy of science doesn't concentrate on them now, perhaps because of the work of Quine, and certainly because of the fruitfulness of historical case studies in the philosophy of science. Still, it doesn't follow that those problems have been successfully addressed. Lawrence Sklar's work contains lots of examples of puzzles in actual science that contemporary approaches seem to ignore. Historically, these were addressed in terms of language.
Sem: Yeah, and which the philosophy of language notoriously failed to solve.
Look, we're using the approach that is dominant today because right now that looks like the most promising approach. If you want to propose some other approach, it's up to you to show that it actually useful, not to just suggest that it might be. That's completely unconvincing. All kinds of new approaches might be useful in addressing problems that the present approach doesn't.
Syn. I suppose you're right. At the present state of play, it's very hard to claim that introducing linguistic issues into the philosophy of science is much of an advantage. If we want to do that, we'd have to show the productivity of the linguistic approach, and I don't suppose there are many philosophers who believe that has been done.
Still, I don't think the syntactic view is at a disadvantage compared to the semantic view. What we ought to do, to secure mathematics, is simply to include the axioms of arithmetic as a part of every theory. Then we can get all the advantages you have.
Sem: The semantic view does better there too.
You want a set of axioms, the fundamental statements of mathematics, and then you want to derive the real numbers and their properties from those statements. But what rules of inference are you going to use? Your view requires that we use the methods of first order logic. But you know as well as I do that no set of fundamental statements will enable you to draw all the consequences that are true of the real numbers themselves.
Syn: Yes, that's a remarkable result. It turns out that the ordinary mathematical relations we all learned up to high school tell us about a set of objects, the real numbers. These possess properties that simply cannot be proved from any set of axioms, but which are expressed by sentences of first order logic, and therefore ought to be provable if the axioms and their consequences are to capture what it is we learned about.
You draw the consequence that scientists my view must see this as a disadvantage compared to your view. But that is a double edged sword. Why not instead draw the conclusion that your view is at a disadvantage compared to mine, for I have a nice clear theory of proof, and you do not?
Sem: Oh that's completely specious. If you're going to claim that your view can get the advantage that my own view has, that of being able to make use of the real numbers, then you must provide some account of how you're able to do this. When I go on to point out that you can't give this account, you turn around and say that I can't give an account of proof so we're even.
But if you want to make use of the real numbers, with all their properties, you can't give an account of proof either. Your account of proof and scientific theories means that every such theory must omit at least one mathematical truth about the real numbers that you acknowledge ought to be provable. If you want to keep first order logic and the axioms, you've got to give up on the real numbers, and the real numbers are what scientists intend when they talk about their theories. Or are you going to deny that too?
Syn: I'm not sure whether I have to. Let me see what happens if I think things through.
When they propose a theory, scientists have to say and write things. When they prove things, if the proof is a genuine one, they have to make each step explicit, and the proof must be effectively checkable. Now Gödel's proof shows that whatever we interpret them as meaning by their words, the interpretations will always include some that are not the real numbers, but something different.
But conversely, these possible interpretations of their words always will include the real numbers. For we are interpreting a scientist's words, and a scientist will never deny the sentence that ought to be provable from his statements if he intends the real numbers by them. That's obviously because he does intend the real numbers by his statements. So among the mathematical structures he intends is one that is privileged, the intended interpretation. And so I can say that it is this interpretation that I take to be the intended one, and all his subsequent statements will bear me out (as long as he doesn't make a mistake).
That point is not special to mathematics, it's one I would want to make anyway. Suppose a scientist says, "That's a neutron star". Since this is just a sentence, I could interpret as true if and only if that is a piece of green cheese he will never come near to. But that would be silly. Obviously, the best interpretation of his words is that by '…is a neutron star' he means the property of being a neutron star. Similarly, the best interpretation of his mathematical language is that he is referring to numbers, specifically the real numbers.
Sem: It seems to me that you've secured the advantages of my view only by abandoning your own. What you're saying is that you accept the usefulness of my way of seeing scientific theories, and that what you really meant all along was the linguistic view with the addition that we're interested only in some ways of interpreting the language.
But I don't see why talking about language isn't all an enormous irrelevant detour. Why not just stick with your preferred interpretations, which are what is doing all the work of analysis of scientific theories? Why bother with sentences and words at all?
Syn: Well, as I said, partly to give an account of proofs, and partly because scientists must speak or write when they propose a theory.
Sem: The way they speak and write gives no support to the additional structure you include. Scientists frequently say things that rely on inferences that first order logic cannot capture. They draw graphs and diagrams as well as speaking and writing.
Now perhaps you will reply that these words, pictures, and apparent inferences can all be interpreted using all kinds of models other than the one I prefer. But how does that illuminate the practice of science itself? It seems like an irrelevant philosophical grace note to me. As you know, some of the ways of interpreting the languages of first order logic are clearly unintentional and bizarre. Since you yourself admit that you want to make use of the intended interpretations that I do, why not just cut to the chase and talk about the mathematical models scientists themselves do?
Syn: I think you're right. If we're only interested in the philosophy of science as a way of studying scientists and their practices from their own perspective, then the semantic view is the better one. One might claim that there are other philosophical problems involved, which perhaps require taking a philosophical perspective rather than that of the scientists themselves. But even if one believes that, I would still have to show that the study of language and the restriction of inferences to those sanctioned by first order logic would be helpful in solving these philosophical problems.

A personal conclusion.


This debate represents the way I see the dispute between the two views of scientific theories. The semantic view seems to me to have the best case in its favor.
Nonetheless, I do not favor the semantic view myself. The reason is that I remain firmly convinced that the most serious philosophical problems we face today concern language, although they may also be phrased in terms of the confirmation of scientific theories.
In the debate above, both the semantic and syntactic theorists assumed that there was simply no problem about us intending the mathematician's view of the real numbers by our words. Nearly everyone believes this.
Nonetheless there are a few thinkers, originally mathematicians, who do think there is a problem here. Originally, these problems concerned a specific view of what numbers were. Numbers, these thinkers believed, were constructed out of human subjective experience, especially our experience of time. These constructions, while very wide ranging, weren't adequate to construct the real numbers as they are usually viewed.
Recently however, the philosopher Michael Dummett has re-phrased this view, called Intuitionism, from a different foundation. It is not our ability to construct numbers that matters, he has argued, so much as it is our ability to communicate, that leads to Intuitionism. The things we can say and do when we practice mathematics, he argues, are simply insufficient to indicate that it is the real numbers as ordinary mathematics construes them that we intend by our words and actions. Only the much poorer view of numbers that the Intuitionists proposed can be communicated to one another. It isn't that we intend the usual real numbers mentally, and are unable to adequately describe our intention. Rather, because we cannot understand each other in such a way as to establish that we intend the usual real numbers, we cannot even speak meaningfully about them. That is, we can't even meaningfully say "I mean the usual real numbers" or even "I am thinking about the usual real numbers", nor even "I really am thinking about those numbers that I think I'm thinking about, even though I can't communicate to you what they are!" (I say 'the usual real numbers' because Intuitionists also have a theory of what the real numbers are, it's just not as rich as the one most mathematicians would say they mean.)
It would certainly seem absurd for anyone in the philosophy department to go around correcting the mathematics department about mathematics, but I really cannot see any way of evading Dummett's arguments. My own work is that of arguing that we cannot confirm, of one another, that we understand the usual real numbers. That is, I take the business of finding out what we mean by our words to be that of finding evidence for scientific hypotheses, that is, hypotheses about what we mean. So of course, for me, the sentences scientists utter are very important, and I cannot presume that they intend the usual real numbers by these utterances.


And that is why, despite the argument above, I am a syntactic theorist. I realize that to some readers, it will be irritating to read this final, very superficial, section. If I am going to argue something, I ought to actually argue it, and not vaguely gesture in its direction and then finish. But still, this piece has been written as an introduction to students and other philosophers who are not in the philosophy of science, and I do think there is an advantage in indicating to students what a writer thinks, even if it is done very briefly.

----------------------------------------

Copyright © 1999 Mount Holyoke College. This page created by Math Across the Curriculum and maintained by Jennifer Adams. Last modified on August 8, 1999.