Space, Time, and Antonymy

Space, Time, and Antonymy∗
Jessica Rett
[email protected]
August 2012
Abstract
The empirical goal of this paper is to show a particular parallel between the domains of
space, times and degrees that I argue can be attributed to the linear nature of these domains.
In particular, I try to show that relational predicates in these domains – above and under, before
and after, and comparatives like faster and slower – denote relations between scales, and are
sensitive to the boundedness of these scales. I suggest a semantic account of these phenomena
and end by discussing the significance of this sensitivity.
1
Introduction and ontological preliminaries
There are two main background assumptions to this paper: first, I assume a typed semantics; and
second, I assume that different semantic types can differ in their intrinsic structure. I’ll discuss each
in turn.
In assuming a typed semantics I assume that natural language can refer to different ontological
domains and that it does so with discrimination. Of particular importance to the discussion are the
following entities: individuals x ∈ De ; worlds w ∈ Ds ; events e ∈ Dv ; times t ∈ Di ; degrees d ∈ Dd
and locations l ∈ Dl . I assume that each of these types of entities can be referred to, modified and
quantified over. And I assume that modifiers, quantifiers, etc. can in principle distinguish between
these types.
While this assumption is generally shared to some degree or another by many linguists, it may
be less popular among philosophers. I believe that the content of this paper constitutes a defense
for some assumption similar to the one outlined above, although I acknowledge there may be other
ways of accounting for the empirical observations made here.
The second assumption has to do with the division of these domains into categories based on
how they can be structured by natural language. I’ll assume that degrees, times and locations
can be strictly ordered, while the domains of individuals, worlds and events, in contrast, can be
ordered into sums (or semilattices, as in Link, 1983). This assumption – at least so far as it extends
to degrees, times and locations – is critical to the analysis presented below; but I’ll motivate this
assumption for the moment by using an independent construction.
Embedded questions or embedded wh-clauses of any syntactic form arguably denote properties,
at least at some stage in the semantic derivation (see Groenendijk and Stokhof, 1989; Jacobson,
∗
Thanks to Dana Kleifield for investigating prepositions with me and to Sam Cumming for help on this and other
drafts. Thanks to Heather Burnett, Gabe Greenberg, Hilda Koopman and Adam Rett for discussion.
1
1995; Caponigro, 2004; George, 2011). The relevant observation here is that, for wh-clauses which
denote properties of plurals, domains differ in the structure their plural entities can take.
(1)
a.
b.
John knows [ CP who came to the party].
John knows [ CP what Bill did yesterday].
(2)
a.
b.
John knows [ CP how far Bill travelled].
John knows [ CP when Bill jogged around the block].
individual
event
degree
time
Imagine a context with the individuals (other than John) a, b and c. The property of individuals
denoted by the who clause in (1a) can, denote any combination of these individuals and their sums
(e.g. a⊕b), depending on who went to the party in that world. Crucially, there is no dependence
between any atomic individuals: the attendance of a doesn’t logically require the attendance of
b. But there is of course a dependence between a plural individual and its atomic members: so,
given that party attendance is a distributive property, the attendance of a⊕b logically requires the
attendance of a and b. To account for this fact, Link (1983) and others have assumed that the
domain of individuals is intrinsically unordered, but can be formed into join semi-lattices whose
membership can vary from context to context.
This same point holds for the property of events denoted by the what clause in (1b). There is
no dependence between the atomic events e, f and g: there are worlds in which Bill participated in
events e and f but not g; and e and g but not f; etc. A plural event e⊕f – an event of Bill dressing,
for instance, composed of an atomic event of him putting his shirt on and an atomic event of him
putting his pants on – entails the inclusion of its atomic members in the extension of the predicate,
but this entailment doesn’t hold between atomic events. Events have been argued to form sums in
parallel to individuals (Bach, 1986; Krifka, 1989, 1995).
In contrast are the domains in (14a). These predicates are monotonic, which means there is a
clear dependence between the atomic entities in their extension. So, if d1 < d2 < d3 , then there
might be a world in which Bill travelled far to degrees d2 and d3 , but there is no world in which
Bill travelled far to degrees d1 and d3 but not to degree d2 . The same goes for the temporal when
clause in (2b): assuming an atomic, homogenous event of Bill jogging, there is a world in which he
jogged aroung the block during times t2 and t3 , but no world in which he jogged around the block
during times t1 and t3 (but not t2 ).
Two observations are in order: first, it’s clearly possible to refer to a disjoint set of degrees or
times. In the jogging situation above, Bill could be out of shape, in which case he might participate
in two separate events (e and g) of running halfway around the block, with an event (f) of panting
and wheezing in between. In this case, event e might be mapped to to time t1 , event f to time t2
and event g to time t3 . And, as Beck and Rullmann (1999) have argued, some degree predicates can
be interpreted in context as non-monotonic, as in the predicate be the quantity of people who can
form a soccer team. Degree and temporal predicates do not have to be interpreted monotonically,
but they can be (and generally are).
Second, I stated that summing domains like that of individuals and events carry dependence
relationships between sums and their atomic members but not between the atomic members themselves. In the case of the degree and time domains, this is an uninteresting distinction. If John
is tall to degree dn , then he is tall to degree dn−1 , and the sum of these degrees carries the same
semantic import as the maximal degree. The dependence relationship between the atomic entities
2
in these domains dictates the relationship between a plural entity and its atomic members.
As I described in terms of (1), I assume that the domains of individuals and events (and worlds,
which I haven’t argued explicitly) can be structured into sums by natural language predicates. In
contrast, I will assume that the domains of degrees and times (and locations, which I haven’t argued
explicitly) can be structured into scales by natural language predicates. Scales are totally ordered
under <, and are therefore irreflexive (for all a it’s not the case that a > a); asymmetric (for all a
and for all b 6= a, if a < b then it’s not the case that b < a); transitive (for all a and for all b 6= a
and for all c 6= b 6= a, a < b and b < c entails a < c); and total (for all a and for all b 6= a, a < b
or b < a). These domains have in common a homomorphism to the real numbers. This has been
argued, as I’ll show, independently for each of these domains.
There is a clear and obvious relationship between events and their runtimes, so I assume events
can be mapped onto scales via some homomorphism (like Davidson’s (1969) τ operator). And I’ve
argued elsewhere that exclamatives like What dorks they are! involve the mapping of individuals to
degrees of dorkiness (Rett, 2011). That these involve homomorphisms is clear from the fact that
they pattern with and have the distribution of times and temporal properties on the one hand and
degrees and degree properties on the other hand.
If domains of entities are divided in terms of their structures – if some can form sums while
others can form scales – then we might expect that natural language be sensitive to this distinction.
In this paper I’ll argue that this distinction between summing and scalar domains corresponds to a
distinction in what counts as the most informative member of a plural. In the case of sums, the least
upper bound always corresponds to the maximal member. But in the case of scales, the least upper
bound can be open, not a member of the set. I’ll present evidence to show that this distinction is
relevant in natural language semantics.
I’ll start by introducing a meaning alternation exhibited by vertical locative prepositions. This
will serve as an illustration of the core phenomenon to be addressed in the paper. In §3 I will set
the stage for an analysis by discussing independent evidence that natural language demonstrates a
sensitivity to scale-boundedness. I’ll then argue that scale boundedness is the key to accounting for
this same sort of meaning alternation in the degree domain. §4 returns to extend the analysis to
the original preposition data. I discuss the extension of the analysis to the temporal domain in the
Appendix.
2
A puzzle with locative prepositions
Spatial prepositions come in at least two flavors: locative Ps and path Ps. (Some, like under, can
function as both.) The former describe the location of one entity relative to another entity; the
latter describe the path of a moving entity relative to another entity. Some examples are in (3).
(3)
a.
b.
The mirror is inside the box/under the painting.
The helicopter flew into the hangar/under the overpass.
locative
path
Work on the semantics of locative prepositions has been carried out largely in the framework of
cognitive psychology or cognitive linguistics, and has focused on the effect of contextual factors like
spatial attention and focus of the speakers and hearers on the truth conditions of sentences like (3a)
(e.g. Landau and Jackendoff, 1993; Logan and Sadler, 1996; O’Keefe, 1996; Carlson et al., 2003). In
3
these theories, broadly speaking, locative prepositions are characterized as placing restrictions on
the center of mass of the located object (which I’ll refer to as the a argument; the mirror in (3a))
relative to the center of mass of the reference object (the b argument; the box or paining in (3a)). So,
for instance, the vertical locative above is true iff the center of mass of its a argument has a higher
coordinate than the center of mass of its b argument, with some additional horizontal restrictions
(e.g. a doesn’t count as above b for most purposes if a is on the East Coast and b on the West Coast).
However, the situation is more complicated than this. The antonymic vertical locative prepositions under and above display a meaning alternation: for some objects b the location of a is
restricted relative to b’s highest point, and for some objects b the location of a is restricted to b’s
lowest point. I characterize them as antonymic because the following entailment relationship holds
between them:
(4)
a is under b. ↔ b is above a.
It’s possible for the sentences in (4) to be true even when the individuals a and b overlap spatially. That is, The mirror is under the painting can be true, depending on the level of granularity
called for in the context, if the lower half of the painting and the upper half of the mirror overlap
completely in vertical space. In an attempt to keep things simple in the discussion above, I will
ignore such cases (and will therefore discuss only cases of non-overlap). This is not because such
cases are anomalous – they aren’t – but rather because the distinction I’ll be focusing on is more
clearly conveyed in cases of non-overlap.
In both sentences in (5), of the form ‘a is VLP b’, the vertical locative preposition locates the
individual a with respect to the individual b (the tree).
(5)
a.
b.
The balloon is above the tree.
The treasure is under the tree.
In fact, intuitively, each sentence positions a relative to the highest point of b. In particular, (5a)
is true iff the balloon is above the canopy of the tree, and (5b) is true iff the treasure is under the
canopy of the tree, whether it is above or below ground.1 These sentences suggest the following
informal generalization of the semantics of vertical locative prepositions: ‘a is VLP b’ positions a
relative to the highest point of b.2
However, this generalization is not quite right. In (6a), as in (5a), above positions the balloon
above the highest point of the house (its roof).
(6)
a.
b.
The balloon is above the house.
The treasure is under the house.
But (6b) receives a different reading: instead of positioning the treasure below the highest point of
the house, it positions the treasure below the lowest point of the house. Specifically, while (5b) is
true if the treasure is under the top of the tree, (6b) is true only if the treasure is under the bottom
1
I use the term ‘canopy’ as shorthand for the highest point of the tree; obviously, the canopy can be vertically
extended, in which case I intend to refer to ‘the highest point of the canopy’. The same goes for my use of the term
‘roof’ in the context of (6).
2
I will use the terms ‘highest’ and ‘lowest’ to compare points in (objective) physical space, and will reserve the
terms ‘greatest’ and ‘least’ for the comparison of points on scales whose orderings may or may not accord with physical
space. The need for this distinction will become clear shortly.
4
of the house. It’s false, for instance, if the treasure is inside the house.
This restricted meaning isn’t (as a linguist might suppose) the result of some sort of lexical
competition with the interior locative preposition inside; (6b) is similarly false if the treasure is
under the eave of the roof – outside of the house – but still above ground. Instead, it appears that,
while above positions a relative to the highest point of b, under positions a relative to either the
highest or lowest point of b, depending. This is depicted in Figure 1, with a1 corresponding to the
reading in (5b) and a2 corresponding to the reading in (6b).
As these minimal pairs suggest, the difference between the two interpretations of under phrases
is conditioned by the object denoted by the internal argument b. The classes of objects conditioning
each reading can be extended, as demonstrated in (7).
(7)
a.
b.
a1 , ‘under the highest’ meaning: trees, tables, chairs, umbrellas, carports, hats
a2 , ‘under the lowest’ meaning: houses, cars, boxes, refrigerators, cabinets
I will provide an account of this meaning alternation in a vector-semantic framework. The framework takes for granted that these locative prepositions denote relations between the vector spaces
corresponding to each argument. I’ll argue that the difference between the objects in (7a) and those
in (7b) will ultimately have to do with the sort of vector space to which they’re mapped: the objects
in (7a) are generally conceptualized as not having bottoms, and so their vector spaces have open
lower bounds. In contrast, the objects in (7b) are generally conceptualized as having bottoms, so
their vector spaces have closed lower bounds.
As a prelude to this analysis, I’ll switch to a discussion of the only other place (that I know of)
in the semantics literature in which scale boundedness seems to play a truth-conditional role: in
degree semantics. After motivating this boundedness distinction, I’ll argue that degree comparatives
display the same meaning alternation as locative prepositions do.
3
3.1
Scale boundedness in degree constructions
Sensitivity to scale boundedness in the degree domain
Previous degree-semantic work on adjectives and adjectival participles suggests that the scales with
which they are associated can vary in their ordering and their boundedness. Degree semantic theories
are those which invoke degrees as ontological primitives to account for a variety of natural language
5
phenomena including gradable adjectives, comparatives and numeral quantifiers (beginning with
Bartsch and Vennemann, 1972; Cresswell, 1976). I assume, following many, that the difference
between a gradable and a non-gradable adjective is that gradable adjectives denote relations between
an individual and a degree that individual instantiates (given the dimension of measurement encoded
by the adjective, as in (8)).
(8)
JtallK = λdλx.tall(x, d)
Gradable adjectives are therefore said to be associated with a scale, or set D of degrees d which
are linearly ordered, onto which the adjective’s individual argument is mapped. There are a few
ways in which scales can differ across adjectives: in their dimension of measurement (the difference
between height and length, for instance); in their ordering, and in their boundedness (whether their
bounds are open or closed). A closed bound is one which is included in the scale itself; an open
bound is one not included in the scale.
(i)
a.
b.
(a, b) = {x : a > x > b}
(a, b] = {x : a > x ≥ b}
c. [a, b) = {x : a ≥ x > b}
d. [a, b] = {x : a ≥ x ≥ b}
When I illustrate scales, I will depict open bounds with while circles ◦ and closed bounds with black
circles •.
Antonyms like tall and short (can) have the same dimension and scale structure, but differ in
their ordering. This assumption explains the entailment relations between comparatives formed
with relative adjective antonyms, as in (9).
(9)
a.
b.
John is taller than Bill is. ↔ Bill is shorter than John is.
John is shorter than Bill is. ↔ Bill is taller than John is.
The ‘tall’ scale is downward monotonic: we can infer from John’s being related to height d that he
is also related to height d − 1 (but not to d + 1; Heim, 2000). In contrast the ‘short’ scale is upward
monotonic: we can infer from John’s being related to height n that he is also related to height d + 1
(but not to d − 1).
It’s been argued that the scales associated with gradable adjectives differ in their structure, or
boundedness, as well as in their ordering (Kennedy and McNally, 2005). Motivation comes from the
difference between absolute and relative adjectives and the correspondingly different distribution of
modifiers, as shown in (10) (Lehrer, 1985; Yoon, 1996; Kennedy and McNally, 1999; Paradis, 2001;
Rotstein and Winter, 2004; Kennedy and McNally, 2005). These examples and the details of the
analysis are from Kennedy and McNally (2005) (although see Burnett, 2012, for an explanation of
these phenomena and their interaction with vagueness in a non-degree-based semantics).
(10)
a.
b.
c.
Her brother is completely ??tall/??short.
The treatment is completely safe/??dangerous.
The figure was completely visible/invisible.
These theories characterize modifiers like completely and half as scale modifiers, denoting functions whose domain is the scale associated with a particular adjective. The assumption is that
scalar modifiers can differ in terms of whether they demonstrate sensitivity to whether their scalar
argument has an open or closed bound. Completely modifies scales with respect to their open upper
bounds, and so is undefined with scales with closed upper bounds, and is therefore unacceptable
6
with adjectives associated with such scales. The relative adjectives tall and short in (10a) are
associated with fully closed scales, and the absolute adjectives visible and invisible in (10c) are
associated with fully open scales. The antonyms safe and dangerous in (10b) are associated with
partially closed scales, but because they differ in their ordering only one of the antonyms, safe, has
an open bound as its upper bound and can therefore cooccur with safe.
Sensitivity to scale-boundedness has been useful at accounting for interesting aspectual facts
(Krifka, 1989, 1990, 1992; Hay et al., 1999; Beavers, 2008). A relevant distinction is between telic
and atelic eventualities (Vendler, 1957), as exemplified in Table 1.3
Table 1.
telic
atelic
atomic events
achievement
reach the top
semelfactive
hiccup
extended events
accomplishment
climb to the top
activity
climb
states
states
own a surfboard
A reliable test for the telic/atelic distinction in English is the ability of a VP to by modified by a
for -phrase (for atelic VPs) or an in-phrase (for telic VPs), as in (11).
(11)
a.
b.
John climbed for 3 hours.
John climbed to the top in 3 hours.
atelic
telic
Beginning with Bach (1986); Krifka (1989, 1990, 1992), it’s been useful to couch the telicity distinction in terms of homogeneity of an event (or ‘cumulativity of reference’, or ‘boundedness’). The
idea is that atelic events and states are homogenous (display cumulativity of reference): an activity
like climbing counts as an event of climbing, and each subevent of the relevant grain also counts
as an event of climbing. In contrast, telic events are heterogenous (fail to display cumulativity of
reference): an event of climbing to the top of a mountain, includes several subevents of climbing
but also includes a subevent in which the agent reaches the top of the mountain.
This distinction was originally proposed to explain parallels between mass nouns and atelic
events on the one hand and count nouns and telic events on the other. But it seems to crop up in
other ways as well. When a verb is derived from a gradable adjective, the scale structure of the
adjective seems to dictate the telicity of the resulting verb. Gradable adjectives like dry and straight
are associated with scales with open upper bounds – they can be modified by completely – and
sentences with the verbs to dry and to straighten are correspondingly telic. This is demonstrated
using another telicity test of Vendler’s: atelic predicates are entailed by their progressive forms,
while telic predicates are not (data from Hay et al., 1999).
(12)
a.
b.
The workers are straightening the rope. 9 The have straightened the rope.
The clothes are drying. 9 The clothes have dried.
3
I will use the term ‘eventuality’ as a cover term for events and states (Bach, 1986). Vendler’s original distinction
between states and events involved the observation that states (unlike extended events) do not correspond to a process
which changes over time, as they “cannot be qualified as actions at all” (p106). Tests distinguishing states from events
include the ability to occur in the progressive (*She is owning a surfboard ) and the ability to function as an imperative
(*Own a surfboard! ) (Dowty, 1979). States nevertheless behave like activities in that they are extended (non-atomic)
and atelic, so I will include them in the discussion below.
7
The idea is if the workers are in the process of straightening the rope, it’s false that they have
straightened the rope. (They are presumably only on their way to straightening the rope.)
This is in contrast to relative adjectives like long and slow, which are associated with fully closed
scales. The verbs to lengthen and to slow are correspondingly atelic.
(13)
a.
b.
The workers are lengthening the rope. → They have lengthened the rope.
The rain is slowing. → The rain has slowed.
The intuition is that if the workers are instead in the process of lengthening the rope, then it’s true
that they have already lengthened the rope. This test is intended to reflect that lengthening is a
homogenous process – composed of sub-events of lengthening – while straightening is a heterogenous
one, culiminating in a telos which is distinct from the sub-events of straightening.
To summarize, there are many reasons to think that natural language predicates can be associated with scales – linearly ordered sets of degrees – and that these scales can differ not just in terms
of their dimensions and ordering but in the nature of their bounds. The data in (10) suggests that
modifiers like completely take as their arguments only scales with an open upper bound, and thus
that relative and absolute adjectives differ in the boundedness of the scales with which they’re associated. And this distinction seems to manifest itself in the event domain in terms of the (a)telicity
of an event.
In the next section, I’ll show that degree comparatives, which I assume denote relations between
sets of degrees (or scales), demonstrate the same meaning alternation locative prepositions do. I’ll
show that a good way of accounting for the difference is in terms of a sensitivity to scale structure.
In §4 I’ll return to the locative data and argue for the same analysis in the domain of locations.
3.2
A meaning alternation in degree comparatives
Gradable adjectives are associated with scales to which their individual arguments are mapped. The
result is a subset of degrees, which I’ll call an interval, corresponding to that individual’s measure
along the relevant dimension. So in a situation in which John’s height is 6ft, the intervals corresponding to John’s tallness and shortness are represented in (14), with the left bound corresponding
to the least upper bound (the lub) and the greatest least bound (the glb). I define these bounds in
(15).
(14)
a.
b.
λd.tall(john, d) = {...1ft, 2ft,...6ft} = (0, 6ft]
λd.short(john, d) = {...,7ft, 6ft} = [∞,6ft]
(15)
a.
b.
lub(D∗ ) = ιd[∀d0 [D(d0 ) → d0 ≤∗ d] ∧ ∀d0 , d00 [(D(d0 ) ∧ D(d00 )) → (d0 ≤∗ d00 ∧ d ≤∗ d00 )]]
glb(D∗ ) = ιd[∀d0 [D(d0 ) → d ≤∗ d0 ] ∧ ∀d0 , d00 [(D(d0 ) ∧ D(d00 )) → (d00 ≤∗ d0 ∧ d00 ≤∗ d)]]
In (15), D ranges over intervals. An interval can have one of two orderings – either downwardor upward-monotonic – and its lubs and glbs are defined relative to that ordering. I’ve used the
asterisk ∗ to range over these orderings.
Descriptively speaking, a comparative of the form ‘a Adj-er b’ places restrictions on the set of
degrees to which a instantiates Adj relative to the set of degrees to which b does. The comparative
in (16a) requires that the degree to which Amy is tall exceed the highest degree to which Bob is
tall, and the negative-antonym comparative in (16b) requires that the degree to which Amy is short
8
exceed on the ‘short’ scale (i.e., be lower than) the highest degree to which Bob is short.4
(16)
a.
b.
Amy is taller than Bob (is).
Amy is shorter than Bob (is).
From this, we might conclude that ‘a Adj-er b’ is true iff the a interval exceeds in the relevant
direction the highest member of the b interval.
However, it’s been observed that this isn’t always the case. It turns out that whether a comparative relates the a interval to the minimum or maximum of the b interval depends on the nature
of the than clause (Rullmann, 1995; Meier, 2002; Heim, 2007; Büring, 2007; Beck, 2012). I’ll argue
here that this is the same phenomenon as the meaning alternation observed in locative prepositions.
Imagine a context in which Lucinda is driving on a highway that has a maximum speed limit
(70mph) as well as a minimum speed limit (40mph). I’ll refer to this as the ‘Max/Min Context’. In
this context, the predicate she can drive d-fast is non-monotonic: if it holds truthfully of a degree d
we can neither infer that it holds of d − 1 nor that it holds of d + 1. In this context, the comparative
in (17a) is true iff Lucinda is driving faster than the maximum speed, i.e. above 70mph. And the
slower comparative in (17b) is true iff Lucinda is driving slower than the minimum speed, i.e. below
40mph. These judgments conform to the pattern in (16).
(17)
a.
b.
Lucinda is driving faster than she can (on this highway).
Lucinda is driving slower than she can (on this highway).
In contrast, imagine a context in which the highway has a maximum speed limit (70mph) but no
minimum speed limit (I’ll refer to this as the ‘Max Context’). In this context, the predicate she can
drive d-fast is downward-monotonic: if it holds truthfully of some degree d then it holds truthfully
of some degree d − 1. And in this context, as in the previous one, (17a) is true iff Lucinda’s speed
exceeds 70mph. But, in this context, (17b) receives a different interpretation than before: it is true
iff Lucinda is driving below the speed limit; that is, it restricts Lucinda’s speed with respect to the
maximum degree she is allowed to drive.
This difference has been characterized in terms of ambiguity: (17a) is ambiguous, while (17b)
isn’t. The basis of such theories is a characterization of slow as the negation of fast. The claim is
that the negative-antonym comparatives contain a covert negation which can scope independently
of the comparative morpheme, resulting in a syntactic ambiguity (Rullmann, 1995; Heim, 2007;
Büring, 2007).5 I’ll instead argue that it has to do with scale structure and boundedness.
3.3
An analysis in terms of boundedness
There is a strong and productive tradition of analyzing the comparative and equative morphemes as
degree quantifiers (i.e. as denoting relations between sets of degrees; Seuren, 1973; McConnell-Ginet,
1973; Kamp, 1975; Cresswell, 1976; Hellan, 1981; Hoeksema, 1983). Following many (see Lechner,
2001, 2004), I consider the target of a clausal comparatives and equatives to be an elided clause, as
in (18).
4
As before, I use the terms ‘highest’ and ‘lowest’ objectively – in this case, with respect to the positive reals –
keeping the terms ‘greatest’ and ‘least’ for scale-subjective uses.
5
The sentence Lucinda is driving less fast than is allowed receives the same interpretations as (17b), leading
Rullmann, Heim and Büring to assume that less is the negated version of more.
9
(18)
a.
b.
John is as tall as/taller than Sue (is).
John is as tall as/ taller than Sue is tall.
Following Bresnan (1973) and many others, I assume that the clausal correlates of comparatives and
equatives denote sets of degrees, or intervals, via the movement of a null wh-phrase base-generated
in the degree argument position of the adjective in each clause (Bresnan, 1973; Chomsky, 1977;
Williams, 1977; Pesetsky, 1987; Heim, 1985, 2006).
There have been a number of different proposals for the semantics of the comparative morpheme.
Most (excepting the work in Klein, 1980, 1982, and its recent adaptations) follow the assumption
above that -er is a degree quantifier (see von Stechow, 1984; Schwarzschild, 2008, for overviews) and
so denotes a relation between sets of degrees. I’ll use as a starting point one of the more influential
versions, which Schwarzschild (2008) dubs the ‘A-not-A’ analysis (Seuren, 1973; McConnell-Ginet,
1973; Kamp, 1975; Hoeksema, 1983; Seuren, 1984). Its typical formulation is shown in (19) (with
D ranging over sets of degrees, or scales). It requires that there be a degree in the denotation of
the matrix clause which is not a member of the denotation of the than clause.
(19)
J-erK = λD0 λD∃d [d ∈ D ∧ d ∈
/ D0 ]
I will have to adopt a slightly different characterization of the comparative in what follows.
As I stated, the prominent analysis of this ambiguity involves the postulation of a syntactic
ambiguity in just the negative-antonym case between a covert negation and the comparative morpheme. As I’ll discuss shortly, Beck (2012) provides evidence against such an account: when the
than clause denotes an upward-monotonic scale it’s the positive-antonym comparative that displays
a meaning alternation. Meier (2002), focusing only on comparatives with modals in the than clause,
implemented a ‘negative ordering source’ on possible worlds to capture the asymmetry.
I propose that the meaning alternation comes about because the than clauses in the contexts
outlined above denote differently bounded scales, and because the comparative is sensitive to the
difference between open and closed bounds. Take the two contexts described with respect to the
comparative in (17). In a context in which the salient law places only a maximum limit on drivers’
speed (the Max Context), the interval corresponding to what is allowed on the highway is partially
closed; its bound corresponding to 70mph is included in the interval, while its bound corresponding
to zero is not (Figure 2).
Figure 2.
In contrast, in a context in which the salient law places both a maximum and a minimum (40mph)
limit on drivers’ speed (the Max/Min Contexts), the interval corresponding to what is allowed on
the highway is closed, with both bounds included in the interval.
10
This allows for a reconceptualization of the meaning alternation in (17) as in Table 2. From this
perspective, the comparative morpheme positions the matrix interval a with respect to the lub of
its b interval in every case except one.
Table 2.
(17a), Max
(17b), Max
(17a), Max/Min
(17b), Max/Min
Context
Context
Context
Context
relation
faster
slower
faster
slower
b
partially closed
partially closed
closed
closed
a w.r.t. b
exceed lub
exceed glb
exceed lub
exceed lub
It seems as though, in the context of a slower comparative, the difference between a closed and
open lub is enough to condition the change from an ‘exceed lub’ interpretation to an ‘exceed glb’
interpretation. In other words, the comparative appears to discriminate between closed and open
bounds of its internal argument, and furthermore to privilege closed bounds. This suggests for the
following characterization of the semantics of comparatives: ‘a GP-er b’ is true iff a exceeds on the
relevant scale the maximal closed bound of b.’
To account for the observed facts we’ll need to incorporate scale direction into the definition of
the comparative, and we’ll also need to incorporate the difference between open and closed bounds.
I do this in (20).
(20)
a.
b.
c.
λD∗ λD∗0 ∃d0 ∈ D∗0 [d0 >∗ Max(λd.bound(d, D∗ ) ∧ d ∈ D∗ )], where
Max(D∗ ) = ιd[d ∈ D∗ ∧ ∀d0 ∈ D∗ [d0 6= d → (d0 <∗ d)]] and
bound(d, D∗ ) is true iff ((∀d0 ∈ d∗ 6= d, d0 <∗ d) ∨ (∀d0 ∈ d∗ 6= d, d <∗ d0 ))
This definition can be interpreted as characterizing ‘a GP∗ -er b’ as true iff some member of the
interval a exceeds on the ∗ scale the greatest bound of the interval b which is included in b. This
definition correctly predicts that, in the Lucinda cases, Lucinda’s speed will be restricted to those
degrees above the relevant lub in all cases except the lub is an open bound. Instead of characterizing the slower examples as syntactically ambiguous, this approach predicts that the meaning of
the comparatives as dependent on the scalar structure of their internal argument.
Unlike the Rullmann/Heim theory, which ties the meaning alternation to the markedness of the
negative antonym slow, this theory predicts that any comparative can display a meaning alternation,
as long as it has a than clause denoting an open lub in some contexts. And this appears to be the
case. Beck (2012) argues that the meaning alternation isn’t restricted to comparatives formed
from negative antonyms but instead proposes the empirical generalization in (21). Again, for
the purposes of this theory, the terms ambiguous and unambiguous will have to be translated
into meaning something like ‘will demonstrate a meaning alternation’ and ‘will not demonstrate a
meaning alternation’.
(21)
a.
b.
Positive-polar (e.g. faster ) comparatives are unambiguous with downward-monotonic
B arguments, but ambiguous with upward-monotonic B arguments.
Negative-polar (e.g. slower ) comparatives are ambiguous with downward-monotonic
B arguments, but unambiguous with upward-monotonic B arguments.
I’ll illustrate Beck’s point by modifying the above contexts slightly. Imagine that instead of
driving on an American highway, Lucinda is driving on a German autobahn. In this particular area
11
of Germany, there is no maximum speed limit but there is a minimum speed limit of 70kph. In this
Minimum context, the slower comparative in (22b) has a clear meaning: it requires that Lucinda’s
speed be less than 70kph, the lub of the ‘slow’ scale.
(22)
a.
b.
Lucinda is driving faster than she can (on this autobahn).
Lucinda is driving slower than she can (on this autobahn).
The faster comparative in (22a) is harder for some to interpret (as discussed in Beck, 2012) but
many (including myself) report it as meaning something like ‘Lucinda is driving faster than some
speed she can be driving’. In the Minimum context, this means that (22a) is true iff Lucinda is
driving faster than 70kph, the glb of the ‘fast’ scale.
A Max/Min version of this context is the same as a Max/Min version of the previous, American
highway context, and should receive the same interpretations. I’ve summarized the truth conditions
of (17)/(22) in Table 3. As predicted by Beck, it’s the positive-antonym comparative faster which
demonstrates a meaning alternation in this upward-montonic context.
Table 3.
(17a), Min
(17b), Min
(17a), Max/Min
(17b), Max/Min
Context
Context
Context
Context
relation
faster
slower
faster
slower
b
partially closed
partially closed
closed
closed
a w.r.t. b
exceed glb
exceed lub
exceed lub
exceed lub
Figure 3 demonstrates the scale structure of the than phrase in these contexts. The crucial difference here is that the upward-montonic interval has a closed bound of 70kph and an open bound of
infinity. On the ‘faster’ scale that open bound is a lub; on the ‘slower’ scale that open bound is a glb.
Figure 3.
As before, (20) correctly predicts the meaning alternation: the data in Table 3 show that the
comparative places the interval a with respect to the highest closed bound of b. The discussion
of this Minimum context follows reports that the meaning alternation isn’t confined to negativeantonym comparatives, and requires an analysis that is sensitive to scale direction.6
4
A vector-semantic analysis
The boundedness-sensitive defintion in (20) raises several questions: why would natural language
relations be sensitive to the difference between closed and open bounds? And to what extent is
6
There are of course other problems to solve in the semantics of comparatives, and I don’t intend (20) to be a
cure-all. See Larson (1988); Heim (2000); Schwarzschild (2008) for a review.
12
this general? It’s possible that such a sensitivity would be idiosynractic of the degree domain; I’ll
argue in this section that it is not, that it is also responsible for the meaning alternation with vector
locative prepositions. I’ll end the paper by arguing that this sensitivity is a property of all relational
predicates in linearly ordered domains.
I’ll begin by introducing the formalism behind vector semantics, which will allow for an analysis
of the meaning alternation of VLPs.
4.1
An introduction to vector semantics
I assume, as discussed above, that locations are a semantic primitive, and that the pluralities of
locations used in natural language are structured in terms of scales (rather than sums). A vector v
is a set of totally ordered locations l ∈ L .
Vector semantics was introduced in Zwarts (1997) to account for the semantics of modifiers in
sentences like The mirror is three inches above the painting. I’ll follow Zwarts (1997) in assuming
a homomorphism space from an object to its vector space, (a set of vectors corresponding to the
object’s physical space). A two-dimensional object, for instance, can be mapped to a vertical vector
space or a horizontal vector space.
A given locative preposition is associated with only one axis (in the case of under and above,
this is the vertical axis. I’ll assume that this means that each locative selects for a plane that is
a subset of the object’s space, VERT(space(x)) in the case of VLPs. Each plane is composed of
a set of parallel vectors; the vector spaces differ from each other in their position in horizontal space.
Figure 4.
As suggested above, I analyze above and under as antonymic prepositions. I will therefore
assume that above is associated with the vertical axis and under with its inverse (Zwarts, 1997, 72).
(This means that under is defined in terms of above but not vice-versa; I have in mind the linguistic
concept of markedness.) So if l3 is higher in objective physical space than l2 , which is higher than
l1 , there are at least two vertical vectors associating these locations: the one corresponding to the
‘above’ ordering (v+ ) and the one corresponding to the ‘under’ ordering (v− ).
(23)
a.
b.
v+ : hl1 , l2 , l3 i
v− : hl3 , l2 , l1 i
13
4.2
Locatives and scale boundedness
Recall the meaning alternation data presented in §2 (repeated in (24) and (25)).
(24)
a.
b.
The balloon is above the tree.
The treasure is under the tree.
(25)
a.
b.
The balloon is above the house.
The treasure is under the house.
The vector-semantic assumptions introduced above allow us to reformulate the interpretations of
each of these sentences in terms of glbs and lubs, as in Table 4. In most cases, a the preposition
requires that the location of its external argument a exceed on the relevant ordering the lub of the
location of its b argument. From this point of view, the case in (24b) becomes the exception to a
general pattern.
Table 4.
relation
(5a) above
(5b) under
(6a) above
(6b) under
b
tree
tree
house
house
a w.r.t. b
exceed lub
exceed glb
exceed lub
exceed lub
Abstracting from objects to vectors highlights the parallel between this meaning alternation and
the one observed for degree comparatives. In what follows I’ll argue that they have the same source.
The meaning alternation is conditioned by the type of object b denoted by the preposition’s
internal argument. Objects like those listed in (26a) are those which condition the meaning alternation; those in (26b) do not.
(26)
a.
b.
a1 , ‘under the highest’ meaning: trees, tables, chairs, umbrellas, carports, hats
a2 , ‘under the lowest’ meaning: houses, cars, boxes, refrigerators, cabinets
My proposal is this: the vector space associated with a given object varies from context to context
(just as the degrees associated with permitted speed varies contextually). Objects like those in (26b)
are typically associated with fully closed vertical vectors because they are generally conceptualized
as having bottoms. In contrast, objects like those in (26a) are often associated with partially closed
vertical vectors because their bottoms are generally conceptualized as being imposed by other objects, like the ground. This difference in boundedness results in the same pattern witnessed in the
degree domain.
The objects in (26b), in general, have a bottom: a subcomponent corresponding in space to
a horizontal plane marking the salient end of that object in vertical space. This means that the
vertical vectors corresponding to their location in physical space will have closed lower bounds; the
locations marking the end of these objects will be included in the spatial extension of the objects
themselves. They are, in effect, the spatial equivalent of an atelic event; they display cumulativity
of reference.
In contrast the objects in (26a), in many contexts, aren’t conceptualized as having bottoms.
They have a subcomponent corresponding to the lowest point in their vector space, but this sub14
component and the location it is related to fail to count, in many contexts, as a bottom. In such
contexts, the top of the object is the only salient point of reference for establishing a spatial relationship; they are the spatial equivalent of telic events. This is not to say that, in these contexts, the
objects are conceptualized as continuing indefinitely downward in space; rather their lower bound
is established by some separate object, often the ground.
I’ll first illustrate the readings we’ve seen relative to these assumptions. I’ll then discuss important consequences of this empirical perspective. In Figure 5, the first illustration corresponds to the
reading available for (24a), the second for (24b), etc.
Figure 5.
In these examples, too, the correct generalization seems to be: ‘a VLP b’ positions a with respect
to the greatest closed bound of the vector space of b. Generally, this is the lub of b; when b is a
tree – whose vertical vector space is relevantly bound by the ground – it is the glb of b.
To capture this generalization formally, I will characterize VLPs as denoting relations between
sets of vectors, described informally as: ‘A VLP∗ B’ is true iff the vertical vector space corresponding to A exceeds∗ the greatest bound of the vertical vector space corresponding to B which
is included in B.
(27)
a.
b.
c.
λv∗ λv∗0 ∃l0 ∈ v∗0 [l0 >∗ Max(λl.bound(l, v∗ ) ∧ l ∈ v∗ )], where
Max(v∗ ) = ιl[l ∈ v∗ ∧ ∀l0 ∈ v∗ [l0 6= l → (l0 <∗ l)]] and
bound(l, v∗ ) is true iff ((∀l0 ∈ v∗ 6= l, l0 <∗ l) ∨ (∀l0 ∈ v∗ 6= l, l <∗ l0 ))
This definition is oversimplified in that it assumes ‘a is VLP b’ is false if there is some vertical
overlap between a and b. This works for the cases above but will need to be weakened, as discussed
earlier, to account for the (true) utterances of ‘a is VLP b’ in cases in which there is some overlap
between a and b. It is also oversimplified in that it is a relation between vectors, not planes or
vector spaces. I will assume for now that the scale structure of vectors is homogenous across an
object’s vertical plane, but I do this for convenience’s sake. It’s entirely possible that objects can
differ in the homogeneity of their e.g. vertical vectors, and that this difference is signficant for the
a1 /a2 meaning alternation.
(27) accounts for the readings in (24) and (25) (depicted in Figure 5): in the case of (24a), the
tree’s lub is the greatest closed bound on the ‘above’ scale, so the object denoted by the first argument a is positioned relative to the tree’s lub. In (24b) – a is under the tree – the tree’s canopy is
now a glb. While the tree’s lub qualifies as the greatest bound on the ‘under’ scale, it is not closed,
and so the sentence is true whenever a exceeds the tree’s glb – its canopy – on the ‘under’ scale.
15
For both sentences in (25) the calculations are more straightforward because both of the house’s
vertical bounds are closed. In the case of (25a), a is required to exceed (on the ‘above’ scale) the
house’s glb, which is the highest bound on the ‘above’ scale. In (25b), a is required to exceed (on
the ‘under’ scale) the house’s lub, which is the highest bound on the ‘under’ scale.
This explanation characterizes the ground as imposing a bound on a tree’s vertical vector space
in many but not all contexts. It relies on the idea that an object x that is distinct from an object y
can nevertheless function as a spatial bound for y. For instance, it seems quite natural to think of
the ground as a lower bound for a tree for the purposes of picnickers and birds but not groundhogs or
earthworms. Spoken by a biologist researching the habitats of groundhogs, the sentence The winter
burrow is under the tree seems to be different from the one in (5b) in that it requires the burrow
to be below the (majority of the) roots of the tree, not just below the ground. This interpretation
of an ‘under the tree’ sentence erases the meaning alternation demonstrated in Table 1, which is
what we would expect if the alternation was to do with lower bounds, and if lower bounds could
vary depending on context.
I characterized the objects in (26b) as ones which are generally conceptualized as having bottoms.
But this doesn’t require that this bottom be the most salient lower bound. An object is generally
thought to qualify as under the car if it is under the chassis but above the bottom of the tires (say,
in the sentence My ball rolled under the car ). But the tires themselves, in some contexts, function
to define the car’s lub: He got caught under the car describes an injury, not a game of hide-and-seek.
In the previous section I showed that the meaning alternation in degree comparatives wasn’t
restricted to the negative-antonym comparative (a point made in Beck, 2012). The slower comparative showed a meaning alternation when the open bound was at the lesser end of the scale, while
the faster comparative showed a meaning alternation when the open bound was at the greater
end of the scale, a flip due to the difference in the internal argument’s monotonicity. I’ll end this
discussion of locative prepositions by arguing that the same holds for them: that above, in the right
conditions, participates in a meaning alternation.
Gravity is such that objects tend to rest on top of things but not underneath things. But imagine
a small bush growing from the ceiling of a large cave. In this scenario, the under construction in
(28b) requires that the snake be below the lowest point of the bush – i.e. on the ground under the
bush – which counts as the highest closed bound on the ‘low’ scale.
(28)
a.
b.
The snake is above the bush.
The snake is under the bush.
But the above construction in (28a) can appropriately characterize a snake slithering across the
cave’s ceiling near the bush. It does not require, in contrast to (24a), that the snake be above the
cave’s ceiling (i.e. above ground). In this case, it seems, the bush has a closed lower bound on the
‘above’ scale but an open lower bound (the cave’s ceiling).
If these judgments are right then the a1 /a2 meaning alternation is something both VLPs are
susceptible to. This lends credence to the propsed analysis, which holds boundedness responsible for
the alternation, but can take into account the differences between antonymic relations. It also rules
out alternative analyses which characterize the meaning alternation in terms of lexical differences
between above and under.
16
5
Discussion
We have seen several instances of what appears to be the same sort of meaning alternation: some
in the domain of locations (and vectors), some in the domain of degrees (and intervals). I’ve argued
that the meaning alternation stems from sensitivity to the boundedness of a scalar argument.
Generalizing across the location and degree domains, this proposed analysis looks something like
(29a) (described informally in (29b)), for any x ∈ Dσ and any X ∈ Dhσ,ti .
(29)
a.
b.
λX∗ λX∗0 ∃x0 ∈ X∗0 [x0 >∗ Max∗ (λx.bound(x, X∗ ) ∧ x ∈ X∗ )]
‘a R b’ is true iff some member of the scale a exceeds∗ the greatest bound of the scale
b which is included in b.
This characterization raises the natural question: what could possibly explain this restriction in
natural language, and to what extent is it general? I’ll address this last question and then speculate
on the first.
I’ve argued that the meaning alternation is a property of two strictly ordered domains: locations
and degrees. The discussion in the Appendix argues that the meaning alternation is also a property
of the temporal domain, although there is one important discrepancy in extending the generalization
in (29a) to times. In contrast, it’s hard to see anything resembling a meaning alternation in relational
predicates in summing domains, like the domain of individuals.
(30)
a.
b.
John is the father of Bill and Mary.
Bill is the son of John and Sue.
In each of these sentences the internal argument denotes a plurality of individuals, just as the b
arguments in the above constructions denoted (or were associated with via homomorphism) plural
locations or degrees. But instead of relating the subjects to one or another of the members of the
plurality, these sentences relate the subject to the plurality as a whole. This is what we’d predict
given the discussion in §1 about the differences between sum domains and scalar domains: in the
former, entities are logically related to their sums, but in the latter, entities are logically related to
other entities.7
In fact, the linguistic significance of the maximal member of a sum has been well documented,
beginning with at least Rullmann (1995), who posits a null maximality operator to produce the
right, exhaustive semantics for questions like Who went to the party? (although see George, 2011,
for an overview of the complications of this initial proposal). §1 discussed this in general terms:
plural wh-clauses denote maximal members in sum domains, but the greatest member in scalar
domains. In each case, the denotation of the wh-clause is a lub, but in each case the lub bears a
different relationship to the other members in its set or plurality.
One interesting difference between scalar and sum domains has to do with boundedness. While
the concept of a lub is defined on a semilattice, it is in every case a closed lub; in other words, the
lub of a plural individual is always a member of that plurality. We’ve seen that the opposite is true
in the case of scales. If summing domains only have closed bounds, and if the meaning alternation is
7
It’s not clear how to test this prediction in the case of other sum domains. What some have proposed to be
event relations instead denote temporal relations, as in John ate while Bill sang (?), with the possible exception of
verbal aspect. And I’m unaware of any word or phrase that denotes a relationship between possible worlds (without
additionally quantifying over them).
17
really just an aversion to open bounds, then we can explain why the a1 /a2 alternation is a property
of some domains but not others.
Which brings me to the other question: what could account for a natural language aversion to
open bounds? I’ve argued that trees in some cases have their lower bounds imposed artificially, e.g.
by the ground. Assuming this, then the location of a treasure buried underground, beneath a tree,
could be described in at least two different ways:
(31)
a.
b.
The treasure is under the tree.
The treasure is under the ground.
(5b)/(24b)
These sentences highlight two different things about the location of the treasure. (31a) is noncommittal about whether the treasure is above ground or below ground but locates it in horizontal
space (and relative to the tree). (31b) is noncommittal about where in horizontal space the treasure
is, but locates it as below ground. Pragmatically, we might generalize the difference in (31) as
follows: the utterance of a sentence ‘a is VLP b’ is felicitous iff the location of b is informative
in the context. And if the location of b is informative in the context, a description of a’s location
relative to b’s should reference a point which is included in the spatial expression of b.
But this of course does not count as a satisfying explanation of why relational predicates display
an open bound aversion. I am content for now to argue just that there is an open bound aversion,
or something that looks like one, and that it seems to track an independently motivated difference
in how entities in a given domain can be ordered. The generalization gives credence to accounts
before mine which argue for scale structure distinctions as important facets of the degree semantics
of natural language. It demonstrates the significance in a typed semantics and the ontology that
goes with it. And it supports the belief that we can engage in a project of cross-domain semantics,
in which we can infer about one domain from the behavior of another, alongside the project of
cross-linguistic semantics, in which we infer about one language from the behavior of another.
Appendix: Temporal prepositions and scale boundedness
Relational predicats in the temporal domain demonstrates the same sensitivity to open versus closed
bounds, amounting to a sensitivity to telic versus atelic event types. There is however an additional
complication to these data: while locatives and comparatives use the greatest closed bound as a
reference point, the temporal preposition before uses the greatest open bound as a reference point.
I’ll begin by presenting the data and then will try to attribute this difference to some independently
observed semantic idiosyncrasies of before.
The temporal preposition before displays a sensitivity to closed bounds similar to that demonstrated by locatives and comparatives. In (32a), the event a denoted by the main clause (MC) must
begin after the end of the event b denoted by the temporal clause (TC), which is to say that John’s
running out of water must take place after the end of the temporal interval corresponding to John’s
climbing the mountain.
(32)
a.
b.
John ran out of water after he climbed to the top of the mountain.
John ran out of water before he climbed to the top of the mountain.
And in (32b), the event a denoted by the MC must begin before the end of the event b. In particular,
18
(32b) is true iff John’s running out of water preceded the endpoint of the interval corresponding to
John’s climbing the mountain. This means that (32b) is true in a situation in which John ran out
of water mid-way through his reaching the top of the mountain, but it’s also true if John ran out
of water before he began to climb.
These sentences suggest the following generalization about temporal adverbs TP: ‘a TP b’
positions the temporal extent of a relative to the latest point of the temporal extent of b.8 But, as
before, this isn’t quite right. In (33a), as in (32a), after positions a with respect to the latest point
of the temporal extension of b, which is to say that it’s true iff Mary met John after she ceased to
be single.
(33)
a.
b.
Mary met John after she was single.
Mary met John before she was single.
But (33b), in contrast to (32b), is incompatible with a situation in which Mary met John midway through the period in which she was single. In (33b), before positions the temporal extent of a
with respect to the earliest point of the temporal extent of b (the ‘before the earliest’ interpretation).
This suggests that a more appropriate informal generalization about temporal adverbs is as follows: while after positions the temporal extent of a relative to the latter-most point of the temporal
extent of b, before positions the temporal extent of a relative to either the latest or the earliest
point of the temporal extent of b, depending. This is depicted in Figure 6, with a1 corresponding
to the reading in (32b), and a2 corresponding to the reading in (33b).
Figure 6.
By way of an analysis, I assume, following Krifka (1989), that eventualities are associated with
their run-times via a homomorphism H. Following Moens and Steedman (1988), I will identify
events in terms of “their starting point and the point at which they end (in the case of processes
[= activities –JR]) or culminate (in the case of culminated processes [= achievements –JR])” (p26).
So the temporal interval H(b) associated with an activity will have closed lower and upper bounds,
while the temporal interval H(b) associated with an accomplishment will have a closed lower bound
and an open upper bound, as in (34).
(34)
a.
b.
H(eactivity/state ) = [u, v] = {t : u ≤ t ≤ v}
H(eaccomplishment ) = [u, v) = {t : u ≤ t < v}
The idea is that the time point corresponding to the beginning of an eventuality is included in
the eventuality’s run-time; the first moment of an eventuality counts as a time during which that
eventuality is taking place. And the time corresponding to the end of an activity (walking, for
instance) is also included in the event’s run-time; the final moment of the event of John walking
counts as a time during which John is walking. Same for states. But the time corresponding to
the end of an accomplishment is the moment of culmination, therefore arguably a bound on the
8
I’ll use the terms ‘latest’ and ‘earliest’ objectively and will continue using ‘greatest’ and ‘least’ subjectively.
19
run-time of the event but one whose inclusion in the run-time has a different status than the final
moment of an activity or state.
Finally, I assume that the difference between before and after is (at least) one of direction; while
before orders two intervals with respect to the < relation, after orders two intervals with respect to
the > relation. This means that the lub of a ‘before’ interval is its earliest point (the point highest
on the ‘before’ scale than any other) while the lub of an ‘after’ interval is its latest point (the point
highest on the ‘after’ scale), and so forth for the glbs.
Just like the a1 and a2 readings of locatives are conditioned by the type of object b, the a1 and
a2 readings of temporal prepositions are conditioned by the type of eventuality denoted by the TC.
In particular, accomplishment TCs (like climb to the top in (32)) receive the a1 , ‘before the latest’
interpretation, while TCs denoting extended atelic eventualities like states (e.g. be single in (33))
and activities receive the a2 , ‘before the earliest’ interpretation. (TCs denoting punctual or atomic
events don’t demonstrate the ambiguity for the obvious reason: we cannot distinguish their latest
point and earliest point.) We can therefore recharacterize the data as in Table 5.
Table 5.
relation
(32a) after
(32b) before
(33a) after
(33b) before
b
telic (partially closed)
telic (partially closed)
atelic (closed)
atelic (closed)
a w.r.t b
exceed lub
exceed glb
exceed lub
exceed lub
(35) and (36) are two more pairs that conform to this generalization.
(35)
a.
b.
Mary called an ambulance after John drowned.
Mary called an ambulance before John drowned.
(36)
a.
b.
Mary called an ambulance after John was sick.
Mary called an ambulance before John was sick.
(35) features an accomplishment to drown; we can imagine that the event of John’s drowning was
extended in time but had an endpoint corresponding to his death. (35a) is true iff Mary called
the ambulance after this endpoint; (35b) is true as long as Mary called the ambulance before this
endpoint. That is, (35b) is true if Mary called the ambulance in the extension of John’s drowning
before his death; but it’s also true if Mary called the ambulance before the entire event of John’s
drowning, say, before he entered the water, because Sue had just broken her leg.
(36) features a state to be sick ; John’s being sick is extended in time but its endpoint is closed
(counts as a time during which John is sick). And, as in (33), (36a) is true iff Mary called an
ambulance after the last instant John was sick, but (36b) is true only if Mary called an ambulance
before the first instant John was sick.
These data have in common with the locative and degree data that the exceptional case – the
one requiring that a exceed the least bound – is the one with the negative-antonym relational predicate and the partially-closed internal argument. There is however an important contrast with the
previous pattern. In the case of this partially closed interval it’s the least bound – as opposed to
the greatest bound – that’s open, as demonstrated in Figure 7.
20
Figure 7.
In particular, (32a) (after... climbed to the top) positions H(a) relative to the open, lub of H(b)
and (33a) (after... was single) positions H(a) relative to the closed, lub of H(b). And (32b) (before... climbed to the top) positions H(a) relative to the open, glb of H(b) and (33b) (before... was
single) positions H(a) relative to the closed, lub of H(b). At first glance, then, a more appropriate
generalization for the semantics of temporal prepositions is: ‘a TP b’ positions H(a) relative to the
maximum or greatest open bound of H(b) on the relevant scale.
So temporal prepositions pattern with locative prepositions and degree comparatives in that
they appear to be sensitive to the difference between opened and closed bounds. In the case of
locative prepositions and degree comparatives, closed bounds were privileged over open bounds. In
the case of temporal prepositions, open bounds are privileged above closed ones. This of course
runs counter to the pragmatic explanation of the sensitivity given above.
This contrast between times on the one hand and locations and degrees on the other is intriguing. While I cannot offer an account of this contrast, I will end by discussing two major differences
between temporal prepositions and locative prepositions/degree comparatives. It is my suspicion
that some odd semantic properties of temporal prepositions might be useful for explaining the reverse pattern demonstrated above.
There are (at least) two interesting and relatively well-explored semantic properties of before:
it licenses NPIs, and it can receive one of three possibly distinct interpretations (Zwarts, 1995). I’ll
introduce both properties and then show that each is related to the a1 /a2 meaning alternation in
some respect.
Before (but not after ) can license NPIs in its TC (Anscombe, 1964; Heinämäki, 1974; Beaver
and Condoravdi, 2003; Condoravdi, 2010; Krifka, 2010). The examples in (39) come from Beaver
and Condoravdi (2003).
(37)
a. Cleo lept into action before David moved a muscle/could say a word.
b. *Cleo lept into action after David moved a muscle/could say a word.
Since Ladusaw (1979), NPIs have been characterized as being licensed only in downward-entailing
environments. The status of before as an NPI-licenser has received a fair amount of attention because
it seems as though it is not a downward-entailing operator, at least at first blush, as (38) shows
(from Condoravdi, 2010).9
9
Although this example is from Condoravdi (2010), she does not ultimately endorse it as the correct test for DEness in before clauses. Her analysis instead focuses on an ordering implication between the two clauses and claims
that before can be correctly characterized as an NPI-licenser in a theory which takes this implication into account
and which assumes Strawson DE-ness (von Fintel, 1999).
21
(38)
a.
b.
Ed left before we were in the room.
9 Ed left before we were in the room standing by the window.
Ed left after we were in the room.
9 Ed left after we were in the room standing by the window.
However, it’s plausible that the lack of entailment in (38a) is due to the ability of before sentences
to receive three different sorts of interpretation (examples from Condoravdi, 2010).
(39)
a.
b.
c.
Ed left before the guests were in the room.
I decided to leave before there was any trouble.
The mice died before they showed any immune response.
veridical
counterfactual
non-committal
(39a) carries what some have characterized as a non-cancellable implicature that the guests were in
the room at some point (in particular, after Ed left). In contrast, the most natural interpretation
of (39b) is one in which it’s not the case that there was any trouble, hence the label ‘counterfactual’. Finally, (39c) is labeled ‘non-committal’ because it is acceptable in both a situation in which
the mice failed to show an immune response and a situation in which they showed a post-mortem
response.
Interestingly, there is a relationship between the availability of these readings and the a1 /a2
meaning alternation. Sentences that receive a veridical interpretation always receive a ‘before
earliest’ interpretation; while non-committal sentences like (39c) can receive either a ‘before earliest’
or a ‘before latest’ interpretation (examples based on Heinämäki, 1974).10
(40)
a.
b.
Sachi bought a Honda before they were considered fashionable.
Bill arrived in town before Ed built his house.
veridical
non-committal
(40a) is veridical: it entails or strongly implicates that Hondas were, at some past period of time,
considered fashionable. It also receives the a2 , ‘before the earliest’ interpretation described above:
it requires that Sachi bought a Honda before the earliest point in time that Hondas were considered
fashionable. On the other hand, (40b) is non-committal: it is compatible with, but does not require,
that Ed succeeded in building his house. In a context in which he did (or in a context in which he
began to build his house), however, the sentence receives the a1 , ‘before the latest’ interpretation.
It is true as long as Bill’s arrival preceded the completion of Ed’s house. It’s true, for instance, if
Bill’s arrival occurred halfway through Ed’s construction process.
There’s also a difference in telicity between the eventualities denoted by the TCs in (40). The
generalization introduced in the previous section states that atelic eventualities receive a ‘before
the earliest’ interpretation while telic events receive a ‘before the latest’ interpretation, and that’s
what (40) reflects. It’s been independently observed that the three interpretations in (39) correlate
to some extent with aktionsart: stative TCs always receive a veridical interpretation and accomplishments are non-committal (Heinämäki, 1974; Condoravdi, 2010). Consequently, the alternation
in (40), too, can be characterized in terms of an eventuality’s telicity.
There is a further connection between the a1 /a2 meaning alternation and the licensing of NPIs.
The presence of an NPI in a TC can change its denotation from an accomplishment to a plural
event, and thereby from an a1 interpretation to something like an a2 interpretation. (41) shows a
10
Sentences receiving a counterfactual interpretation – because they describe situations in which the b event has no
temporal extension – are irrelevant for the ‘before earliest’/‘before latest’ distinction.
22
minimal pair.
(41)
a.
b.
The bassist left town before John learned to play the drums.
The bassist left town before anyone learned to play the drums.
a1 , ‘before latest’
a2 , ‘before earliest’
The TC in (41a) denotes an accomplishment, and the sentence is non-committal; it’s consistent
with John having learned to play the drums, but it’s also consistent with John never having learned
to play the drums. With the former reading, it receives a ‘before the latest’ interpretation: it’s true
if the bassist left town at any time before the point at which John succeeded at learning the drums
(including mid-way through his learning it).
In contrast, in a scenario in which many of the town’s residents undertook to learn the drums at
various points in time, (41b) requires that the bassist left town prior to the first resident’s learning
of the drums (as opposed to the last resident’s). While this is clearly not a ‘before the latest’
interpretation, it might differ slightly than the ‘before the earliest’ interpretation of singular events
because the sentence seems compatible with the bassist having left town before the first resident
learned the drums, but after she started learning them.
In the case of the degree and location domains, I was able to show the full generality of the
meaning alternation: a switch from a downward-monotonic b interval conditioned a switch to the
positive antonym relational predicate participating in the meaning alternation. Unfortunately I
cannot show such a generalization in the domain of times, as event typology demonstrates that
there are events with teloi but no events with heterogenous starting points.
In sum: temporal prepositions, too, demonstrate a meaning alternation that can be couched
in terms of sensitivity to scale boundedness. In contrast to the other ordered domains, however,
before seems to privilege open bounds over closed ones. I’ve argued here in favor of the idea that
the domains pattern similarly in the end, and the difference in privileging might be linked to other
independent semantic properties of before. The plausibility of this suggestion will prove significant
to my claim that the a1 /a2 meaning alternation is a property of scalar domains generally.
References
Anscombe, G. (1964). Before and after. The Philosophical Review, 74:3–24.
Bach, E. (1986). The algebra of events. Linguistics and Philosophy, 9:5–16.
Bartsch, R. and Vennemann, T. (1972). The grammar of relative adjectives and of comparison. Linguistische Berichte,
20:19–32.
Beaver, D. and Condoravdi, C. (2003). A uniform analysis of before and after. Proceedings of SALT, XIII:37–57.
Beavers, J. (2008). Scalar complexity and the structure of events. In Dölling, J., Heyde-Zybatow, T., and Schäfer,
M., editors, Event structures in linguistic form and interpretation, pages 245–268. Mouton de Gruyter.
Beck, S. (2012). Lucinda driving too fast again. Journal of Semantics, 29:1–63.
Beck, S. and Rullmann, H. (1999). A flexible approach to exhaustivity in questions. Natural Language Semantics,
7:249–298.
Bresnan, J. (1973). Syntax of comparative clause construction in English. Linguistic Inquiry, 4:275–344.
Büring, D. (2007). More or less. In Paper presented at the Chicago Linguistic Society meeting, Chicago.
Burnett, H. (2012). The Grammar of Tolerance: On Vagueness, Context-Sensitivity, and the Origin of Scale Structure.
PhD Thesis, UCLA.
Caponigro, I. (2004). The semantic contribution of wh-words and type shifts: evidence from free relatives cross
linguistically. In Young, R. B., editor, Proceedings from Semantics and Linguistic Theory XIV, pages 38–55.
Carlson, L., Regier, T., and Covey, E. (2003). Defining spatial relations: Reconciling axis and vector representations.
In van der Zee, E. and Slack, J., editors, Representing Direction in Language and Space, pages 111–131. Oxford
University Press.
23
Chomsky, N. (1977). On wh-movement. In Formal syntax, pages 71–132. Academic Press.
Condoravdi, C. (2010). NPI licensing in temporal clauses. Natural Language and Linguistic Theory, 28:877–910.
Cresswell, M. (1976). The semantics of degree. In Partee, B., editor, Montague Grammar. Academic Press.
Davidson, D. (1969). The individuation of events. In Rescher, N., editor, Essays in honor of Carl G. Hempel, pages
216–234. Reidel.
Dowty, D. (1979). Word Meaning and Montague Grammar: The Semantics of Verbs and Times in Generative
Semantics and in Montagues PTQ. Reidel.
von Fintel, K. (1999). NPI-licensing, Strawson-Entailment, and context-dependency. Journal of Semantics, 16:97–148.
George, B. (2011). Question embedding and the semantics of answers. PhD thesis, UCLA.
Groenendijk, J. and Stokhof, M. (1989). Type-shifting rules and the semantics of interrogatives. In Chierchia, G.,
Partee, B., and Turner, R., editors, Properties, Types and Meanings, vol. 2: Semantic Issues, pages 21–68. Kluwer.
Hay, J., Kennedy, C., and Levin, B. (1999). Scale structure underlies telicity in ‘degree achievements’. In Proceedings
of SALT 9, pages 127–144. CLC.
Heim, I. (1985). Notes on comparatives and related matters. Ms., University of Texas, Austin.
Heim, I. (2000). Degree operators and scope. In Jackson, B. and Matthews, T., editors, Proceedings of SALT X, pages
40–64. CLC Publications.
Heim, I. (2006). Remarks on comparative clauses as generalized quantifiers.
Heim, I. (2007). Little. In Proceedings of SALT XVI. CLC Publications.
Heinämäki, O. (1974). Semantics of English temporal connectives. Thesis, University of Texas at Austin.
Hellan, L. (1981). Towards an Integrated Analysis of Comparatives. Narr, Tübingen.
Hoeksema, J. (1983). Negative polarity and the comparative. Natural language and linguistic theory, 1:403–434.
Jacobson, P. (1995). On the quantificiational force of English free relatives. In Bach, E., Jelinek, E., Kratzer, A., and
Partee, B., editors, Quantification in Natural Languages, pages 451–486. Kluwer.
Kamp, H. (1975). Two theories of adjectives. In Keenan, E., editor, Formal Semantics of Natural Language, pages
123–155. Cambridge University Press.
Kennedy, C. (1999). Projecting the Adjective: The Syntax and Semantics of Gradability and Comparison. Garland.
Kennedy, C. and McNally, L. (1999). From event structure to scale structure: degree modificatoin in deverbal
adjectives. In Proceedings of SALT 9, pages 163–180. CLC.
Kennedy, C. and McNally, L. (2005). Scale structure, degree modification and the semantic typology of gradable
predicates. Language, 81(2):345–381.
Klein, E. (1980). A semantics for positive and comparative adjectives. Linguistics and Philosophy, 4:1–45.
Klein, E. (1982). The interpretation of adjectival comparatives. The Journal of Linguistics, 18:113–136.
Krifka, M. (1989). Nominal reference, temporal constitution and quantification in event semantics. In Bartsch, R.,
van Bentham, J., and van Emde Boas, P., editors, Semantics and contextual expression, pages 75–115. Foris.
Krifka, M. (1990). Four thousand ships passed through the lock: Object-induced measure functions on events.
Linguistics and Philosophy, 13:487–520.
Krifka, M. (1992). Thematic relations as links between nominal reference and temporal consitution. In Sag, I. and
Scabolcsi, A., editors, Lexical Matters, pages 29–54. CSLI Publications.
Krifka, M. (1995). The semantics and pragmatics of polarity items. Linguistic Analysis, 25:209–257.
Krifka, M. (2010). Before and after without coercion: comment on the paper by Cleo Condoravdi. Natural Language
and Linguistic Theory, 28:911–929.
Ladusaw, W. (1979). Polarity as Inherent Scope Relations. PhD Thesis, University of Texas, Austin.
Landau, B. and Jackendoff, R. (1993). “what” and “where” in spatial language and spatial cognition. Behavioral and
Brain Sciences, 16:217–265.
Larson, R. (1988). Scope and comparatives. Linguistics and Philosophy, 11:1–26.
Lechner, W. (2001). Reduced and phrasal comparatives. Natural Language and Linguistic Theory, 19(4):683–735.
Lechner, W. (2004). Ellipsis in comparatives. Mouton de Gruyter.
Lehrer, A. (1985). Markedness and antonymy. Journal of linguistics, 21:397–429.
Link, G. (1983). The logical analysis of plurals and mass terms: a lattice-theoretical approach. In Baüerle, R.,
Schwarze, C., and von Stechow, A., editors, Meaning, Use and Interpretation of Language, pages 302–323. Walter
de Gruyter.
Logan, G. and Sadler, D. (1996). A computational analysis of the apprehension of spatial relations. In Bloom, P.,
Peterson, M., Nadel, L., and Garrett, M., editors, Language and space, pages 493–529. MIT Press.
McConnell-Ginet, S. (1973). Comparative Constructions in English: A Syntactic and Semantic Analysis. PhD Thesis,
University of Rochester.
Meier, C. (2002). Maximality and minimality in comparatives. In Katz, G., Reinhard, S., and Reuter, P., editors,
Sinn und Bedeutung VI, pages 275–287.
Moens, M. and Steedman, M. (1988). Temporal ontology and temporal reference. Computational Linguistics, 14:15–28.
O’Keefe, J. (1996). The spatial prepositions in english, vector grammar, and the cognitive map theory. In Bloom, P.,
24
Peterson, M., Nadel, L., and Garrett, M., editors, Language and Space, pages 277–316. MIT Press.
Paradis, C. (2001). Adjectives and boundedness. Cognitive Linguistics, 12:47–65.
Pesetsky, D. (1987). Wh-in-situ: movement and unselective binding. In Reuland, E. and ter Meulen, A., editors, The
representation of (in)definiteness, pages 98–129. MIT Press.
Rett, J. (2011). Exclamatives, degrees and speech acts. Linguistics and Philosophy, 34:411–442.
Rotstein, C. and Winter, Y. (2004). Total adjectives vs. partial adjectives: Scale structure and higher-order modifiers.
Natural Language Semantics, 12:259–288.
Rullmann, H. (1995). Maximality in the semantics of wh-constructions. Thesis, University of Massachusetts, Amherst.
Schwarzschild, R. (2008). The semantics of the comparative and other degree constructions. Language and Linguistic
Compass, 2(2):308–331.
Seuren, P. (1973). The comparative. In Keifer, F. and Ruwet, N., editors, Generative Grammar in Europe. Reidel.
Seuren, P. (1984). The comparative revisited. Journal of Semantics, 3:109–141.
von Stechow, A. (1984). Comparing semantic theories of comparison. Journal of Semantics, 3:1–77.
Vendler, Z. (1957). Verbs and times. The Philosophical Review, 66:143–160.
Williams, E. (1977). Discourse and logical form. Linguistic inquiry, 8:101–139.
Yoon, Y. (1996). Total and partial predicates and the weak and strong interpretations. Natural Language Semantics,
4:217–236.
Zwarts, F. (1995). Nonveridical contexts. Linguistic Analysis, 25:286–312.
Zwarts, J. (1997). Vectors as relative positions: a compositional semantics of modified PPs. Journal of Semantics,
14:57–86.
25

Download Report

Space, Time, and Antonymy

Paperzz.com

Your Paperzz