
- •Contents
- •Figures
- •Tables
- •Contributors
- •Preface
- •Acknowledgements
- •1 Overview
- •1.1 Introduction
- •1.2 The roots of English
- •1.3 Early history: immigration and invasion
- •1.4 Later history: internal migration, emigration, immigration again
- •1.5 The form of historical evidence
- •1.6 The surviving historical texts
- •1.7 Indirect evidence
- •1.8 Why does language change?
- •1.9 Recent and current change
- •2 Phonology and morphology
- •2.1 History, change and variation
- •2.2 The extent of change: ‘vertical’ and ‘horizontal’ history
- •2.3 Tale’s end: a sketch of ModE phonology and morphology
- •2.3.1 Principles
- •2.3.2 ModE vowel inventories
- •2.3.3 ModE consonant inventories
- •2.3.4 Stress
- •2.3.5 Modern English morphology
- •2.4 Old English
- •2.4.1 Time, space and texts
- •2.4.2 The Old English vowels
- •2.4.3 The Old English consonants
- •2.4.4 Stress
- •2.4.5 Old English morphology
- •2.4.5.1 The noun phrase: noun, pronoun and adjective
- •2.4.5.2 The verb
- •2.4.6 Postlude as prelude
- •2.5 The ‘OE/ME transition’ to c.1150
- •2.5.1 The Great Hiatus
- •2.5.2 Phonology: major early changes
- •2.5.2.1 Early quantity adjustments
- •2.5.2.2 The old diphthongs, low vowels and /y( )/
- •2.5.2.3 The new ME diphthongs
- •2.5.2.4 Weak vowel mergers
- •2.5.2.5 The fricative voice contrast
- •2.6.1 The problem of ME spelling
- •2.6.2 Phonology
- •2.6.2.2 ‘Dropping aitches’ and postvocalic /x/
- •2.6.2.4 Stress
- •2.6.3 ME morphology
- •2.6.3.2 The morphology/phonology interaction
- •2.6.3.3 The noun phrase: gender, case and number
- •2.6.3.4 The personal pronoun
- •2.6.3.5 Verb morphology: introduction
- •2.6.3.6 The verb: tense marking
- •2.6.3.7 The verb: person and number
- •2.6.3.8 The verb ‘to be’
- •2.7.1 Introduction
- •2.7.2 Phonology: the Great Vowel Shift
- •2.7.4 English vowel phonology, c.1550–1800
- •2.7.5 English consonant phonology, c.1550–1800
- •2.7.5.1 Loss of postvocalic /r/
- •2.7.5.2 Palatals and palatalisation
- •2.7.5.3 The story of /x/
- •2.7.6 Stress
- •2.7.7 English morphology, c.1550–1800
- •2.7.7.1 Nouns and adjectives
- •2.7.7.2 The personal pronouns
- •2.7.7.3 Pruning luxuriance: ‘anomalous verbs’
- •2.8.1 Preliminary note
- •2.8.2 Progress, regress, stasis and undecidability
- •2.8.2.1 The evolution of Lengthening I
- •2.8.2.2 Lengthening II
- •3 Syntax
- •3.1 Introduction
- •3.2 Internal syntax of the noun phrase
- •3.2.1 The head of the noun phrase
- •3.2.2 Determiners
- •3.3 The verbal group
- •3.3.1 Tense
- •3.3.2 Aspect
- •3.3.3 Mood
- •3.3.4 The story of the modals
- •3.3.5 Voice
- •3.3.6 Rise of do
- •3.3.7 Internal structure of the Aux phrase
- •3.4 Clausal constituents
- •3.4.1 Subjects
- •3.4.2 Objects
- •3.4.3 Impersonal constructions
- •3.4.4 Passive
- •3.4.5 Subordinate clauses
- •3.5 Word order
- •3.5.1 Introduction
- •3.5.2 Developments in the order of subject and verb
- •3.5.3 Developments in the order of object and verb
- •3.5.5 Developments in the position of particles and adverbs
- •3.5.6 Consequences
- •4 Vocabulary
- •4.1 Introduction
- •4.1.1 The function of lexemes
- •4.1.3 Lexical change
- •4.1.4 Lexical structures
- •4.1.5 Principles of word formation
- •4.1.6 Change of meaning
- •4.2 Old English
- •4.2.1 Introduction
- •4.2.4 Word formation
- •4.2.4.1 Noun compounds
- •4.2.4.2 Compound adjectives
- •4.2.4.3 Compound verbs
- •4.2.4.7 Zero derivation
- •4.2.4.8 Nominal derivatives
- •4.2.4.9 Adjectival derivatives
- •4.2.4.10 Verbal derivation
- •4.2.4.11 Adverbs
- •4.2.4.12 The typological status of Old English word formation
- •4.3 Middle English
- •4.3.1 Introduction
- •4.3.2 Borrowing
- •4.3.2.1 Scandinavian
- •4.3.2.2 French
- •4.3.2.3 Latin
- •4.3.3 Word formation
- •4.3.3.1 Compounding
- •4.3.3.4 Zero derivation
- •4.4 Early Modern English
- •4.4.1 Introduction
- •4.4.2 Borrowing
- •4.4.2.1 Latin
- •4.4.2.2 French
- •4.4.2.3 Greek
- •4.4.2.4 Italian
- •4.4.2.5 Spanish
- •4.4.2.6 Other languages
- •4.4.3 Word formation
- •4.4.3.1 Compounding
- •4.5 Modern English
- •4.5.1 Introduction
- •4.5.2 Borrowing
- •4.5.3 Word formation
- •4.6 Conclusion
- •5 Standardisation
- •5.1 Introduction
- •5.2 The rise and development of standard English
- •5.2.1 Selection
- •5.2.2 Acceptance
- •5.2.3 Diffusion
- •5.2.5 Elaboration of function
- •5.2.7 Prescription
- •5.2.8 Conclusion
- •5.3 A general and focussed language?
- •5.3.1 Introduction
- •5.3.2 Spelling
- •5.3.3 Grammar
- •5.3.4 Vocabulary
- •5.3.5 Registers
- •Electric phenomena of Tourmaline
- •5.3.6 Pronunciation
- •5.3.7 Conclusion
- •6 Names
- •6.1 Theoretical preliminaries
- •6.1.1 The status of proper names
- •6.1.2 Namables
- •6.1.3 Properhood and tropes
- •6.2 English onomastics
- •6.2.1 The discipline of English onomastics
- •6.2.2 Source materials for English onomastics
- •6.3 Personal names
- •6.3.1 Preliminaries
- •6.3.2 The earliest English personal names
- •6.3.3 The impact of the Norman Conquest
- •6.3.4 New names of the Renaissance and Reformation
- •6.3.5 The modern period
- •6.3.6 The most recent trends
- •6.3.7 Modern English-language personal names
- •6.4 Surnames
- •6.4.1 The origin of surnames
- •6.4.2 Some problems with surname interpretation
- •6.4.3 Types of surname
- •6.4.4 The linguistic structure of surnames
- •6.4.5 Other languages of English surnames
- •6.4.6 Surnaming since about 1500
- •6.5 Place-names
- •6.5.1 Preliminaries
- •6.5.2 The ethnic and linguistic context of English names
- •6.5.3 The explanation of place-names
- •6.5.4 English-language place-names
- •6.5.5 Place-names and urban history
- •6.5.6 Place-names in languages arriving after English
- •6.6 Conclusion
- •Appendix: abbreviations of English county-names
- •7 English in Britain
- •7.1 Introduction
- •7.2 Old English
- •7.3 Middle English
- •7.4 A Scottish interlude
- •7.5 Early Modern English
- •7.6 Modern English
- •7.7 Other dialects
- •8 English in North America
- •8.1.1 Explorers and settlers meet Native Americans
- •8.1.2 Maintenance and change
- •8.1.3 Waves of immigrant colonists
- •8.1.4 Character of colonial English
- •8.1.5 Regional origins of colonial English
- •8.1.6 Tracing linguistic features to Britain
- •8.2.2 Prescriptivism
- •8.2.3 Lexical borrowings
- •8.3.1 Syntactic patterns in American English and British English
- •8.3.2 Regional patterns in American English
- •8.3.3 Dictionary of American Regional English (DARE)
- •8.3.4 Atlas of North American English (ANAE)
- •8.3.5 Social dialects
- •8.3.5.1 Socioeconomic status
- •8.3.6 Ethnic dialects
- •8.3.6.1 African American English (AAE)
- •8.3.6.2 Latino English
- •8.3.7 English in Canada
- •8.3.8 Social meaning and attitudes
- •8.3.10 The future of North American dialects
- •Appendix: abbreviations of US state-names
- •9 English worldwide
- •9.1 Introduction
- •9.2 The recency of world English
- •9.3 The reasons for the emergence of world English
- •9.3.1 Politics
- •9.3.2 Economics
- •9.3.3 The press
- •9.3.4 Advertising
- •9.3.5 Broadcasting
- •9.3.6 Motion pictures
- •9.3.7 Popular music
- •9.3.8 International travel and safety
- •9.3.9 Education
- •9.3.10 Communications
- •9.4 The future of English as a world language
- •9.5 An English family of languages?
- •Further reading
- •1 Overview
- •2 Phonology and morphology
- •3 Syntax
- •4 Vocabulary
- •5 Standardisation
- •6 Names
- •7 English in Britain
- •8 English in North America
- •9 English worldwide
- •References
- •Index

4 Vocabulary
Dieter Kastovsky
4.1Introduction
4.1.1 |
The function of lexemes |
|
The ‘normal’ native speaker of a language will probably regard sounds and words as the most basic building blocks of a language, because they are the elements which can be perceived most easily without specific training. We put sounds together to form words, and we put words together to form larger structures, i.e. sentences realised as utterances (at least that is what we think we are doing). Therefore, speakers tend to react most readily to variation and change of pronunciation and vocabulary. Words, or rather lexical items or lexemes (= dictionary entries), can be regarded as intermediate elements between the level of sounds and the level of syntactic structure. But there are some additional reasons having to do with the existence of certain types of lexemes and their function why lexemes have this bridge function between phonology, morphology, syntax and the lexicon (see Section 4.1.5. below).
Lexemes are the means by which we make direct reference to extralinguistic reality, converting our basic perception of the world around us into language. Their basic function thus is to serve as labels for segments of extralinguistic reality which a speech community finds nameworthy, so that it can talk about it in a simple and direct way. And it is only when one has such ‘nameworthy’ entities that one can talk about their relationship (more precisely the relationship between their referents) in terms of more complex grammatical structures such as sentences (utterances): first one has to identify something to talk about before one can talk about it, i.e. one has to know ‘who is doing what to whom’. This is why lexemes are one of the most central categories in language: it is their existence that links the perception of more or less discrete extralinguistic phenomena (like bushes or trees, apes or monkeys, types of movement such as walk, fly, swim, manners of producing sounds such as talk, shout, whisper, bark, meow, chirp, etc.) to our perception of the world around us. On the other hand, it is often the case that the very existence of a lexeme creates a demarcation in an otherwise continuous extralinguistic referent. Thus in terms of its physical nature, the colour spectrum is continuous; linguistically it is divided into segments, whose number varies
199

200 D I E T E R K A S T O V S K Y
from language to language. Similarly, there is no visible dividing line between chin and cheek, but the existence of the lexemes chin and cheek forces us to make this distinction, which does not necessarily exist in all languages. In English we always make a distinction between hand and arm, foot and leg, but in Polish this
distinction is optional, and normally reka is used for both arm and hand, and
noga for both foot and leg. Lexemes thus not only act as labels for referents that have an autonomous extralinguistic existence, they also create such referents by introducing distinctions that are not given a priori in extralinguistic reality. Even closely related languages, such as English and German, or Italian, French and Spanish, will differ considerably as to the overall structure of their vocabulary, because of the different communicative needs of the speech communities involved and the vagaries of historical development. Thus why would German want to make a distinction between the ingestion of food or drink by human beings and animals in the form of essen (human being) : fressen (animal, but extended to human beings if they behave like an animal), trinken (human being) : saufen (animals, but also human beings who overdo it with a specific type of beverage, i.e. alcohol), when English simply makes do with eat : drink; and why, at least until recently, did British English differentiate between coach (long-distance) and bus (local), when German just has (Auto)-Bus?
The same is true of different historical stages of one and the same language, and for English this is perhaps even more striking than for any other European language. Its vocabulary structure was transformed profoundly between the first records of English and today, so that what we find in the eighth century at the level of vocabulary is basically unrecognisable to us, and today’s English would also be equally alien to someone from the eighth century, even disregarding phonological and morphological changes.
A simple example, viz. an OE Bible passage and its early Modern English and Modern English equivalents, might serve to illustrate this (the lexical differences between OE and the other periods are indicated by italics):
So¯þl¯ıce˙ on þam¯ dagum wæs geworden˙ gebod˙ fram þam¯ casere¯ |
Augusto, þæt |
|||||
eall ymbehwyrft wære¯ tomearcod¯ |
. þeos¯ tomearcodnes¯ |
wæs ærest¯ geworden |
||||
fram þam¯ deman¯ |
Syrige˙ Cir¯ıno. And ealle hig˙ eodon¯ |
and syndrige˙ ferdon¯ |
on |
|||
hyra ceastre˙. |
(West Saxon translation, early eleventh century) |
|
And it chaunced in thoose dayes that ther went oute a commaundment from Auguste the Emperour, that all the woorlde shuld be taxed. And this taxynge was the fyrst and executed when Syrenius was leftenaunt in Syria. And every man went vnto his awne citie to be taxed. (Early Modern English, Tyndale 1534)
Truely in these days it happened that an order was issued by the Emperor
Augustus that the whole world should be assessed for taxes. This taxing was the first and was implemented when Syrenus was judge in Syria. And everybody went to their place of birth to be taxed. (Modern English, translation DK)

Vocabulary 201
4.1.2 |
The stratification of the vocabulary |
|
An important aspect here is the fact that the vocabulary used by a speech community exhibits a complex, multidimensional stratification. For Modern English, the following dimensions of linguistic variation have become established (cf., e.g., Quirk et al., 1985:15ff.): (a) region, (b) social group, (c) field of discourse, (d) medium, (e) attitude.
There is no reason to assume that the situation was radically different in earlier periods. We know that there had always been dialectal differences, and these were fairly great. Thus the author of the Cursor Mundi, a northern poem from about 1300, remarks that he had to translate a text written in Southern English into Northern English for the benefit of the northerners who were not familiar with other forms of English:
In sotherin englis was it draun, |
‘In Southern English was it written, |
And turn it haue I till our aun |
and I have turned it into our own |
Langage o northrin lede |
language of the Northern people, |
þat can nan oiþer englis rede |
who can’t read any other English.’ |
There must certainly also always have been differences according to the field of discourse, insofar as poetic diction clearly differed from prose diction, and the same is true of technical texts such as medical, legal, religious ones, etc. But our picture of this variation is rather patchy for the earlier periods due to the restricted documentation that we have. This is true, in particular, as regards the difference between spoken and written language. Thus until the late Middle Ages we only have records of written language and the registers that go with it. This reflects a very limited degree of variation, since only a fraction of the population was literate. So we do not really know anything about the lexical variation of spoken English. It is only when private letters, court proceedings and plays begin to appear that we get a glimpse of the spoken language, too, which is at the end of the ME period, but above all from the early Modern English period onwards.
4.1.3 |
Lexical change |
|
If the vocabulary of a language reflects the perception of the world by a speech community, it will have to be constantly adapted to its changing needs. Therefore, the vocabulary is as much a reflection of deep-seated cultural, intellectual and emotional interests, perhaps even of the whole Weltbild of a speech community, as are the texts that have been produced by its members. The systematic study of the vocabulary of a language thus is an important contribution to the understanding of the culture and civilisation of a speech community over and above the analysis of the texts in which this vocabulary is put to communicative use. And the history of the vocabulary of a speech community is a reflection of its general history, since both innovations and losses document changes in the social needs of this community arising from the pressure to adapt to changing external

202 D I E T E R K A S T O V S K Y
circumstances. The vocabulary of a language thus is also a link to the material and spiritual culture of its speakers; conversely, without knowing this material and spiritual culture one might often not know what someone is talking about. Let me illustrate this with two examples I came across during a recent stay in the US.
In a bar in New York I saw the following ad: When was the last time you were Jagermeistered?¨ In order to understand this, it is not enough to know that English has an expression to wine and dine a person (with the passive to be wined and dined), which in turn is based (at least for wine) on the pattern to water flowers, salt soup, butter the bread ‘provide something with x’, formally parallel to the verbs bomb, knife, stone, guillotine ‘attack, kill someone by x’. One also has to know that Jagermeister¨ fits into the wining/dining scenario and not the attack scenario, i.e. that this is a German liqueur and not the name of a person or instrument. Thus, on the basis of all this (extra)linguistic knowledge, one can eventually arrive at the interpretation ‘When were you last treated to a J¨agermeister?’ During the same trip I also learned that the doggy-bags, which have been an important part of American culture for many years, are now replaced by go-boxes, at least in some areas. Without the restaurant context and the familiar look of the bag (go-boxes still come in brown bags) I would probably not have been able to figure out what a go-box is.
Many changes in the English vocabulary are due to massive borrowing from the languages with which English came into contact in the course of its history, and this has also had far-reaching repercussions for morphology and phonology. If we take a bird’s eye view of the vocabulary of Modern English, we cannot but be amazed by its overall size: estimates range between 700,000 and more than a million documented lexical items; the second edition of the OED, our main source for the history of the English vocabulary, contains about 616,500 lexical items according to its own estimate (OED 1:xxiii), but it certainly makes reference to many more via suband run-on entries, although, admittedly, this source also contains much that is no longer used. But it is not just the sheer size of the vocabulary, it is also its heterogeneity which is remarkable. Almost unlimited borrowing since the tenth and eleventh centuries – first from Latin, then from Scandinavian, then from French, then again from Latin and Greek, and finally from almost any language English came into contact with – is responsible for this situation. As a result of its history, English is far less resistant than any other European language to borrowing, and since the battle against the ‘hard words’ in the seventeenth century there has never been any serious campaign against foreign elements comparable to the government-decreed measures in France, Slovakia, or Russia, or the Germanisation movement in Germany in the first half of the twentieth century. About 70 per cent of present-day English vocabulary consists of loans, with loans from French and/or Latin (including Greek and Neo-Latin) taking up the lion’s share. Just over 350 languages have contributed to this wealth (see Section 9.4). When, in contradistinction, we look at recorded OE lexemes, only about 3 per cent are loans, basically from Latin (and often not recognisable as

Vocabulary 203
such); the proportion of loans in the early Modern English neologism vocabulary is estimated at 40 to 50 per cent (see Scheler, 1977: 74), i.e. still much lower than today.
The following is only a small selection illustrating the range of languages that have contributed to English vocabulary (Latin, Greek and French will be left out here, because their influence will be discussed separately below): American Indian (caucus, moose, racoon), Arabic (alcohol, assassin, zero), Chinese (ketchup, tea, wok), Czech (gherkin, robot, vampire), Dutch (brandy, cookie, landscape), Finnish (mink, sauna), German (kindergarten, sauerkraut, snorkel), Hebrew (cherub, jubilee), Hindi (bungalow, dinghy, shampoo), Hungarian (goulash, paprika), Italian (aria, balcony, lava, mafia, opera, piano, spaghetti), Japanese (futon, soy, sushi), Mexican Indian (avocado, chocolate, tomato), Persian (arsenic, lilac, pyjamas), Portuguese (buffalo, marmalade, port), Russian (bistro, mammoth, sputnik, vodka), Sanskrit (candy, indigo, jungle), Spanish (cafeteria, cash, cockroach, sherry, siesta), Tahitian (tattoo), Tamil (catamaran, cheroot, mango), Tongan (taboo), Turkish (caftan, coffee, scarlet, yoghurt), Yiddish (bagel, glitzy, kosher, kugel, schmaltz(y), schlep, schmooze, yenta); see Hughes (2000: 365ff.).
As a consequence especially of the borrowings from French, Latin and Greek, the morphophonemic and word-formation systems of Modern English are also heterogeneous, because they involve different strata, i.e. a native and a nonnative one. This is most conspicuous if we look at lexical families which are based on a common meaning. In languages such as German, and for that matter OE, there are usually etymological, formal-morphological ties between the members of such families, i.e. the members are derived by word-formation processes from a common basis – the vocabulary is ‘associated’. In Modern English, on the other hand, the vocabulary is often ‘dissociated’, because semantically related words are unrelated etymologically; cf. the following English and German examples:
(1) |
gall |
: |
bilious |
Galle |
: |
gallig |
|
mouth |
: |
oral |
Mund |
: |
mundlich¨ |
|
eye |
: |
oculist, ophthalmologist |
Auge |
: |
Augenarzt |
|
father |
: |
paternal/fatherly |
Vater |
: |
vaterlich¨ |
|
moon |
: |
lunar vehicle |
Mond |
: |
Mondfahrzeug |
Thus semantic relationships within the vocabulary are much more transparent in German than they are in English. But this has not always been the case. In OE the situation was much like that in Modern High German. The vocabulary was associative, i.e. based on a systematic exploitation of native word-formation patterns, which produced extensive, transparent word families. Cf. the following selected list of compounds and derivatives related to the verbs gan/gangan ‘to go’, which is fairly typical of the overall situation in OE (notice the many Romance equivalents in the glosses, replacing the original Germanic lexemes):