Hapax legomenon
A hapax legomenon (pronounced /ˈhæpɨks lɨˈɡɒmənɒn/ or [ˈheɪpæks]) (pl. hapax legomena, sometimes abbreviated to hapaxes) is a word which occurs only once in either the written record of a language, the works of an author, or in a single text. While technically incorrect, the term is also sometimes used of a word that occurs in only one of an author's works, even though it occurs more than once in that work. Hapax legomenon is a direct transliteration from the Greek form ἅπαξ λεγόμενον, meaning "[something] said [only] once".
The related terms dis legomenon, tris legomenon, and tetrakis legomenon refer respectively to double, triple, or quadruple occurrences, but are far less commonly used.
Hapax legomena are quite common, as a consequence of Zipf's Law, which states that the frequency of any word in a work or corpus is inversely proportional to its rank in the frequency table. For large corpora, about 40% to 60% of the words are hapax legomena, and another 10% to 15% are dis legomena.
Note that the term hapax legomenon refers to a word's appearance in a body of text, not to its origins, nor to its prevalence in speech. It thus differs from a nonce word, which may never be recorded, or may find currency and be recorded widely, or may appear several times in the work which coins it, and so on.
Significance
Hapax legomena in texts pose difficulties in translation and decipherment, particularly when the words in question are used only once in the entire record of an ancient language. Inferring meaning from context is easier and more certain when there are multiple contexts to compare. For example, many of the remaining undeciphered Mayan glyphs are hapax legomena, and Biblical hapax legomena (particularly in Hebrew) pose sometimes difficult issues in translation. Hapax legomena also pose challenges in natural language processing.
Some scholars have considered hapax legomena important for attributing works to particular authors. For example, each of Shakespeare's plays contains a similar percentage of hapax legomena not found elsewhere in his work.
P.N. Harrison, in The Problem of the Pastoral Epistles (1921) made hapax legomena popular among Bible scholars, when he argued that there are considerably more of them in the three Pastoral Epistles than in other Pauline Epistles. He argued that the number of hapax legomena in a putative author's corpus indicates his or her vocabulary and is characteristic of the author as an individual.
This theory has faded in significance due to a number of problems raised by others. For example, in 1896, W.P. Workman found the following numbers of hapax legomena in each Pauline Epistle: Rom. 113, I Cor. 110, II Cor. 99, Gal. 34, Eph. 43 Phil. 41, Col. 38, I Thess. 23, II Thess. 11, Philem. 5, I Tim. 82, II Tim. 53, Titus 33. Obviously, the last three totals (for the Pastoral Epistles) are not out of line with the others.
Apart from author identity, there are several other factors that affect the number of hapax legomena, such as:
text length: this directly affects the expected number and percentage of hapax legomena; the brevity of the Pastoral Epistles also makes any statistical analysis problematic. text topic: if the author writes on different subjects, of course many subject-specific words will occur only in limited contexts. text audience: if the author is writing to a peer rather than a student, or their spouse rather than their employer, again quite different vocabulary will appear. time: over the course of years, both the language itself and a given author's knowledge and use of language, will change.
In the particular case of the Pastoral Epistles, all of these variables are quite different than in the rest of the Pauline Corpus, and hapax legomena are no longer widely accepted as a strong indicator of authorship (although the authorship of the Pastorals is subject to debate on other grounds).
There are also subjective questions over whether two forms amount to "the same word": dog vs dogs, clue vs clueless, sign vs signature, and many other grey cases arise. The Jewish Encyclopedia points out that although there are 1500 hapaxes in the Old Testament, only about 400 of those are not obviously related to other attested word forms.
It would not be especially difficult for a forger to construct a work with any percentage of hapax legomena desired. However, it seems unlikely that forgers much before the 20th century would have thought of such a ploy, much less thought it worth the effort.
A final difficulty with the use of hapax legomena for authorship determination is that there is considerable variation among works known to be by a single author, and sometimes disparate authors show very similar values. In other words, it is not a reliable indicator. Authorship studies now usually use a wide range of measures, and look for a pattern across them, rather than relying on a single measurement.
Examples
Some examples of hapax legomena in a given language or body of work are:
Hebrew examples
Gvina (גבינה - cheese) is a hapax legomenon of Biblical Hebrew, found only in Job 10:10. The word has become extremely common in modern Hebrew. Similarly, the word Ḥashmal (חשמל) appears only in Ezekiel 1 in the Bible. Today it is used to refer to electricity. Akut (אקוט - fought), only has one mention in the Hebrew Bible. It is only mentioned in Psalms 95:10. Isaiah 34:14, describing the desolation of Edom, has the only occurrence of Lilith in the Hebrew Bible. Gopher wood is mentioned once in the Bible, at Genesis 6:14, in the instruction to make Noah's ark "of gopher wood". Because of the single appearance, the literal meaning is lost. Gopher is simply a transliteration, although scholars today tentatively suggest that the wood intended is cypress.
Greek examples
Autoguos (αυτογυος), an ancient Greek word for a sort of plough, is found once (and exclusively) in Hesiod, the precise meaning remaining obscure. Panaorios (παναωριος), ancient Greek for "very untimely", is one of many hapax legomena of the Iliad. The Greek New Testament contains 686 hapax legomena, of which 62 occur in 1 Peter, and 54 occur in 2 Peter.
Italian examples
Ramogna is mentioned only once in Italian literature, precisely in Dante's Divina Commedia (Purgatory XI, 25). Trasumanar is another hapax legomenon mentioned in Dante's Divina Commedia (Paradise I, 70).
English examples
Honorificabilitudinitatibus is a hapax legomenon of Shakespeare's works. Nortelrye, a word for "education", occurs exactly once in Chaucer. Slæpwerigne occurs exactly once in the Old English corpus, in the Exeter Book. There is debate over whether it means "weary with sleep" or "weary for sleep". Flother, a synonym for snowflake, is a hapax legomenon of written English pre-1900, found in a manuscript from around 1275. In the Brown Corpus of American English, about half of the 50,000 words are hapax legomena.
Translation
The phrase "Hapax legomenon" occurs as such in the following languages: English, Czech, German, Esperanto, Croatian, Norwegian (Nynorsk), Low Saxon, Simple English, Finnish, Turkish.
Translation(s) in other languages: Arabic: كلمة فريدة, Asturian: Hapas, Spanish: Hápax, Basque: Hapax, French: Hapax, Italian: Hapax, Hebrew: מילה יחידאית, Dutch: Hapax, Russian: Гапакс.
|