|
The term umlaut is used for two closely related notions: a special kind of vowel modification and a particular diacritic mark.
Vowel modification
In linguistics, the process of umlaut (from German
um- "around", "transformation" + laut "sound") is a modification of a vowel which causes it to be pronounced more to the front of the mouth to accommodate a vowel in the following
syllable, especially when that syllable is an inflectional
suffix. This process is found in many — especially Germanic — languages.
For example, the German noun Mann (man) with the a pronounced as in English "father", becomes Männer [m'En@r, m'En6] in the plural, with the ä pronounced
like the ai in "hair", a front vowel sound that is assimilated to the vowel in the -er suffix. The original
conditioning environment in German was an i or j in the following syllable (the plural suffix originally was
-ir). Later, umlaut acquired a grammatical function and was extended by analogy, for example to form plurals like
Ofen ['o:f@n] / Öfen ['2:f@n] ("oven"/"ovens"). Note that English, being a Germanic language, has preserved
some of these changes in irregular inflected forms such as man/men, tooth/teeth, long (adj)/length
(n), ox/oxen, etc., even though it has lost the suffixes that originally caused them, and has changed their
spelling. In English, the process was called i-mutation.
An umlaut should be distinguished from a change in vowel indicating a difference in grammatical function, called an ablaut, as in sing/sang/sung. Ablaut originated in the Proto-Indo-European language, whereas umlaut originated later, in
Proto-Germanic. These terms may also be used for similar changes in
other language families.
Diacritical mark
The word is also used to refer to the diacritical mark composed of two small
dots placed over a vowel (¨) to indicate this change in German (the same mark is used to indicate diaeresis in
other languages). The origin of the graphical symbol lies in the following e, which was originally written above in tiny
form. In handwritings of the Middle Ages until (in Germany) 1941, the e looked like two tiny strokes
(compare the Sütterlin minuscule "e"). In script form this simplified to two
bars above the letter. These bars became confused with the diaeresis, and are now normally written as two small dots above. The
umlauts are ä, ö, and ü.
In Finnish, Hungarian and North Germanic
languages (i.e., Danish, Icelandic, Norwegian and
Swedish) characters (ü, ä, ö,
and å) looking similar to German umlauts are in fact considered letters in their own right, despite
their representing sounds similar to the corresponding sounds in German. As it is not a case of marking grammatical variation,
i.e., of tempus or modus, nor of syllable
modification, it is in fact neither a case of umlaut nor of diacritical marking. Hence it ought to be improper to call these
characters umlauts; however, there is no more precise descriptor in English.
When typing in German, if umlaut letters are not available, they
are usually replaced by the underlying vowel and a following e. In Switzerland, capital umlauts are sometimes printed as digraphs, i.e., Ae, Oe, Ue, instead of Ä, Ö,
Ü.
Entering umlauts in HTML
In HTML umlauts can be entered with an &?uml; entity reference. All
umlauts are part of all latin versions of the ISO 8859 character sets and thus have
the same codepoints in ISO-8859-1 (-2, -3, -4, -9, -10, -13, -14, -15, -16) and Unicode. See the following
table:
Umlauts in HTML
| Character |
Replacement |
HTML entity |
Codepoint |
| ä |
ae |
ä |
0x00E4 |
| ö |
oe |
ö |
0x00F6 |
| ü |
ue |
ü |
0x00FC |
| Ä |
Ae |
Ä |
0x00C4 |
| Ö |
Oe |
Ö |
0x00D6 |
| Ü |
Ue |
Ü |
0x00DC |
In addition to the umlauts given above, the dotted vowels below may, or may not, be valid
in different alphabets:
dotted vowels (no umlauts)
| Character |
Replacement |
HTML entity |
Codepoint |
| ë |
(none) |
ë |
0x00EB |
| ï |
(none) |
ï |
0x00EF |
| ÿ |
(none) |
ÿ |
0x00FF |
| Ë |
(none) |
Ë |
0x00CB |
| Ï |
(none) |
Ï |
0x00CF |
| Ÿ |
(none) |
Ÿ |
0x0178 |
See also
|