Orthography: Vowels, Glides and Diphthongs
Posted: Thu Nov 15, 2018 11:36 am
One thing I always struggle with is the choice about how to analyse and write vowels, glides and diphthongs, and what the best choice is. In natural languages, vowels and their associated semi-vowels often seem to be in almost but not quite complementary distribution, so that in most but not all cases you could predict the syllabification and whether the (semi-)vowel was the nucleus or not. If the choice is made to write vowels and semi-vowels differently, there's then another choice of whether any diphthongs exist (written as sequences of orthographic vowels) or if all potential complex nuclei are a combination of a vowel and a semi-vowel.
I think there are three basic common orthographic combinations, plus some hybrid patterns and non-phonemic orthographies. Spanish, for example, show both patterns sometimes, with <y> and <i> representing basically the same sound in different contexts, and English uses a famously complex system for orthographically representing vowel qualities.
Romanian: Vowel (i) = Semivowel (i) = Diphthong (Vi)
Quechua: Vowel (i) != (Semivowel (j), Combination (Vj))
Tariana: (Vowel (i), Diphthong (Vi)) != Semivowel (j)
Decisions taken in natural languages seem to be a mix of arbitrary choice and for analytical convenience. Possible criteria:
Vowel vs Semivowel
1. If i/j, u/w ... are in complementary distribution, then treat as a single phoneme. Normally written using the vowel letters, e.g. <i>, <u>, but in some languages with wide-spread vowel epenthesis or syllabic consonants it seems to go the other way. PIE often seems to get described in this way, with the high vowels being not quite fully paid up vowels because they show the same distribution as the syllabic consonants.
2. If syllabification is not predictable in some circumstances, then treat as two phonemes. E.g. if both ju and iw are possible then a sequence <iu> might be ambiguous in some contexts. May be ignored if the unpredictable contexts are very limited, which I get the impression is quite common in the wild.
Diphthongs
3. If there are strong collocational restrictions (e.g. ai but no ei, oi, ..., or au but no eu) then treat as diphthongs not as V + semivowel.
4. If part seems to occupy an onset or coda "slot" to the exclusion of other consonants, then treat as V + semivowel. For example, in a language with no onset clusters, if ia / ja cannot take an (additional) onset then analyse as ja.
5. Conversely, if part of a possible diphthong can satisfy a phonological or morphological requirement for an onset, then treat as V + semivowel. E.g. if an epenthetic consonant is inserted between adjacent vowels created by affixation, but no epenthetic consonant is inserted in the context of a possible diphthong.
I've been struggling with this a bit for Sint. On the one hand, there are restrictions on which combinations occur. There are four vowels, but only 3 "diphthong" combinations not the 4x3=12 you might expect. Furthermore, the "diphthongs" take exactly the same possible onsets and codas as any other syllable nucleus. However, the final element of the diphthong can become an onset in certain morphological contexts, e.g.:
tsajanaa
tsai-a-naa
BACK-P-go
"she came back"
A description of the morphology and spelling of morphologically complex forms would be simplified by a V + semivowel analysis, but this would slightly complicate the description of possible syllable structure(s).
How have others resolved this problem? Has anyone else found the intermediate / ambiguous status of semivowels an orthographic issue?
I think there are three basic common orthographic combinations, plus some hybrid patterns and non-phonemic orthographies. Spanish, for example, show both patterns sometimes, with <y> and <i> representing basically the same sound in different contexts, and English uses a famously complex system for orthographically representing vowel qualities.
Romanian: Vowel (i) = Semivowel (i) = Diphthong (Vi)
Quechua: Vowel (i) != (Semivowel (j), Combination (Vj))
Tariana: (Vowel (i), Diphthong (Vi)) != Semivowel (j)
Decisions taken in natural languages seem to be a mix of arbitrary choice and for analytical convenience. Possible criteria:
Vowel vs Semivowel
1. If i/j, u/w ... are in complementary distribution, then treat as a single phoneme. Normally written using the vowel letters, e.g. <i>, <u>, but in some languages with wide-spread vowel epenthesis or syllabic consonants it seems to go the other way. PIE often seems to get described in this way, with the high vowels being not quite fully paid up vowels because they show the same distribution as the syllabic consonants.
2. If syllabification is not predictable in some circumstances, then treat as two phonemes. E.g. if both ju and iw are possible then a sequence <iu> might be ambiguous in some contexts. May be ignored if the unpredictable contexts are very limited, which I get the impression is quite common in the wild.
Diphthongs
3. If there are strong collocational restrictions (e.g. ai but no ei, oi, ..., or au but no eu) then treat as diphthongs not as V + semivowel.
4. If part seems to occupy an onset or coda "slot" to the exclusion of other consonants, then treat as V + semivowel. For example, in a language with no onset clusters, if ia / ja cannot take an (additional) onset then analyse as ja.
5. Conversely, if part of a possible diphthong can satisfy a phonological or morphological requirement for an onset, then treat as V + semivowel. E.g. if an epenthetic consonant is inserted between adjacent vowels created by affixation, but no epenthetic consonant is inserted in the context of a possible diphthong.
I've been struggling with this a bit for Sint. On the one hand, there are restrictions on which combinations occur. There are four vowels, but only 3 "diphthong" combinations not the 4x3=12 you might expect. Furthermore, the "diphthongs" take exactly the same possible onsets and codas as any other syllable nucleus. However, the final element of the diphthong can become an onset in certain morphological contexts, e.g.:
tsajanaa
tsai-a-naa
BACK-P-go
"she came back"
A description of the morphology and spelling of morphologically complex forms would be simplified by a V + semivowel analysis, but this would slightly complicate the description of possible syllable structure(s).
How have others resolved this problem? Has anyone else found the intermediate / ambiguous status of semivowels an orthographic issue?