Re: Morphological complexity
Posted: Fri Jun 05, 2020 6:07 am
That’s pretty much what my definition of ‘complexity’ was trying to capture. The ‘lowest calculated quantum number’ in the definition would be the bare minimal number of quanta the sentence could be construed to have; similarly for the ‘highest calculated quantum number’. The difference between them should count the number of things which have mandatory marking. On the other hand, this definition isn’t too good at accounting for stuff like irregularity and slang, which is why I mentioned that it could be better.Moose-tache wrote: ↑Fri Jun 05, 2020 4:33 amI think this is reasonable, but we can use cross-linguistic analysis to help us define our "minimal" number of quanta, as in my examples below. This whole discussion started as a question of whether it's actually more complex to learn Cherokee than Spanish, or if it merely feels that way. So if we compare Spanish, Cherokee, and many other languages, in large passages, we should be able to quantify how many units of information must be included in Spanish or Cherokee, above the number of pieces of information that are common to all or nearly all control languages. These quanta of information would include mandatory morphological marking, marked syntactic structures, suppletion, slang, and anything else that would require its own discrete lexical construction in the mind of the speaker in order to be used correctly. …Definition. The complexity of a language on a given sentence (let’s denote it as χ) is defined by finding each proposition which could be associated with that sentence, getting the quantum number for each such proposition, and then taking the difference between the highest and lowest calculated quantum numbers.
Alright, let’s test my definition above on these:Let's take three examples:
Korean: gwisin i mwusewe [ghost subj frightening] (This is the most natural translation of "I am afraid of ghosts," and does not necessarily mean "Ghosts are scary in general.")
Choctaw: shilop i~ mahlatalih [ghost 3rd-afraid-1st]
Mandarin: wo3 pa4 gui3 [1st afraid ghost]
Both Korean and Choctaw have obligatory tense, seen here more or less as null suffixes, but this is absent in Mandarin (though of course all languages have the option to elaborate on time). Mandarin and Choctaw require the first person be overtly marked, but Korean lets it be implied (the assumption that emotion verbs with no overt subject marking refer to the speaker is common cross linguistically, and even shows up in English). Korean requires that “ghost” take argument marking, while in Choctaw it only needs a third person agreement prefix on the verb, and in Mandarin it shows up with no overt marking beyond its syntactic location to the right of the verb. In all three of these languages the only commonality is that “ghost” and “scared” appear, along with whatever syntactic alignment is the default. (Once again I am treating word-choice as one quantum. In a language with fifty words for fear, each with its own default syntactic alignment, we may need to unpack the semantic aspect of complexity further. But for now I'm focusing on morpho-syntax).
So does “scared of ghosts” inherently require person marking for the subject? Does it inherently require overt tense? Does it inherently require an adposition to specify the syntactic role of “ghost?” I would suggest that, at a fundamental level, it does not. The upshot of this is that person marking, argument marking, and tense all get thrown into one of those two buckets marked “redundancy” and “irregularity.” Also important: note that the three examples each have relatively similar amounts of redundancy/irregularity, despite having radically different amounts of “morphology” in most typological analyses.
χmin | χmax | χ | |
Korean | 2 | 3 | 2 |
Choctaw | 3 | 4 | 1 |
Korean | 3 | 4 | 1 |
That seems pretty odd to me as well.
Indeed, that worries me as well. If you use my definitions, the ‘quantum number’ of the proposition SCARES(ghosts, me) is ҁ=3 — but the ‘minimal complexity’ of the Korean sentence is χ=2. It seems like a reasonable constraint to require χ≥ҁ, but that assumption seems to be breaking here. So either Moose-tache’s Korean sentence isn’t actually a translation, or something is very wrong here.Maybe you're thinking that the Korean sentence doesn't require it. But that just makes me think about null marking. Take the Mandarin phrase, for instance: it has no overt aspect marker, but it contrasts with sentences that do have it (corresponding to "I used to be afraid of ghosts", "I'm afraid of ghosts so...", "I have been afraid of ghosts", etc). I'm comfortable with saying that the Mandarin sentence doesn't mark tense, but I think it does signal aspect.
I was thinking along the same lines (note my use of ‘proposition’ earlier). Possibly another way to define the quantum number ҁ of a proposition is to simply count the number of identifiers involved in a propositional-calculus representation of that proposition — in which case the ‘quanta’ in question simply refer to the identifiers used in the expression.… You could even say that the "fundamental level" is something like SCARES(ghosts, me). Such approaches can run into problems because the proposed primitives often aren't. But the ones in your example seem fine to me.