Katapharteo: an engelang with only one type of syntactic relation and roots that conjugate in base four
Posted: Wed Feb 10, 2021 3:45 pm
I've had this idea for a non-naturalistic, engineered language floating around in my head for a while, and I recently thought it was time to finally hammer out some details. I've never really attempted an engelang before, and I don't see too many of them posted around here, so this should be a fun experiment. I think working on something that is explicitly, intentionally, wildly un-naturalistic will be a nice change of pace from my usual conlanging workflow. Thus, I give you Katapharteo: a very strange language. This thread will a be a scratchpad of sorts, as the details are not all completed, so expect updates as well (I mean, if I actually get around to them...).
Idea & Design Goals
I've had a four main goals in mind when designing Katapharteo. The first, and most fundamental, is to create a language that has only one type of syntactic relation. That is to say, if you were to draw a syntax tree for a Katapharteo sentence, you wouldn't need to make use of VPs, NPs, PPs and so on, you'd only need a single phrase type: XP. As a corollary, there are no distinct parts of speech in Katapharteo. I'm going to be calling this syntactic relation "adjunction", because I think that dependents in Katapharteo are best understood as analogous to adjuncts in natural language.
The second design goal is to create something that is, in theory, unambiguously parsable by a computer. This is pretty self-explanatory, so I won't say much more about it. The third goal was to create something that, unlike a lot of engelangs I've seen, is actually phonologically interesting, and uses that phonology in interesting ways as part of its grammar. I ultimately settled on what basically amounts to, uh, using consonant gradation to inflect words for numerals in base four, which... well, you'll see what I mean when I start explaining the grammar.
The final design goal was to not worry to much about design goals, or coherence or anything like that, and just kinda go wild. Throw in every half-interesting feature that I'd never dream of putting in one of my ordinary, naturalistic conlangs and just see what happens. I guess in some sense, I'd just like to buck the trend of engelangs designed for "elegance" or """logic""" or whatever, and make something completely alien just for the sake of it.
Hopefully I've accomplished all of those to a reasonable degree, but you'll have to be the judge of that.
Phonology
First, we have to talk about phonology. There are three classes of phonemes in Katapharteo: initials, pre-initials, and vowels. Syllables have the structure (P)IV, where I is an initial and P is a pre-initial. All initials and pre-initials are phonetically consonants, but that's not so much a relevant phonological class here. The initials are all stops, laid out in a regular 4x4 grid. These are the most important sounds in the language, and they undergo gradation to indicate all grammatical relations between words. The pre-initials are basically just there to bump up the number of possible syllables, with the exception of /l/, which has a very special function.
Initials:
/p t tʃ k/ <p t c k>
/pʰ tʰ tʃʰ kʰ/ <ph th ch kh>
/p' t' tʃ' k'/ <p' t' c' k'>
/b d dʒ g/ <b d j g>
Pre-initials:
/N s x r l/ <n s x r l>
/N/ is a nasal unspecified for place, which assimilates to the following stop. /s x/ assimilate in voicing to a following stop
Vowels:
/i ɨ u/ <i eu u>
/e ə o/ <e eo o>
/ɛ ɔ/ <ea oa>
/æ ɑ/ <ae a>
+length, nasalization, and high/low tones <VV, Vm, V́/V>
Roots & Root Structure
The notion of a "word" is less important for the syntax of Katapharteo than that of a root. Roots are strictly bisyllabic, and underlyingly all have the structure (P)SV(P)SV, where P is a pre-initial other than /l/, and S is one of /p t tʃ k/. The initials in a root undergo gradation to indicate grammatical relations, and thus, on the surface, all initials can occur in a root. There are 2560 total possible roots, of which I intend 2000 or so to actually be used. More complex lexemes are formed out of multi-root constructions, which I'll be calling "compounds" even though they really aren't. More on that in the syntax section.
Morphophonology
Ok, now we're getting to the interesting bits. Roots in Katapharteo inflect for a single category, which I'll call "level". In essence, "level" is just an integer expressed in base four. The digits 0, 1, 2, and 3 are denoted by the phonation of the initials in a root: a plain stop denotes 0, an aspirate denotes 1, an ejective denotes 2, and a voiced stop denotes 3. A bisyllabic root, containing two initials, thus has the potential to express any two-digit base four number, or any number from 0-15. A root's "level" is simply the number encoded in its initials. An important note is that these numbers are expressed with the least significant digit first. That is, the first initial encodes the one's place, and the second initials encodes the four's place.
For example, take the root kata - "language, speech, word". In its unmarked form, it is in level zero: its initials correspond to the base four digits 00. Inflecting it for level 1 corresponds to aspirating the first stop in the word: khata. Inflecting it into level two would look like ejectivizing the first stop: k'ata. Level five would require aspirating both stops: khatha.
If you ever need to inflect a root beyond level 15, the solution is simple: extend the root via the suffix -lta. This suffix can be added to a root as many times as is necessary, in order to make the root long enough to express an arbitrary integer level. This suffix is also the sole occurrence of the pre-initial /l/, the intent of which is to make root extensions unambiguously identifiable.
I'll be glossing level using quaternary numbers in the same order as the digits are encoded. Thus, kata would be glossed "language.00", khata would be "language.10", and so on.
Syntax
As mentioned, there is only one syntactic relation in Katapharteo, which I'm calling "adjunction". This is a relation between roots, and on a semantic level, a root which is adjunct to another root should be taken to be modifying it in some way, or otherwise contributing extra information. Context obviously plays a huge role here, as what precisely that extra information means is dependent on the root in question. However, this is par for the course in natural languages as well (think about how general a notion of "modification" is encoded by the genitive particle in Japanese, for example), so I don't think it needs too much justification.
Adjunction is indicated by level. In particular: a root of level n is adjunct to the closest root of level (n-1) to its left. This might be a bit counter-intuitive, so hopefully an example will clarify. Take the following several sentences:
kata xthóamske
language.00 difficult(y).10
"(a) difficult language"
Here, the root xthóamske is in level one, and thus modifies kata, the most recent word of level zero.
kata xthóamske t'urpo cheanteoo
language.00 difficult(y).10 very.20 strange.10
"(a) very difficult and strange language"
Here, t'urpo is in level two, modifying xthóamske, while cheanteoo is in level one, meaning that it again modifies the previous level zero root, kata.
Ok, so this is all well and good for things that are already adjuncts in most natural languages, but how does this handle verbs? Especially transitive verbs? Well, the answer is that, in general, "verb phrases" (again, really just general XPs with a semantically "verb-like" head root) are adjunct to their subject, and objects are adjunct to "verbs". Take the following example:
tenta sthiiki k'apo
man.00 see/sight.10 dog.20
"(a) man sees (a) dog"
This might be best translated in a very literal way as "man with sight of dog", or some such. Notice that this means that relativizing on subjects is identical to simply describing a subject with multiple adjuncts, i.e
tenta sthiiki k'apo thapo st'opa
man.00 see/sight.10 dog.20 directed_movement.10 store.20
"(a) man sees (a) dog and goes to (the) store" or "the man who sees the dog goes to the store" or "the man who goes to the store sees a dog"
Something like "man with sight of dog and movement to store ".
When there are multiple semantic roles involved, these are communicated through something akin to converbs: "verb-like" roots that modify the main "verb" and add extra information. For example:
stété phunkee nk'apá thaku rt'exkiduke
exchange.00 give.10 money.20 receive.10 banana.20
"to buy a banana"
teke sthété p'unkee ngapá t'aku rdexkitukhe
1sg.00 exchange.10 give.20 money.30 receive.20 banana.30
"I buy a banana"
One final thing to not is about multi-root lexemes. Look at the word rtexkituke "banana". It's actually just two roots, rtexki - "fruit" and tuke - yellow, which stand in adjunction to one another. That's why the internal structure of the "word" shifts from sentence to sentence: its constituent roots have to take on different levels. The fact that it's written without a space simply indicates that it's a conventionalized lexical item.
Anyway, that's all I got for now. Hope this is something a bit new and refreshing for everyone, as making it was for me.
Idea & Design Goals
I've had a four main goals in mind when designing Katapharteo. The first, and most fundamental, is to create a language that has only one type of syntactic relation. That is to say, if you were to draw a syntax tree for a Katapharteo sentence, you wouldn't need to make use of VPs, NPs, PPs and so on, you'd only need a single phrase type: XP. As a corollary, there are no distinct parts of speech in Katapharteo. I'm going to be calling this syntactic relation "adjunction", because I think that dependents in Katapharteo are best understood as analogous to adjuncts in natural language.
The second design goal is to create something that is, in theory, unambiguously parsable by a computer. This is pretty self-explanatory, so I won't say much more about it. The third goal was to create something that, unlike a lot of engelangs I've seen, is actually phonologically interesting, and uses that phonology in interesting ways as part of its grammar. I ultimately settled on what basically amounts to, uh, using consonant gradation to inflect words for numerals in base four, which... well, you'll see what I mean when I start explaining the grammar.
The final design goal was to not worry to much about design goals, or coherence or anything like that, and just kinda go wild. Throw in every half-interesting feature that I'd never dream of putting in one of my ordinary, naturalistic conlangs and just see what happens. I guess in some sense, I'd just like to buck the trend of engelangs designed for "elegance" or """logic""" or whatever, and make something completely alien just for the sake of it.
Hopefully I've accomplished all of those to a reasonable degree, but you'll have to be the judge of that.
Phonology
First, we have to talk about phonology. There are three classes of phonemes in Katapharteo: initials, pre-initials, and vowels. Syllables have the structure (P)IV, where I is an initial and P is a pre-initial. All initials and pre-initials are phonetically consonants, but that's not so much a relevant phonological class here. The initials are all stops, laid out in a regular 4x4 grid. These are the most important sounds in the language, and they undergo gradation to indicate all grammatical relations between words. The pre-initials are basically just there to bump up the number of possible syllables, with the exception of /l/, which has a very special function.
Initials:
/p t tʃ k/ <p t c k>
/pʰ tʰ tʃʰ kʰ/ <ph th ch kh>
/p' t' tʃ' k'/ <p' t' c' k'>
/b d dʒ g/ <b d j g>
Pre-initials:
/N s x r l/ <n s x r l>
/N/ is a nasal unspecified for place, which assimilates to the following stop. /s x/ assimilate in voicing to a following stop
Vowels:
/i ɨ u/ <i eu u>
/e ə o/ <e eo o>
/ɛ ɔ/ <ea oa>
/æ ɑ/ <ae a>
+length, nasalization, and high/low tones <VV, Vm, V́/V>
Roots & Root Structure
The notion of a "word" is less important for the syntax of Katapharteo than that of a root. Roots are strictly bisyllabic, and underlyingly all have the structure (P)SV(P)SV, where P is a pre-initial other than /l/, and S is one of /p t tʃ k/. The initials in a root undergo gradation to indicate grammatical relations, and thus, on the surface, all initials can occur in a root. There are 2560 total possible roots, of which I intend 2000 or so to actually be used. More complex lexemes are formed out of multi-root constructions, which I'll be calling "compounds" even though they really aren't. More on that in the syntax section.
Morphophonology
Ok, now we're getting to the interesting bits. Roots in Katapharteo inflect for a single category, which I'll call "level". In essence, "level" is just an integer expressed in base four. The digits 0, 1, 2, and 3 are denoted by the phonation of the initials in a root: a plain stop denotes 0, an aspirate denotes 1, an ejective denotes 2, and a voiced stop denotes 3. A bisyllabic root, containing two initials, thus has the potential to express any two-digit base four number, or any number from 0-15. A root's "level" is simply the number encoded in its initials. An important note is that these numbers are expressed with the least significant digit first. That is, the first initial encodes the one's place, and the second initials encodes the four's place.
For example, take the root kata - "language, speech, word". In its unmarked form, it is in level zero: its initials correspond to the base four digits 00. Inflecting it for level 1 corresponds to aspirating the first stop in the word: khata. Inflecting it into level two would look like ejectivizing the first stop: k'ata. Level five would require aspirating both stops: khatha.
If you ever need to inflect a root beyond level 15, the solution is simple: extend the root via the suffix -lta. This suffix can be added to a root as many times as is necessary, in order to make the root long enough to express an arbitrary integer level. This suffix is also the sole occurrence of the pre-initial /l/, the intent of which is to make root extensions unambiguously identifiable.
I'll be glossing level using quaternary numbers in the same order as the digits are encoded. Thus, kata would be glossed "language.00", khata would be "language.10", and so on.
Syntax
As mentioned, there is only one syntactic relation in Katapharteo, which I'm calling "adjunction". This is a relation between roots, and on a semantic level, a root which is adjunct to another root should be taken to be modifying it in some way, or otherwise contributing extra information. Context obviously plays a huge role here, as what precisely that extra information means is dependent on the root in question. However, this is par for the course in natural languages as well (think about how general a notion of "modification" is encoded by the genitive particle in Japanese, for example), so I don't think it needs too much justification.
Adjunction is indicated by level. In particular: a root of level n is adjunct to the closest root of level (n-1) to its left. This might be a bit counter-intuitive, so hopefully an example will clarify. Take the following several sentences:
kata xthóamske
language.00 difficult(y).10
"(a) difficult language"
Here, the root xthóamske is in level one, and thus modifies kata, the most recent word of level zero.
kata xthóamske t'urpo cheanteoo
language.00 difficult(y).10 very.20 strange.10
"(a) very difficult and strange language"
Here, t'urpo is in level two, modifying xthóamske, while cheanteoo is in level one, meaning that it again modifies the previous level zero root, kata.
Ok, so this is all well and good for things that are already adjuncts in most natural languages, but how does this handle verbs? Especially transitive verbs? Well, the answer is that, in general, "verb phrases" (again, really just general XPs with a semantically "verb-like" head root) are adjunct to their subject, and objects are adjunct to "verbs". Take the following example:
tenta sthiiki k'apo
man.00 see/sight.10 dog.20
"(a) man sees (a) dog"
This might be best translated in a very literal way as "man with sight of dog", or some such. Notice that this means that relativizing on subjects is identical to simply describing a subject with multiple adjuncts, i.e
tenta sthiiki k'apo thapo st'opa
man.00 see/sight.10 dog.20 directed_movement.10 store.20
"(a) man sees (a) dog and goes to (the) store" or "the man who sees the dog goes to the store" or "the man who goes to the store sees a dog"
Something like "man with sight of dog and movement to store ".
When there are multiple semantic roles involved, these are communicated through something akin to converbs: "verb-like" roots that modify the main "verb" and add extra information. For example:
stété phunkee nk'apá thaku rt'exkiduke
exchange.00 give.10 money.20 receive.10 banana.20
"to buy a banana"
teke sthété p'unkee ngapá t'aku rdexkitukhe
1sg.00 exchange.10 give.20 money.30 receive.20 banana.30
"I buy a banana"
One final thing to not is about multi-root lexemes. Look at the word rtexkituke "banana". It's actually just two roots, rtexki - "fruit" and tuke - yellow, which stand in adjunction to one another. That's why the internal structure of the "word" shifts from sentence to sentence: its constituent roots have to take on different levels. The fact that it's written without a space simply indicates that it's a conventionalized lexical item.
Anyway, that's all I got for now. Hope this is something a bit new and refreshing for everyone, as making it was for me.