Linguistic Miscellany Thread

Natural languages and linguistics
Richard W
Posts: 1736
Joined: Sat Aug 11, 2018 12:53 pm

Re: Linguistic Miscellany Thread

Post by Richard W »

jal wrote: Fri Nov 14, 2025 11:22 am Well, they're a special type of construction, where both the subject and subject complement are clauses, and their order is reversed.
To us English speakers, they're the natural order, and we wouldn't reverse the identification. Indeed, as the first phrase is more specific, I would say that it was the subject. They also follow the principle of having lengthy parts at the end of the sentence.
User avatar
jal
Posts: 1292
Joined: Fri Aug 31, 2018 3:13 pm

Re: Linguistic Miscellany Thread

Post by jal »

Richard W wrote: Fri Nov 14, 2025 12:33 pmTo us English speakers, they're the natural order, and we wouldn't reverse the identification. Indeed, as the first phrase is more specific, I would say that it was the subject. They also follow the principle of having lengthy parts at the end of the sentence.
I know how it works in English (and maybe you're right with regards to the syntactic subject), but that wasn't my question. This isn't the English thread. I was solliciting knowledge about languages other than English.


JAL
User avatar
malloc
Posts: 1424
Joined: Mon Jul 09, 2018 8:42 pm
Location: The Evil Empire

Re: Linguistic Miscellany Thread

Post by malloc »

Does anyone else feel like so many letters of the Hebrew alphabet look remarkably similar to the point of confusion? Consider pairs like bet and kaf, dalet and resh, or he and tav, to say nothing of the word-final variants. Along with the well-known absence of vowels in Hebrew writing, this seems like it would make reading Hebrew (or other languages written in this alphabet) rather challenging.
Richard W
Posts: 1736
Joined: Sat Aug 11, 2018 12:53 pm

Re: Linguistic Miscellany Thread

Post by Richard W »

malloc wrote: Fri Nov 14, 2025 10:02 pm Does anyone else feel like so many letters of the Hebrew alphabet look remarkably similar to the point of confusion? Consider pairs like bet and kaf, dalet and resh, or he and tav, to say nothing of the word-final variants. Along with the well-known absence of vowels in Hebrew writing, this seems like it would make reading Hebrew (or other languages written in this alphabet) rather challenging.
It's a well-known problem of the main family of Semitic alphabets. Later alphabets permanently merged phonetically distinct letters. I think the worst offender is the writing system for Avestan. The standard Mongolian writing system, which has become alphabetic, does distinguish the letters, but one has to keep track of the context.

Aramaic distinguishes I think 3 consonants in speech but not writing - at some point the 3 pairs changed, so there was clearly an unwritten distinction.

Arabic suffered from a loss of distinction, as seen in the main Kufic style. The dots that nowadays distinguish many letters are a later addition, not to be confused with the vowel marks.
bradrn
Posts: 7503
Joined: Fri Oct 19, 2018 1:25 am

Re: Linguistic Miscellany Thread

Post by bradrn »

malloc wrote: Fri Nov 14, 2025 10:02 pm Does anyone else feel like so many letters of the Hebrew alphabet look remarkably similar to the point of confusion? Consider pairs like bet and kaf, dalet and resh, or he and tav, to say nothing of the word-final variants. Along with the well-known absence of vowels in Hebrew writing, this seems like it would make reading Hebrew (or other languages written in this alphabet) rather challenging.
It does get confusing sometimes, but from the perspective of someone who needs to read Hebrew fairly regularly, I’d say that it doesn’t seem to be a huge problem in practice. It depends on typeface too: for instance, that used by Sefaria is probably less liable to misreading than that used by Chabad, although both are still very easy to read for me. There’s a more traditional typeface which is worse than both, but I can’t seem to find any images of it… [EDIT: here]

Another thing to note is that handwritten Hebrew (of the sort erroneously referred to as ‘cursive’) is much less ambiguous. Of the letter pairs you mention, bet gets an extra curve, dalet gets a loop, and he is (at least for some people) altered to two concentric arcs.

(Also, I’m surprised you mention he/tav, and not he/chet — the latter pair are far more similar to each other than the former.)
Richard W wrote: Sat Nov 15, 2025 1:08 am
malloc wrote: Fri Nov 14, 2025 10:02 pm Does anyone else feel like so many letters of the Hebrew alphabet look remarkably similar to the point of confusion? Consider pairs like bet and kaf, dalet and resh, or he and tav, to say nothing of the word-final variants. Along with the well-known absence of vowels in Hebrew writing, this seems like it would make reading Hebrew (or other languages written in this alphabet) rather challenging.
It's a well-known problem of the main family of Semitic alphabets. Later alphabets permanently merged phonetically distinct letters. I think the worst offender is the writing system for Avestan. The standard Mongolian writing system, which has become alphabetic, does distinguish the letters, but one has to keep track of the context.
I’d say the worst offender is not Avestan, but its parent Pahlavi, which famously merged 24 phonemes into 13 letters, essentially becoming a logography.
Conlangs: Scratchpad | Texts | antilanguage
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices

(Why does phpBB not let me add >5 links here?)
User avatar
malloc
Posts: 1424
Joined: Mon Jul 09, 2018 8:42 pm
Location: The Evil Empire

Re: Linguistic Miscellany Thread

Post by malloc »

bradrn wrote: Sat Nov 15, 2025 4:22 amI’d say the worst offender is not Avestan, but its parent Pahlavi, which famously merged 24 phonemes into 13 letters, essentially becoming a logography.
Now that's mind-boggling.

Zompist once advised keeping letters in your conscript maximally distinct and observed that the elves in LotR must have horrible levels of dyslexia given how similar tengwar letters look. I often get the same impression from Hebrew and Arabic.
User avatar
Raphael
Posts: 6958
Joined: Sun Jul 22, 2018 6:36 am

Re: Linguistic Miscellany Thread

Post by Raphael »

malloc wrote: Sat Nov 15, 2025 7:59 am
bradrn wrote: Sat Nov 15, 2025 4:22 amI’d say the worst offender is not Avestan, but its parent Pahlavi, which famously merged 24 phonemes into 13 letters, essentially becoming a logography.
Now that's mind-boggling.

Zompist once advised keeping letters in your conscript maximally distinct and observed that the elves in LotR must have horrible levels of dyslexia given how similar tengwar letters look. I often get the same impression from Hebrew and Arabic.
OK, but how does the Latin alphabet look like to people who are used to Arabic or Hebrew?
Lērisama
Posts: 746
Joined: Fri Oct 18, 2024 9:51 am
Location: Kernow Voy

Re: Linguistic Miscellany Thread

Post by Lērisama »

malloc wrote: Sat Nov 15, 2025 7:59 am
bradrn wrote: Sat Nov 15, 2025 4:22 amI’d say the worst offender is not Avestan, but its parent Pahlavi, which famously merged 24 phonemes into 13 letters, essentially becoming a logography.
Now that's mind-boggling.

Zompist once advised keeping letters in your conscript maximally distinct and observed that the elves in LotR must have horrible levels of dyslexia given how similar tengwar letters look. I often get the same impression from Hebrew and Arabic.
I think it's more about unfamiliarity than similarity, but you learned the Latin script too long ago to remember that feeling with it. I don't think there's any script where all the letters look distinct to someone illiterate in it.

Edit: what Raphael said
LZ – Lēri Ziwi
PS – Proto Sāzlakuic (ancestor of LZ)
PRk – Proto Rākēwuic
XI – Xú Iạlan
VN – verbal noun
SUP – supine
DIRECT – verbal directional
My language stuff
User avatar
Raphael
Posts: 6958
Joined: Sun Jul 22, 2018 6:36 am

Re: Linguistic Miscellany Thread

Post by Raphael »

Lērisama wrote: Sat Nov 15, 2025 8:39 am I don't think there's any script where all the letters look distinct to someone illiterate in it.
Perhaps not in real life, but that might make for an interesting conscripting challenge...
zompist
Site Admin
Posts: 4007
Joined: Sun Jul 08, 2018 5:46 am
Location: Right here, probably
Contact:

Re: Linguistic Miscellany Thread

Post by zompist »

Lērisama wrote: Sat Nov 15, 2025 8:39 am I don't think there's any script where all the letters look distinct to someone illiterate in it.
I don't know about that— the Greek alphabet has pretty distinct letterforms. So far as I can see it doesn't even have mirrored letterforms, cf. b/d and p/q. Nor can any majuscules be confused with minuscules, cf. capital I / little l.
bradrn
Posts: 7503
Joined: Fri Oct 19, 2018 1:25 am

Re: Linguistic Miscellany Thread

Post by bradrn »

zompist wrote: Sat Nov 15, 2025 3:56 pm
Lērisama wrote: Sat Nov 15, 2025 8:39 am I don't think there's any script where all the letters look distinct to someone illiterate in it.
I don't know about that— the Greek alphabet has pretty distinct letterforms. So far as I can see it doesn't even have mirrored letterforms, cf. b/d and p/q. Nor can any majuscules be confused with minuscules, cf. capital I / little l.
What about nu/ypsilon, zeta/xi, or in some styles phi/psi?

EDIT: also sigma/omicron
Conlangs: Scratchpad | Texts | antilanguage
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices

(Why does phpBB not let me add >5 links here?)
Civil War Bugle
Posts: 118
Joined: Sat Jul 14, 2018 6:57 pm

Re: Linguistic Miscellany Thread

Post by Civil War Bugle »

I took several courses in Arabic in college (longer ago than I want it to be) and took one semester of Hebrew. I was one of two non-Jewish people in the Hebrew class, and I think the professor designed the syllabus on the assumption that everyone in the class had learned the alphabet at some point before attending college, e.g. while preparing for bar mitzvahs or what have you. So the course sped through that aspect, and I spent a week or so trying to catch up on the reading aspect, and doing a certain amount of guesswork based on cognates, but once I did catch up, it seemed fine from the perspective of a literate person.

On the other hand, having never applied what I learned in that class since graduation, my memory of the Hebrew alphabet has atrophied a bit (not entirely but...) while I still can read and produce the entirety of the Arabic character set as used in Arabic.
zompist
Site Admin
Posts: 4007
Joined: Sun Jul 08, 2018 5:46 am
Location: Right here, probably
Contact:

Re: Linguistic Miscellany Thread

Post by zompist »

bradrn wrote: Sat Nov 15, 2025 4:10 pm
zompist wrote: Sat Nov 15, 2025 3:56 pm I don't know about that— the Greek alphabet has pretty distinct letterforms. So far as I can see it doesn't even have mirrored letterforms, cf. b/d and p/q. Nor can any majuscules be confused with minuscules, cf. capital I / little l.
What about nu/ypsilon, zeta/xi, or in some styles phi/psi?
EDIT: also sigma/omicron
I don't see those as very similar. It's not hard to distinguish pointy from round (ν/υ) or open from closed (ψ/φ). I'll grant that ο/σ are close enough to require some learning. I'd also point out that these similarities apply only to the minuscules— as we've seen, long centuries often make letterforms approach each other.

I find ח/ת or ע/ﬠ or דר or כ/ב much more confusing. On the other hand, modern fonts help a lot. E.g. when כ is circular it's easily recognized.
bradrn
Posts: 7503
Joined: Fri Oct 19, 2018 1:25 am

Re: Linguistic Miscellany Thread

Post by bradrn »

zompist wrote: Sat Nov 15, 2025 5:47 pm I find ח/ת or ע/ﬠ or דר or כ/ב much more confusing. On the other hand, modern fonts help a lot. E.g. when כ is circular it's easily recognized.
Firstly, ע/ﬠ are not even separate letters in the first place. In fact I didn’t even know that that ‘alternative ayin’ existed at all — looking at the Unicode standard, it seems to be intended for use with diacritics below.

As for the rest, I’ve never had a problem with ת/ח, which look pretty different to me. Far more confusing is ח/ה, which in some traditional typefaces can look nearly identical. ר/ד and כ/ב can be somewhat confusing, but I feel that typeface design largely manages to keep them separate.

Another one you didn’t mention is ס/ם. Most modern typefaces are good at keeping them separate, but traditionally they could look similar enough that I have gotten confused, though it’s ameliorated by the fact that ם is restricted in distribution.
Conlangs: Scratchpad | Texts | antilanguage
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices

(Why does phpBB not let me add >5 links here?)
zompist
Site Admin
Posts: 4007
Joined: Sun Jul 08, 2018 5:46 am
Location: Right here, probably
Contact:

Re: Linguistic Miscellany Thread

Post by zompist »

bradrn wrote: Sun Nov 16, 2025 6:19 am
zompist wrote: Sat Nov 15, 2025 5:47 pm I find ח/ת or ע/ﬠ or דר or כ/ב much more confusing. On the other hand, modern fonts help a lot. E.g. when כ is circular it's easily recognized.
Firstly, ע/ﬠ are not even separate letters in the first place. In fact I didn’t even know that that ‘alternative ayin’ existed at all — looking at the Unicode standard, it seems to be intended for use with diacritics below.
Got the wrong letters-- I meant ע/צ.
As for the rest, I’ve never had a problem with ת/ח, which look pretty different to me. Far more confusing is ח/ה, which in some traditional typefaces can look nearly identical. ר/ד and כ/ב can be somewhat confusing, but I feel that typeface design largely manages to keep them separate.

Another one you didn’t mention is ס/ם. Most modern typefaces are good at keeping them separate, but traditionally they could look similar enough that I have gotten confused, though it’s ameliorated by the fact that ם is restricted in distribution.
A modern font can help a lot! I blame calligraphers... they seem to like letters to look alike. Blackletter (for the Roman alphabet) is just as bad.
bradrn
Posts: 7503
Joined: Fri Oct 19, 2018 1:25 am

Re: Linguistic Miscellany Thread

Post by bradrn »

zompist wrote: Sun Nov 16, 2025 7:02 am A modern font can help a lot! I blame calligraphers... they seem to like letters to look alike. Blackletter (for the Roman alphabet) is just as bad.
The difficulty is that homogeneous scripts tend to be easier to write. As annoying as blackletter is to read, it’s marvellously efficient in both space and time when written with the appropriate pen.

(I’ll also note that ‘blackletter’ is a rather broad term. Textura is significantly more homogeneous than Fraktur.)
Conlangs: Scratchpad | Texts | antilanguage
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices

(Why does phpBB not let me add >5 links here?)
User avatar
Man in Space
Posts: 2434
Joined: Sat Jul 21, 2018 1:05 am

Re: Linguistic Miscellany Thread

Post by Man in Space »

I find the more traditional type is easier to read than modern sans-serif fonts or handwriting—the newer stuff omits the things that help me distinguish certain letters by stroke (and in some cases, most notably aleph, the newer style looks completely unintuitive—I tend to write aleph using three strokes, to wit one small right-to-left diagonal, a large left-to-right diagonal, and then a third right-to-left, not an arc and vertical line).

Arabic (well at least naskhi) gives me similar problems.
bradrn
Posts: 7503
Joined: Fri Oct 19, 2018 1:25 am

Re: Linguistic Miscellany Thread

Post by bradrn »

Man in Space wrote: Sun Nov 16, 2025 4:04 pm I find the more traditional type is easier to read than modern sans-serif fonts or handwriting—the newer stuff omits the things that help me distinguish certain letters by stroke (and in some cases, most notably aleph, the newer style looks completely unintuitive—I tend to write aleph using three strokes, to wit one small right-to-left diagonal, a large left-to-right diagonal, and then a third right-to-left, not an arc and vertical line).
The thing is, printed Hebrew and modern Hebrew are essentially two different scripts now. I remember learning them separately in primary school as such.
Conlangs: Scratchpad | Texts | antilanguage
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices

(Why does phpBB not let me add >5 links here?)
Richard W
Posts: 1736
Joined: Sat Aug 11, 2018 12:53 pm

Re: Linguistic Miscellany Thread

Post by Richard W »

Our family's been using human pronouns and classifier(s) when talking about our kitten in Thai. One such pronoun is หนู, widely used as 2nd/3rd person pronoun for youngsters. Yesterday it finally actually happened - my wife said หนูเป็นแมว. While she meant, "She's a cat", it can also translate as "The mouse is a cat"!
User avatar
alice
Posts: 1397
Joined: Mon Jul 09, 2018 11:15 am
Location: 'twixt Survival and Guilt

Re: Linguistic Miscellany Thread

Post by alice »

According to something I heard on the radio yesterday, "calque" is a loanword, whereas "loanword" is a calque. How many other languages have this, or similar?
"But he had reckoned without my narrative powers! With one bound I narrated myself up the wall and into the bathroom, where I transformed him into a freestanding sink unit.

We washed our hands of him, and lived happily ever after."
Post Reply