SCA² question
SCA² question
How well does SCA² handle composite glyphs -- that is, characters that (should) appear as one glyph, but are actually made of two or more Unicode points? Like, if I want to put in an ø with an acute accent (ǿ), that takes two Unicode code points: U-00F8 for the ø, and U+0301 for the combining accent. Does SCA² process this as one character or two?
Re: SCA² question
I believe it’s two characters — as far as I’m aware, SCA² works on characters rather than graphemes. Of course, you can use rewrite rules to turn it into one character if you want. (Although in the specific case of ǿ, you can actually represent that as a single code point: ǿ is one character, ø+◌́ is two.)GreenBowtie wrote: ↑Sat May 30, 2020 1:56 am How well does SCA² handle composite glyphs -- that is, characters that (should) appear as one glyph, but are actually made of two or more Unicode points? Like, if I want to put in an ø with an acute accent (ǿ), that takes two Unicode code points: U-00F8 for the ø, and U+0301 for the combining accent. Does SCA² process this as one character or two?
EDIT: I tested it, and e.g. rules like ́/x/_ convert ǿ to øx. So SCA² definitely does treat the combining acute accent as its own character.
Conlangs: Scratchpad | Texts | antilanguage
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices
(Why does phpBB not let me add >5 links here?)
Software: See http://bradrn.com/projects.html
Other: Ergativity for Novices
(Why does phpBB not let me add >5 links here?)
-
- Site Admin
- Posts: 2945
- Joined: Sun Jul 08, 2018 5:46 am
- Location: Right here, probably
- Contact:
Re: SCA² question
Yes, it's quite simpleminded, so use single characters where you can. Rewrite rules are your friend, though!
Re: SCA² question
That's about what I figured, thank you!