SCA² question

Emily
Posts: 387
Joined: Fri Aug 03, 2018 6:24 am

Post by Emily »

How well does SCA² handle composite glyphs -- that is, characters that (should) appear as one glyph, but are actually made of two or more Unicode code points? Like, if I want to put in an ø with an acute accent (ǿ), that takes two Unicode code points: U+00F8 for the ø, and U+0301 for the combining accent. Does SCA² process this as one character or two?
bradrn
Posts: 6293
Joined: Fri Oct 19, 2018 1:25 am

Re: SCA² question

Post by bradrn »

GreenBowtie wrote: Sat May 30, 2020 1:56 am
How well does SCA² handle composite glyphs -- that is, characters that (should) appear as one glyph, but are actually made of two or more Unicode code points? Like, if I want to put in an ø with an acute accent (ǿ), that takes two Unicode code points: U+00F8 for the ø, and U+0301 for the combining accent. Does SCA² process this as one character or two?
I believe it’s two characters: as far as I’m aware, SCA² works on individual code points rather than graphemes. Of course, you can use rewrite rules to turn the sequence into one character if you want. (Although in the specific case of ǿ, you can actually represent that as a single code point: the precomposed ǿ (U+01FF) is one character, while ø+◌́ (U+00F8 followed by U+0301) is two.)

EDIT: I tested it, and e.g. rules like ́/x/_ convert ǿ to øx. So SCA² definitely does treat the combining acute accent as its own character.
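
For anyone who wants to check the code point counts outside SCA², here is a minimal Python sketch. It only uses the standard `unicodedata` module; the `code_points` helper is a made-up name for illustration, and the `str.replace` call is just a rough stand-in for the rule tested above, not how SCA² itself is implemented.

```python
import unicodedata

# The same visible glyph, spelled two ways.
decomposed = "\u00f8\u0301"   # ø followed by U+0301 COMBINING ACUTE ACCENT
precomposed = "\u01ff"        # ǿ as the single code point U+01FF

def code_points(text):
    """List the code points in text as U+XXXX strings (illustrative helper)."""
    return [f"U+{ord(ch):04X}" for ch in text]

print(code_points(decomposed))    # ['U+00F8', 'U+0301']
print(code_points(precomposed))   # ['U+01FF']

# NFC normalization composes the pair into the single precomposed code point.
composed = unicodedata.normalize("NFC", decomposed)
print(composed == precomposed)    # True

# A tool that works code point by code point sees the accent on its own,
# so replacing it only affects the decomposed spelling.
after_rule_decomposed = decomposed.replace("\u0301", "x")    # 'øx'
after_rule_precomposed = precomposed.replace("\u0301", "x")  # unchanged
```

So if the input is normalized to NFC before it goes into the word list, ǿ should arrive as one character. That only helps for combinations that have a precomposed form in Unicode.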
zompist
Site Admin
Posts: 2972
Joined: Sun Jul 08, 2018 5:46 am
Location: Right here, probably

Re: SCA² question

Post by zompist »

Yes, it's quite simpleminded, so use single characters where you can. Rewrite rules are your friend, though!
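
To illustrate the idea (not SCA²'s actual rewrite-rule syntax), here is a rough Python sketch of the same trick: swap each multi-code-point sequence for a single stand-in character before the sound changes run, then swap it back for display. The `REWRITES` table and the `to_internal` / `to_display` names are hypothetical, and the stand-in characters are arbitrary choices that are assumed not to occur anywhere else in the lexicon.

```python
# Hypothetical table: combining sequence -> single stand-in character.
# As far as I know, ɔ́ and ɛ́ have no precomposed form, so NFC would not help here.
REWRITES = {
    "\u0254\u0301": "\u00d3",  # ɔ + combining acute -> Ó (stand-in)
    "\u025b\u0301": "\u00c9",  # ɛ + combining acute -> É (stand-in)
}

def to_internal(word):
    """Replace each combining sequence with its one-character stand-in."""
    # Longest sequences first, so a longer sequence is never split by a shorter one.
    for seq in sorted(REWRITES, key=len, reverse=True):
        word = word.replace(seq, REWRITES[seq])
    return word

def to_display(word):
    """Undo to_internal, restoring the combining sequences."""
    for seq, stand_in in REWRITES.items():
        word = word.replace(stand_in, seq)
    return word

word = "k\u0254\u0301t"          # kɔ́t, four code points
internal = to_internal(word)     # three code points, one per "letter"
restored = to_display(internal)  # back to the original spelling
```

With the stand-ins in place, every rule sees one character per letter, which is the situation a code-point-based tool handles best.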
Emily
Posts: 387
Joined: Fri Aug 03, 2018 6:24 am

Re: SCA² question

Post by Emily »

That's about what I figured, thank you!