Placing of diacritic in lc b with dot below, Dcedilla etc.

Vasil Stanev
Vasil Stanev Posts: 775
edited February 2018 in Technique and Theory
Hi guys,
I am having a tough time with understanding where the diacritic should be placed on some of the Extended Latin glyphs. Namely:
  1. B with dor below 1E02 - under stem or centered?
  2. b with dot below 1E03 - under bowl or centered?
  3. D with cedilla, d with cedilla 1E10, 1E11 - 1. where is the cedilla placed 2. should I supply alternative corresponding glyphs with comma? If yes, how should they be coded?
  4. D with caron below, d with caron below 1E12 1E13 - where is the caron placed?
  5. h with dot above 1E23 - is the dot over the stem like an /i or over the bowl like in /b with dot above?
  6. h with dieresis 1E27 - same question as the previous
  7. H with cedilla, h with cedilla - same question as p.3
  8. p with dot above - same question as 5.
  9. Welsch letters: 1EFA to 1EFF - are they archaic or not at all?
  10. Are there some pecularities I need to know about designing pan-nigerian letters?

    If I had to speculate, I would reason that most of the diacritics below the base letter should be centered, because nobody writing them out would have the time to precisely place the diacritic under the bowl. But I can't be aware of all local preferences. I am using UniBook and visiting various websites to get the needed information but get different results. e.g. I know hcircumflex should have the sign over the stem, and see it misplaced to the right in several sources. So I do not trust everything unconditionally.

    I would kindly ask that we keep spam to a minimum because of deadlines. Only comment if you have precise info. Thanks!

Comments

  • Look at the discussion "Macron on Top".
  • Are there some pecularities I need to know about designing pan-nigerian letters
    This probably doesn't count as “pan-nigerian letters” but for Yoruba, you'll want to localize uni0323 (dotaccent below) with a form that more closely resembles uni0329 (vertical line below).

    André
  • I know hcircumflex should have the sign over the stem, and see it misplaced to the right in several sources. So I do not trust everything unconditionally.
    While I definitely prefer it over the stem, my understanding is that Esperanto users prefer it centered.

    André
  • Igor Freiberger
    Igor Freiberger Posts: 273
    edited February 2018
    After the help you got from John and André, one can hardly add much. Some small points:
    1. For Ḅ, the code is 1E04.
    2. For ḅ, the code is 1E05. Following André observation, both may have an alternative form using 0329 (combining vertical line below), which was used in Yoruba to preserve legibility for underlined letters. This was mostly an adaptation from typewriters time, not sure how relevant this is nowadays.
    3. (b) I include both cedilla- and comma-style. D with cedilla is used for transliterations, D with comma was used in an old orthography for Romanian and also in now-extinct Livonian. Most type designers use an extension like .alt for alternative forms.
    4. For 1E23, 1E27, 021F and 0125, I use the diacritic over the stem and keep alternative versions with diacritic over the bowl. But I am a bit overly worried to regional and minority variations, to have the first option is enough except for Esperanto.
    5. I did a research on this some time ago and decided to use cedillas attached to left stems for all two-leg letters (h, n, k, 028C), but there are samples for right stem and even hanging cedilla at the middle of letters. There was a good topic in Typophile about it.
    6. If you plan to provide a good support for West African languages, do not rely only in Pan-Nigerian. There are numerous additions to be considered, especially in Latin B block. Unhappily, online documentation is limited. More info in this thread here in TD.
  • Thank you for the good replies.
    What's the deal with
    uni0186, uni018C, 018E, is 0190 different then a resized epsilon, is 0192 different than a florin, is 0194 used, is 0195 used, ditto 0196, 019C, 019E, 019F (and is the 019F different than Theta), are 01B5-01BF used at all?

    Are there pecularities like the elonagted dot bellow that are not seen in the unicode table? Are there glyphs that are positively never ever used?

    Ditto for Yogh (021C, 021D)
    Which languages use hcaron? Is the diacritic placed similary like in hcircumflex - does it need an .alt version where it is centered horizontally? 

    Thanks in advance!

  • What's the deal with uni0186, uni018C, 018E
    I'm not sure what you're asking here.

    is 0190 different then a resized epsilon
    0190 is the uppercase form of 025B, both of which are based on the greek letter epsilon, but they should be designed to harmonize with the Latin rather than Greek characters in the font. So whether it is simply a larger version of epsilon would depend on the font.

    is 0192 different than a florin

    0192 unfortunately is used for both the florin and the lowercase of 0191 which don't normally look the same. A font supporting African languages would need to have a localized form of 0192 and just hope that users don't plan on discussing prices in guilders. For African languages it should be based on lowercase f, whereas the florin symbol is typically slightly smaller and often based on an italic f.

    0194 used, is 0195 used, ditto 0196, 019C, 019E, 019F (and is the 019F different than Theta), are 01B5-01BF used at all?

    Used by whom? All of these are used, though some are only used in historical contexts (e.g. wynn, hwair) The unicode charts generally provide some info on where characters are used.

    019F is based on Latin uppercase O. Theta may or may not be the same depending on the font.

    Ditto for Yogh (021C, 021D)

    Again, you'll have to be more specific. Yogh takes a number of different forms, in some cases looking more ezh-like and in other cases more three-like, depending on the font. It is used in middle English, though some normalized texts will replace it with gh or y depending on context.

    André
  • What's the deal with uni0186, uni018C, 018E
    I'm not sure what you're asking here.
    0194 used, is 0195 used, ditto 0196, 019C, 019E, 019F (and is the 019F different than Theta), are 01B5-01BF used at all?
    Used by whom? All of these are used, though some are only used in historical contexts (e.g. wynn, hwair) The unicode charts generally provide some info on where characters are used.
    Thank you for the resourceful answer.
    I was asking if most of these glyphs are used outside of historical contexts, like the two with the bar - it seemed to me these were earlier versions of proposed glyphs that have stayed in Unicode without being actually used.

    Even with the best of intent I do not think it would be possible to cover all the glyphs necessary for African languages, since most of them, as far as I can gather, are still in development. I can find no extensive reliable information about localizations and exceptions. Me and my colleague agreed to leave the uni0234-024D from L. Extended-B range and the IPA Extensions unfilled.