Context of Diacritics

Hello type folks!

I'm very excited to announce that Context of Diacritics — an analysis of diacritics made to help you with adding diacritics to your ligatures — is finally up and running. Check it out here: http://urtd.net/projects/cod/

I look forward to your feedback.

Cheers, Ondrej
cod.jpg 127.8K

Comments

  • Chris Lozos
    Chris Lozos Posts: 1,458
    Thanks! That should be helpful!
  • Nick Shinn
    Nick Shinn Posts: 2,207
    edited August 2013
    “What language uses the most letters with diacritics?”

    Is that per alphabet or per written text?
    Also, are polytonic Greek marks considered diacritics?
  • It is per text, probably it should be noted somewhere on the page.

    This time I only focused on Latin, didn't give Greek much thought. I'll see if I can do it in the future.
  • Great work Ondrej!

    Here you have one more to add to your "words by base pairs" list: Sofía. My daughter's name, the /f/iacute pair always breaking fonts :)
  • Chris Lozos
    Chris Lozos Posts: 1,458
    Ahh, those f words again :-)
  • Nick Shinn
    Nick Shinn Posts: 2,207
    My daughter’s name: Zoë.
    More English diacritics:
    http://en.wiktionary.org/wiki/Appendix:English_words_with_diacritics
  • Thanks Nick, nice resource, I didn't know about it. I didn't include English in the database, because the diacritics is only used in loanwords and those are covered in the analysis of the respective source languages.

    The project is less about mapping individual languages and more about listing all existing diacritical combinations. I'm sure all combinations used in English loanwords are already in the database.
  • James Puckett
    James Puckett Posts: 1,992
    It’s important to remember that just because a letter combination does appear in a dictionary does not mean that said combination does not exist. Names of people and places tend to break the rules that apply to other kinds of words.

    That said, this is a useful tool!
  • Jack Jennings
    Jack Jennings Posts: 151
    edited August 2013
    From a cursory reading of the list: coördinate, coöperate, preëminent, reëlect, reënter, reëstablish, daïs, zoölogy, oölogy, and oöcyte.

    I'm sure all of these spellings/words are extremely rare, but I don't think these would be considered loan words (or at least not particularly recent ones) and each contains a diacritic combination that doesnt seem to be present in other languages…

    Great resource none the less though!
  • As I said earlier: The project is less about mapping individual languages and more about listing all existing diacritical combinations. I'm sure all combinations used in English loanwords are already in the database.

    http://urtd.net/projects/cod/combinations/o_odieresis
    http://urtd.net/projects/cod/combinations/e_edieresis
  • Nick Shinn
    Nick Shinn Posts: 2,207
    edited August 2013
    Dude, no ndieresis?
  • I missed fð.
  • The user and all related content has been deleted.
  • Thank you Ondrej, this is sweet!
  • James I think their style guide might actually require one of those in each article printed. They pay their writers on a per-diaeresis basis.
  • Ondrej, congratulations! Very useful tool.
  • Nick Shinn
    Nick Shinn Posts: 2,207
    It should be spelled diëresis, dammit!
    At least, that‘s how I’ve always pronounced it, rather than “die-resis”.
  • Dusan Jelesijevic
    Dusan Jelesijevic Posts: 66
    edited February 2014
    In Serbia people also use Latin, beside Cyrillic.