Context of Diacritics

Options
Hello type folks!

I'm very excited to announce that Context of Diacritics — an analysis of diacritics made to help you with adding diacritics to your ligatures — is finally up and running. Check it out here: http://urtd.net/projects/cod/

I look forward to your feedback.

Cheers, Ondrej
cod.jpg 127.8K

Comments

  • Chris Lozos
    Chris Lozos Posts: 1,458
    Options
    Thanks! That should be helpful!
  • Nick Shinn
    Nick Shinn Posts: 2,159
    edited August 2013
    Options
    “What language uses the most letters with diacritics?”

    Is that per alphabet or per written text?
    Also, are polytonic Greek marks considered diacritics?
  • Ondrej Jób
    Options
    It is per text, probably it should be noted somewhere on the page.

    This time I only focused on Latin, didn't give Greek much thought. I'll see if I can do it in the future.
  • PabloImpallari
    Options
    Great work Ondrej!

    Here you have one more to add to your "words by base pairs" list: Sofía. My daughter's name, the /f/iacute pair always breaking fonts :)
  • Chris Lozos
    Chris Lozos Posts: 1,458
    Options
    Ahh, those f words again :-)
  • Nick Shinn
    Nick Shinn Posts: 2,159
    Options
    My daughter’s name: Zoë.
    More English diacritics:
    http://en.wiktionary.org/wiki/Appendix:English_words_with_diacritics
  • Ondrej Jób
    Options
    Thanks Nick, nice resource, I didn't know about it. I didn't include English in the database, because the diacritics is only used in loanwords and those are covered in the analysis of the respective source languages.

    The project is less about mapping individual languages and more about listing all existing diacritical combinations. I'm sure all combinations used in English loanwords are already in the database.
  • James Puckett
    James Puckett Posts: 1,979
    Options
    It’s important to remember that just because a letter combination does appear in a dictionary does not mean that said combination does not exist. Names of people and places tend to break the rules that apply to other kinds of words.

    That said, this is a useful tool!
  • Jack Jennings
    Jack Jennings Posts: 151
    edited August 2013
    Options
    From a cursory reading of the list: coördinate, coöperate, preëminent, reëlect, reënter, reëstablish, daïs, zoölogy, oölogy, and oöcyte.

    I'm sure all of these spellings/words are extremely rare, but I don't think these would be considered loan words (or at least not particularly recent ones) and each contains a diacritic combination that doesnt seem to be present in other languages…

    Great resource none the less though!
  • Ondrej Jób
    Options
    As I said earlier: The project is less about mapping individual languages and more about listing all existing diacritical combinations. I'm sure all combinations used in English loanwords are already in the database.

    http://urtd.net/projects/cod/combinations/o_odieresis
    http://urtd.net/projects/cod/combinations/e_edieresis
  • Nick Shinn
    Nick Shinn Posts: 2,159
    edited August 2013
    Options
    Dude, no ndieresis?
  • Andreas Stötzner
    Options
    I missed fð.
  • [Deleted User]
    Options
    The user and all related content has been deleted.
  • Santiago Orozco
    Options
    Thank you Ondrej, this is sweet!
  • Jack Jennings
    Options
    James I think their style guide might actually require one of those in each article printed. They pay their writers on a per-diaeresis basis.
  • Filip Blazek
    Options
    Ondrej, congratulations! Very useful tool.
  • Nick Shinn
    Nick Shinn Posts: 2,159
    Options
    It should be spelled diëresis, dammit!
    At least, that‘s how I’ve always pronounced it, rather than “die-resis”.
  • Dusan Jelesijevic
    Dusan Jelesijevic Posts: 66
    edited February 2014
    Options
    In Serbia people also use Latin, beside Cyrillic.