Language support & character set standards: how do foundries label their Latin-based products today?
Stephen Coles
Clients often ask me what languages are covered by most professional fonts and I still have to tell them that there’s no simple answer. A good foundry clearly illustrates each font’s support, but a lack of a standard makes it difficult to compare between foundries.
There are Unicode standards and character encodings, but most font makers offer products that cover sets of languages somewhere between ASCII/Basic Latin, Latin Extended A, and full Unicode.
Is it fair to say most font makers now use the labels “Basic Latin” and “Latin Extended” to describe their Latin-based character sets? How standardized are those sets? What is your favorite reference for those sets, one that explains to the average human which languages are covered?
Comments
-
the labels “Basic Latin” and “Latin Extended”
These terms have lost what meaning they had several years ago. I don't know if they have any value now other than causing confusion. A list of covered languages is more telling but much longer to spell out. The "Pro" term has likewise outlived its usefulness. Perhaps it would be better if there were true and universally accepted standard labels. God help us if we try to agree on what those are, though!
Even if we used a figure for the percentage of all Latin-script languages covered [YourFont 90%], a user would still want to know which languages those were.
-
I'm confused by the coverage of Latin Extended as well. One foundry will list Latin Extended with support for about 59 languages, while another foundry with the same glyph set (basically Latin Extended-A) claims support for more than 140. What am I missing here? It's the exact same glyph set.
-
I don’t label my fonts with any character set because there are enough labels out there to be confusing. I put a complete character set in the PDF specimen, and my newer stuff includes a list of languages I know are supported by that character set. I don’t pad my list, though, with different versions of Norwegian, mutually intelligible dialects, constructed languages, and politically disputed names of languages.
Even better, all of my vendors will display a dump of the character set and/or have a type tester that supports diacritical marks, so designers can look this up themselves. That is something they should be able to learn in about five minutes: it’s not hard to look up a language on Omniglot or Wikipedia and see what letters it uses. This would probably be a good topic for a Typographica article. World languages are fun stuff, and designers would probably be happy to know that one doesn’t need a degree in linguistics to look this stuff up.
-
I think you answered my question, James, lol.
-
Fontspring has a system that displays most of the languages covered under the tech specs tab. It seems to scan for minimum character sets for each language rather than using codepage flags. For example, it can differentiate between a few Greek symbols for mathematics and a proper, usable Greek set.
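The kind of scan described here can be sketched in a few lines of Python. This is only an illustration of the idea, not Fontspring's actual method: the `REQUIRED` table, the function names, and the sample character set are all made up for the example, and the per-language requirements are simplified to lowercase letters.

```python
# Hypothetical, simplified per-language requirements (lowercase only).
# A real table would come from a curated source and be far larger.
LATIN = set("abcdefghijklmnopqrstuvwxyz")
REQUIRED = {
    "English": LATIN,
    "German": LATIN | set("äöüß"),
    # Greek small letters alpha..omega, U+03B1..U+03C9 (includes final sigma).
    "Greek": {chr(cp) for cp in range(0x03B1, 0x03CA)},
}

def supported_languages(charset, required=REQUIRED):
    """Return the languages whose whole required set is present in charset."""
    return sorted(lang for lang, chars in required.items() if chars <= charset)

def missing_characters(charset, language, required=REQUIRED):
    """Return the required characters for a language that charset lacks."""
    return sorted(required[language] - charset)

# A Latin font that carries a few Greek letters for mathematics only:
# pi (U+03C0), mu (U+03BC) and capital omega (U+03A9).
font_chars = REQUIRED["German"] | set("\u03c0\u03bc\u03a9")

print(supported_languages(font_chars))
print(len(missing_characters(font_chars, "Greek")), "Greek letters missing")
```

A codepage flag would only say that some Greek is present; the subset test reports Greek as unsupported until every required letter is there.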
-
There are other standardized character sets. For instance, my Cormorant family covers Adobe Latin 4:
http://adobe-type-tools.github.io/adobe-latin-charsets/adobe-latin-4.html
-
What am I missing here, it's the exact same glyph set.
As James alluded to, it depends upon how one decides to define “language” and “coverage.”
The difference between a language and a dialect is a fuzzy one and different foundries will choose to draw the line in different places. (Or, more accurately perhaps, the sources they relied upon drew the lines in different places.)
Is Dalecarlian a dialect (or group of dialects) of Swedish or an independent language deserving of being listed separately? (I’ve seen it on at least one list, although I cannot find a source for the Latin alphabet required.)
Coverage can also be a fuzzy area. The Guaraní alphabet includes a g with a tilde over it. This character is not encoded in Unicode as a single precomposed code point; it has to be written as a g followed by a combining tilde. Most fonts do not include a gtilde glyph, while they often cover the rest of the diacritics required. So, do they cover Guaraní? Some will include this in their list, some will not. Does it depend upon having a combining tilde? Does it depend upon having a {mark} feature to place that tildecmb over the g or not?
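The g-with-tilde situation can be checked with Python's standard `unicodedata` module. The sketch below asks whether Unicode normalization (NFC) can compose a base letter and a combining mark into one code point; the helper name `has_precomposed` is made up for this example.

```python
import unicodedata

TILDE = "\u0303"  # COMBINING TILDE

def has_precomposed(base, mark):
    """True if NFC composes base + combining mark into a single code point."""
    return len(unicodedata.normalize("NFC", base + mark)) == 1

print(has_precomposed("n", TILDE))  # n + tilde composes to U+00F1
print(has_precomposed("g", TILDE))  # g + tilde stays as two code points
```

Where no precomposed code point exists, text can only reach the font as base plus combining mark, so whether it renders acceptably is decided by what the font does with that sequence, not by its list of encoded characters.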
Jèrriais is the form of the Norman language spoken in Jersey, one of the Channel Islands off the coast of France. There are, perhaps, a couple thousand speakers. The alphabet does not require any diacritics beyond those used for French. Does Jèrriais need to be listed by a foundry as a covered language? Some do, some do not.
Beyond the usual suspects, the issue of language support gets murky and it may be tricky (if not impossible) to give a complete, exhaustive, and definitive list for any given set of codepoints.
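To see how the same glyph set can yield different language counts, here is a toy comparison in Python. Both language lists are hypothetical, the French requirement is simplified to lowercase letters, and Jèrriais is assumed to need exactly the French set, in line with the description above.

```python
# Simplified lowercase French requirement (illustrative only).
FRENCH = set("abcdefghijklmnopqrstuvwxyzàâæçéèêëîïôœùûüÿ")

# Two hypothetical foundry lists built on the same requirement.
LUMPED = {"French": FRENCH}
SPLIT = {"French": FRENCH, "Jèrriais": FRENCH}  # assumption: same set as French

def count_supported(charset, languages):
    """Count the languages whose required characters are all in charset."""
    return sum(1 for chars in languages.values() if chars <= charset)

font_chars = set(FRENCH)  # one font, one character set
print(count_supported(font_chars, LUMPED))
print(count_supported(font_chars, SPLIT))
```

The font is identical in both calls; only the granularity of the list changes the number.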
-
Good point about web support, Frode.
-
Trying to be as brief and clear as I can be when writing, I most often describe Latin faces as covering Western European languages, Central European languages, or both, and beyond that by the scripts covered. It’s a little sloppy, but as we’ve discussed here there’s no perfect solution. FontShop’s family pages do list the languages covered by a given family, which I think gives as specific an answer as is needed.
-
With web fonts, I think you cannot any longer claim to support a language unless you offer mark features.
Do browsers automatically (and reliably) build accented letters on the fly?
-
Browser support for combining mark sequences should be pretty reliable because the layout engines used by all the major browsers support GPOS mark positioning.
-
John Hudson said:
Browser support for combining mark sequences should be pretty reliable because the layout engines used by all the major browsers support GPOS mark positioning.
If only Google Fonts served from the CDN weren't stripped of GPOS... Google Chrome is actually monkey-patching this as it substitutes precomposed glyphs on the fly, even if the font doesn't. Firefox reveals the ugly truth (about both GF and Google Translate in this case).
-
We provide a list of the languages supported, on the theory that it will make more sense to end users.
-
I agree with the notion that it is tricky to give the long answer on the matter, because a common understanding of the underlying definitions hardly exists. Fontspring lists e.g. Arapaho, Cebuano, Gilbertese and Warlpiri for some of my fonts. How useful is that? I don’t know.
In order to find a brief answer: I tend to label my fonts with “complete Euro-Latin”. Not that any official definition of Euro-Latin exists, but I hope it gives people a sensible clue that the font will work for all Romance, Germanic, Gaelic, Slavic, Finno-Ugric and Baltic as well as Turkic languages, nearly everything which is likely to occur in a European context. By that label I admittedly also imply the usage scope of most (large) American languages. Since these are basically English, French, Spanish and Portuguese, I usually don’t mention “American” explicitly, and I’m not aware that anyone does.
Vietnamese, however (this corresponds to this earlier discussion), is another matter, as are Azeri (geographically Asian but linguistically a branch of the Euro-related Turkic complex) and the indigenous American languages, which require special attention in some respects.
I know my view of this is somewhat Euro-centric. What I’m not convinced of is the (in my opinion) completely outdated labeling by “Western” and “Eastern” or “Central European”; I see no point in splitting the Latin realm by those categories.
I also think that, for a truly global mastering, more insights about concepts from Asia, Africa and the Americas would be welcome. The other discussion (see link above) is now 5 years old, and little seems to have moved since then.