On December 16, 2019, PanLex director David Kamholz spoke at the San Francisco Center for the Book on PanLex and the Internet Archive’s joint project to digitize Balinese palm-leaf manuscripts. You can view a recording of the presentation below, and slides are available here. The presentation introduces Balinese manuscripts, covers technical challenges in supporting Balinese script (the writing system in which the manuscripts are written), invites the audience to interactively analyze core features of the Balinese script, demos the new Palm Leaf Wiki, and recognizes the team of Balinese contributors who are managing the transcription (typing) of manuscripts on the wiki. To date, more than 3,000 leaves and 92 complete works have been transcribed.
In 2019, Translations Commons published “Indigenous Languages: Zero to Digital”, a guide to creating digital infrastructure for indigenous communities. Using flowcharts and clear instructions, it explains how to create every level of the technology stack required to make a language usable online. This easy-to-understand and ground-breaking resource was co-authored with several partners in language and technology, and in coordination with the United Nations’ International Year of Indigenous Languages.
Translation Commons is a nonprofit, volunteer-run resource-sharing platform for language professionals.
Words for animals often have interesting histories. Some, like English mouse, have remained almost unchanged for centuries (millennia, if you go back to Indo-European). Others, like English dog, can be tracked only so far before the trail runs dry. The word for bear was altered in many Indo-European languages through a process called taboo deformation.
This post brings together some English small animal names with interesting histories, including some bonus notes on other languages.
The immediate source of English squirrel is Anglo-French esquirel, in turn derived from Old French escurueil (compare Modern French écureuil). This word derives from Vulgar Latin*scuriolus (the asterisk means the form is reconstructed—inferred from evidence rather than directly attested), which is a diminutive of *scurius. The reconstructed *scurius is a metathesized variant of attested Latin sciurus ‘squirrel’. We cannot say for certain why sciurus was metathesized into *scurius, but a likely contributing factor is that *scurius better fits typical Latin word patterns; Latin has many nouns ending with -ius and few words beginning with sciu-.
Statue of Saxon leader Widukind in Herford, Germany. (Image by M. Kunz)
Every November 5, the United Kingdom celebrates Guy Fawkes Night. Guy Fawkes was an Englishman who attempted to blow up the House of Parliament in 1605. The story is fairly well known—but why was this guy named Guy? What kind of a name is that, anyway? As it turns out, it’s kind of a long story!
Proto-Germanic, the reconstructed ancestor language of Germanic languages such as English and German, had a word *widuz ‘wood’—this, in fact, is the source of the English word wood. This root was used in names such as Old Saxon Widukind, literally ‘child of the wood’. These names could be shortened to Wido. The short form was borrowed into Old French as the name Guy and into Italian as Guido. The initial g-sound was added to fit the sound pattern of these languages; neither allowed w at the beginning of a word, and borrowed words originally beginning with w were pronounced with g. (The same process is evident in French guerre and Italian guerra ‘war’, which derive from a Frankish word similar to English war.)
On October 25, 02019, PanLex was honored to present the first keynote speech at WikidataCon in Berlin, Germany. As our representative, I was excited to share PanLex’s ideas about the importance of linguistic diversity and lexical data’s role in helping to preserve that diversity with the staff, volunteers, and users of Wikidata.
The Wikidata audience was wonderfully receptive to PanLex’s mission and work. A significant portion of the talks and workshops at the conference were on how Wikidata can help underserved, minority, and indigenous language communities, so the ground was ripe for discussions of how our respective missions aligned. Read More…