Now that the Balinese lontar project on Palmleaf.org has proven successful, the Internet Archive recently suggested that we move it to a well-established platform so the Balinese community can take over management, and so it can be maintained long-term. The obvious choice is Wikisource, both because of the nature of the content (lontar works are like books) and because Palmleaf.org already runs on MediaWiki, the software that powers all Wikimedia Foundation sites.
PanLex recently reached out to the Balinese Wikipedia community, and they agreed that it’s a great idea to move the Palmleaf.org platform to a new Balinese Wikisource. It will make the lontar content more widely available and useful, and the infrastructure we have developed will help enable future projects in other languages. In order to fund the work needed for the transition, PanLex has applied for a project grant from the Wikimedia Foundation. We’ve gotten great feedback so far. If you have any comments or feedback on the proposal, there is still time to weigh in on the grant page! Funding decisions will be announced in May.
PanLex wants to be part of this international effort to safeguard and support the world’s indigenous languages. It was great to hear from so many people and organizations around the world that are working towards the same goal. Thanks to our friends at Translation Commons for telling us about the event! PanLex is excited to participate, in particular in Translation Commons’s recently announced Language Digitization Initiative, which will help achieve the goals of frameworks like the International Decade of Indigenous Languages (IDIL).
Word origins can vary widely within and across languages. For certain concepts like ‘water’, ‘head’, or ‘egg’, the modern words often resemble the ancestors of these words from thousands of years ago. On the other hand, for wanderwort concepts like ‘taxi’ or ‘sushi’, borrowing across languages is the norm. Nearly anywhere you go in the world, people call taxis something like taxi and sushi something like sushi. Most concepts, however, fall somewhere in between. Many of these are culturally or geographically specific, and the words for them are lost or change significantly as cultures change or people migrate.
Generally, words for human body parts fall into the first category and are decidedly stable, since every culture has them in common. All human cultures need to refer to their noses, knees, and hearts. Oddly enough, however, the names of individual fingers seem to be quite unstable over time. Closely related languages, and even dialects of the same language, often have totally different words for the middle finger or ring finger. Even when the names share the same meaning (such as ‘middle’ or ‘ring’), they are often built from completely unrelated words for those concepts.
Yvypóra po rechauka, avañe’eme / Finger names in Guaraní, by P. S. F. Freitas, CC-BY-SA 3.0
What if you fed a machine-learning algorithm a list of ice cream flavor names and tasked it with creating new ones? What if you fed it a bunch of pick-up lines and got it to generate new ones? What would the results be like?
On December 16, 2019, PanLex director David Kamholz spoke at the San Francisco Center for the Book on PanLex and the Internet Archive’s joint project to digitize Balinese palm-leaf manuscripts. You can view a recording of the presentation below, and slides are available here. The presentation introduces Balinese manuscripts, covers technical challenges in supporting Balinese script (the writing system in which the manuscripts are written), invites the audience to interactively analyze core features of the Balinese script, demos the new Palm Leaf Wiki, and recognizes the team of Balinese contributors who are managing the transcription (typing) of manuscripts on the wiki. To date, more than 3,000 leaves and 92 complete works have been transcribed.