Open Source Site Archives Endangered Languages


The Pangloss Collection, an open archive containing more than 3600 audio and video recordings in 170 languages from across all continents, is now being revamped with a new website.

Examples from the archive include stories and songs in Xârâgurè (New Caledonia), conversations and tales in Kakabe (Guinea), and cooking recipes in Koyi rai (Nepal) and Na-našu (Italy). In total, the collection offers 780 hours of listening.

The archives are the result of more than twenty years’ work by linguists and ethnologists who, each in their own field of study, collect and preserve the world’s linguistic heritage. Some of the documents come from the digitization of old magnetic tapes. Nearly half of the recordings are transcribed and annotated, some with contextual elements or translations into other languages. The site is open to contributions from both academic and non-academic experts, who are encouraged to improve the corpus by contributing transcriptions and translations.

To make the collection more accessible to the general public, who can freely listen to and download these precious documents and thereby get a sense of the world’s linguistic diversity, the redesigned pangloss.cnrs.fr website now offers two levels of access. As the content is largely under a Creative Commons license, it is available for use in museographic projects or audio creations.

Beyond its heritage aspect, this collection is also part of an open science approach to facilitate the conservation, referencing, and availability of primary data for researchers. Its purpose is to limit the loss of scientific data (a “second death” for extinct languages) whilst also encouraging collaboration with other disciplines: computer scientists interested in automatic language processing can access the files they need and take part in the co-development of tools (e.g. for automatic transcription). The site is fully bilingual (French–English) and also includes partial translations in other languages, including Chinese for records in certain Asian languages.

In addition to contributions from various laboratories associated with the CNRS, the Pangloss Collection is supported by the recently created Institute for Linguistic Heritage and Diversity at the EPHE-PSL, and its data are stored in the archive of the large research infrastructure (Très grande infrastructure de recherche, TGIR) Huma-Num. The Pangloss Collection is a member of the international Digital Endangered Languages and Musics Archives Network (DELAMAN). It is hosted by the Cocoon platform (Collection de corpus oraux numériques), which is one of the participating archives of the Open Language Archives Community (OLAC).

