Online media annotation

PARADISEC has digitised thousands of hours of legacy audio, the results of fieldwork by many researchers since the 1950s. The recordings are in a number of small languages (more than 1,400 languages are represented in the collection). For many of these recordings the only written information we have are scant notes on the tape cover. … Read more

Online Elan file player in the PARADISEC collection

In PARADISEC we store media files with their transcriptions whenever possible, typically in .eaf format, created by the standard transcription tool Elan. Best practice in language documentation includes creating a corpus of media with transcripts so that others can access it in future and locate what is in the files. Untranscribed files remain largely inaccessible, … Read more

Offline static collections

Following the earlier discussion of creating collections for offline delivery (particularly on Raspberry Pi), we now have a simple method that indexes a set of items from the PARADISEC collection and generates an html view, which means that files are not disconnected from the catalog in the way they were in the past. To do … Read more

Large language models for small languages

Of the 7,000 languages in the world today, most have little presence on the internet. What records there are in these languages are often religious translations from large languages and so, while being a valuable set of texts in the language, they have little local cultural content. For some of these languages, there are recordings … Read more

The Tape Restorator

We wrote about dried out cassette tapes in an earlier blog post, and the problem they create for playback, screeching as they try to move through the playback machine’s mechanism and ultimately failing to play. You can hear an audio example in that post. To get the tapes into a playable form, they need to … Read more

Using Raspberry Pi in Ranongga

By guest blogger Debra McDougall I thought I’d write a short update to Nick’s post on 21 January 2024 about using the Raspberry Pi to return legacy recordings to people in Ranongga, Solomon Islands. These recordings were digitised in 2019-2020 with the support of an ELDP Legacy Materials Grant (0609), a project led by myself and … Read more