Online Elan file player in the PARADISEC collection

In PARADISEC we store media files with their transcriptions whenever possible, typically in .eaf format, created by the standard transcription tool Elan. Best practice in language documentation includes creating a corpus of media with transcripts so that others can access it in future and locate what is in the files. Untranscribed files remain largely inaccessible, because they can be found only through simple descriptive metadata. With the new catalog viewer (discussed earlier) we can search within files, not just in the descriptive metadata, so we can find words inside Elan transcripts and instantly play the result.

PARADISEC has 16,339 Elan files in 9,979 items. We previously had a player for these files, but it only presented the first tier of an Elan file. That tier may have been the transcript line, but it could equally have been a chunk number or anything else the creator decided to include, because Elan files can have as many tiers of annotation as the creator wants to add. An online player needs to take that into account by presenting the user with a checklist of which tiers to display.

John Ferlito has now written an Elan player that does this, and it is available for all items in our collection. For any media file in the PARADISEC catalog that has an .eaf file associated with it, displaying the media file will also show the Elan player, as can be seen in the video below. You can view the transcript either vertically or horizontally, and you can jump to anywhere in the file using the slider.

The relationship between a media file and its transcript is inferred from the filenames: the two share the same name and differ only in their extension (.eaf vs .wav or .mp4). We will soon add a metadata entry to make the link explicit (‘isAnnotationOf’, ‘hasAnnotation’), which will allow the creator to link different transcript or media files.
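
To illustrate the filename-based matching described above, here is a minimal Python sketch. It is not the catalog's actual code: the function name, the set of media extensions, and the example filenames are all assumptions made for illustration.

```python
from pathlib import PurePosixPath

# Assumption: the extensions treated as media here are illustrative only.
MEDIA_EXTENSIONS = {".wav", ".mp3", ".mp4"}


def pair_media_with_transcripts(filenames):
    """Map each media filename to the .eaf file that shares its base name."""
    # Index every transcript by its name without the extension.
    transcripts = {
        PurePosixPath(name).stem: name
        for name in filenames
        if PurePosixPath(name).suffix.lower() == ".eaf"
    }
    pairs = {}
    for name in filenames:
        path = PurePosixPath(name)
        # A media file is paired only if a transcript has the same base name.
        if path.suffix.lower() in MEDIA_EXTENSIONS and path.stem in transcripts:
            pairs[name] = transcripts[path.stem]
    return pairs


# Hypothetical filenames, not real catalog entries.
files = ["EX1-001-01.wav", "EX1-001-01.eaf", "EX1-001-02.mp4"]
print(pair_media_with_transcripts(files))
```

In this sketch the second media file has no transcript with a matching name, so it is simply left out of the result. That is the kind of case an explicit metadata link could handle.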

Example of audio playing with an Elan transcript. Matthew David telling a story about ten rats from Pangpang village, Efate, Vanuatu. https://catalog.paradisec.org.au/repository/NT16/2025072502/NT16-2025072502-01.mp3
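
For readers curious about why a tier checklist is needed, the following Python sketch shows how the tiers in an .eaf file might be listed and how the annotations for a chosen subset might be pulled out. It is not the code behind the PARADISEC player. The sample document is a hand-written, heavily cut-down fragment using the standard EAF element names, the function names are hypothetical, and only time-aligned annotations are handled (annotations that refer to another annotation are skipped).

```python
import xml.etree.ElementTree as ET

# A cut-down, invented EAF fragment with two tiers: a chunk number and a transcript.
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1500"/>
  </TIME_ORDER>
  <TIER TIER_ID="chunk">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1" TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>001</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="transcript">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a2" TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>example sentence</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def list_tiers(root):
    """Return tier names in document order, e.g. to build a checklist."""
    return [tier.get("TIER_ID") for tier in root.iter("TIER")]


def annotations_for(root, tier_ids):
    """Return (tier, start_ms, end_ms, text) for the selected tiers."""
    # Time slots map an ID to a position in milliseconds.
    times = {
        slot.get("TIME_SLOT_ID"): int(slot.get("TIME_VALUE"))
        for slot in root.iter("TIME_SLOT")
        if slot.get("TIME_VALUE") is not None
    }
    rows = []
    for tier in root.iter("TIER"):
        if tier.get("TIER_ID") not in tier_ids:
            continue
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            start = times.get(ann.get("TIME_SLOT_REF1"))
            end = times.get(ann.get("TIME_SLOT_REF2"))
            text = ann.findtext("ANNOTATION_VALUE", default="")
            rows.append((tier.get("TIER_ID"), start, end, text))
    return rows


root = ET.fromstring(SAMPLE_EAF)
print(list_tiers(root))                       # tiers a user could tick
print(annotations_for(root, {"transcript"}))  # only the chosen tier
```

In this invented sample the first tier holds only a chunk number, which is exactly the case where a player that shows only the first tier would miss the transcript.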
