{"id":10313,"date":"2026-06-23T13:01:21","date_gmt":"2026-06-23T03:01:21","guid":{"rendered":"https:\/\/www.paradisec.org.au\/blog\/?p=10313"},"modified":"2026-06-23T13:02:13","modified_gmt":"2026-06-23T03:02:13","slug":"online-media-annotation","status":"publish","type":"post","link":"https:\/\/www.paradisec.org.au\/blog\/2026\/06\/online-media-annotation\/","title":{"rendered":"Online media annotation"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">PARADISEC has digitised thousands of hours of legacy audio, the results of fieldwork by many researchers since the 1950s. The recordings are in a number of small languages (more than 1,400 languages are represented in the collection).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For many of these recordings the only written information we have are scant notes on the tape cover. If we could listen to all the files we could gather more metadata and make the contents more useful for speakers of these languages. But that would take thousands of hours of listening time, that we don&#8217;t have. While we are working on <a href=\"https:\/\/www.paradisec.org.au\/blog\/2026\/02\/speech-recognition-on-your-laptop\/\" target=\"_blank\" rel=\"noreferrer noopener\">Automated Speech Recognition<\/a> methods to transcribe media automatically, it is still early days for that technology being applied to all the different languages in PARADISEC.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In the past, we have had a project of asking volunteers to download files and to provide summaries in Elan that has been successful in providing guides to media files. Sometimes these are summaries that tell you where events in the file occur (singing here, a woman talking here, a different woman talking here) sometimes they include the opening metadata spoken by the recorder, in other cases they capture the English words used in elicitation. Once a summary is prepared, a user can go straight to the section they are interested in and transcribe it (like correcting OCR in <a href=\"https:\/\/trove.nla.gov.au\/about\" target=\"_blank\" rel=\"noreferrer noopener\">TROVE<\/a>). For example, Arthur Capell&#8217;s <a href=\"https:\/\/catalog.paradisec.org.au\/repository\/AC1\/417\" target=\"_blank\" rel=\"noreferrer noopener\">file of Vanuatu recordings<\/a> has a transcript here: <a href=\"https:\/\/catalog.paradisec.org.au\/repository\/AC1\/417\/AC1-417-A1.mp3.\">https:\/\/catalog.paradisec.org.au\/repository\/AC1\/417\/AC1-417-A1.mp3.<\/a> The summary provided by the volunteer for this 40 minute file is below, giving essentially seven sections in the file:<\/p>\n\n\n\n<p class=\"has-text-align-left has-small-font-size wp-block-paragraph\">Omba recordings, Walurigi dialect done by Simon Garae<br>Reading of the prodigal son in Omba<br>Part two &#8211; The Prodigal Son story<br>Part Three &#8211; story text<br>Reading of story in Aneityum<br>Readings in the Kwamera dialect of Tanna<br>Recordings in Raga. Northern dialect of Pentacost Islands<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">You can see that it gives a high level summary of the content of the file and is not a transcript of the content of the recording. To make it easier for this kind of summary to be provided, we now have an online transcription system, called <a href=\"https:\/\/cockatiel.crate-works.org\/\" data-type=\"link\" data-id=\"https:\/\/cockatiel.crate-works.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Cockatiel<\/a>, that runs in the browser.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Cockatiel does the initial segmentation of the audio, based on silence detection, and speaker diarisation (identifying the voices of speakers in the recording) as can be seen in this image:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><a href=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50.png\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"567\" src=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50-1024x567.png\" alt=\"\" class=\"wp-image-10317\" srcset=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50-1024x567.png 1024w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50-300x166.png 300w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50-768x426.png 768w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50-1536x851.png 1536w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-17-at-20.02.50-2048x1135.png 2048w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure>\n\n\n\n<div class=\"wp-block-media-text is-stacked-on-mobile\"><figure class=\"wp-block-media-text__media\"><a href=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-22-at-11.23.09-am.png\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"464\" src=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-22-at-11.23.09-am-1024x464.png\" alt=\"\" class=\"wp-image-10336 size-full\" srcset=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-22-at-11.23.09-am-1024x464.png 1024w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-22-at-11.23.09-am-300x136.png 300w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-22-at-11.23.09-am-768x348.png 768w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-22-at-11.23.09-am.png 1184w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure><div class=\"wp-block-media-text__content\">\n<p class=\"wp-block-paragraph\">There are a number of settings that can be adjusted to improve this segementation process<\/p>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-media-text is-stacked-on-mobile\" style=\"grid-template-columns:34% auto\"><figure class=\"wp-block-media-text__media\"><a href=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-23-at-9.33.27-am.png\"><img loading=\"lazy\" decoding=\"async\" width=\"586\" height=\"466\" src=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-23-at-9.33.27-am.png\" alt=\"\" class=\"wp-image-10342 size-full\" srcset=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-23-at-9.33.27-am.png 586w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-23-at-9.33.27-am-300x239.png 300w\" sizes=\"auto, (max-width: 586px) 100vw, 586px\" \/><\/a><\/figure><div class=\"wp-block-media-text__content\">\n<p class=\"wp-block-paragraph\">Cockatiel exports in five formats, including Elan&#8217;s .eaf, and .srt for subtitles.<\/p>\n<\/div><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Cockatiel can be used to transcribe local files, but <a href=\"https:\/\/archive.mpi.nl\/tla\/elan\" target=\"_blank\" rel=\"noreferrer noopener\">Elan<\/a> does that job already. More important is the ability for Cockatiel to draw a file from an online archive, make the file available for transcription, and then push it to a workspace for approval before being accepted into the archive. It will soon become part of the PARADISEC interface, available for logged-in users.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/cockatiel.crate-works.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Cockatiel<\/a> was developed by John Ferlito in a project led by Nick Thieberger with funding from the <a href=\"https:\/\/ldaca.edu.au\" data-type=\"link\" data-id=\"ldaca.edu.au\" target=\"_blank\" rel=\"noreferrer noopener\">Language Data Commons of Australia.<\/a> Code is available in <a href=\"https:\/\/github.com\/crate-works\/cockatiel\" target=\"_blank\" rel=\"noreferrer noopener\">github<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"<p>PARADISEC has digitised thousands of hours of legacy audio, the results of fieldwork by many researchers since the 1950s. The recordings are in a number of small languages (more than 1,400 languages are represented in the collection). For many of these recordings the only written information we have are scant notes on the tape cover. &#8230; <a title=\"Online media annotation\" class=\"read-more\" href=\"https:\/\/www.paradisec.org.au\/blog\/2026\/06\/online-media-annotation\/\" aria-label=\"Read more about Online media annotation\">Read more<\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[1],"tags":[],"class_list":["post-10313","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/10313","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/comments?post=10313"}],"version-history":[{"count":30,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/10313\/revisions"}],"predecessor-version":[{"id":10350,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/10313\/revisions\/10350"}],"wp:attachment":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/media?parent=10313"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/categories?post=10313"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/tags?post=10313"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}