{"id":10042,"date":"2025-04-29T12:01:42","date_gmt":"2025-04-29T02:01:42","guid":{"rendered":"https:\/\/www.paradisec.org.au\/blog\/?p=10042"},"modified":"2025-04-29T12:22:07","modified_gmt":"2025-04-29T02:22:07","slug":"redeveloped-uploading-for-paradisec","status":"publish","type":"post","link":"https:\/\/www.paradisec.org.au\/blog\/2025\/04\/redeveloped-uploading-for-paradisec\/","title":{"rendered":"Redeveloped uploading for PARADISEC"},"content":{"rendered":"\n<p>This week I received a set of six collections from <a href=\"https:\/\/jglobal.jst.go.jp\/en\/detail?JGLOBAL_ID=201801008838406440\">Masayuki Onishi.<\/a> Three were from his fieldwork, mainly in Bougainville (with <a href=\"https:\/\/catalog.paradisec.org.au\/collections\/MO02\" target=\"_blank\" rel=\"noreferrer noopener\">Baitsi<\/a>, <a href=\"https:\/\/catalog.paradisec.org.au\/collections\/MO03\" target=\"_blank\" rel=\"noreferrer noopener\">Naasioi<\/a>, and <a href=\"https:\/\/catalog.paradisec.org.au\/collections\/MO01\" target=\"_blank\" rel=\"noreferrer noopener\">Motuna (Siwai)<\/a>), two were his reworking of <a href=\"https:\/\/www.eoas.info\/biogs\/P001514b.htm\" target=\"_blank\" rel=\"noreferrer noopener\">Douglas Oliver&#8217;s<\/a> records dating back to the 1930s, one in <a href=\"https:\/\/catalog.paradisec.org.au\/collections\/DO02\" target=\"_blank\" rel=\"noreferrer noopener\">a range of languages<\/a>, and another on <a href=\"https:\/\/catalog.paradisec.org.au\/collections\/DO01\" target=\"_blank\" rel=\"noreferrer noopener\">Siwai<\/a> , and the sixth was from his colleague <a href=\"https:\/\/jglobal.jst.go.jp\/en\/detail?JGLOBAL_ID=201501015264417426\" target=\"_blank\" rel=\"noreferrer noopener\">Kazuya Inagaki<\/a> on <a href=\"https:\/\/catalog.paradisec.org.au\/collections\/KI01\" target=\"_blank\" rel=\"noreferrer noopener\">Nagovisi<\/a>. Masa had followed our guidelines and we had discussed how to create his collections, so he had a spreadsheet of metadata ready and also had prepared the signed deposit forms. All of this meant it took little effort to get his records into PARADISEC. Our catalog imports the spreadsheet and notifies us of any errors in the metadata that we can fix and then reupload. Within a couple of hours of receiving these 1,300 files they had been ingested and were open for users to explore, a testament to our new system that can operate with minimal manual labour. <\/p>\n\n\n\n<div class=\"wp-block-media-text is-stacked-on-mobile\" style=\"grid-template-columns:44% auto\"><figure class=\"wp-block-media-text__media\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"656\" src=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2025\/04\/DO02-002-10-1024x656.jpg\" alt=\"\" class=\"wp-image-10045 size-full\" srcset=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2025\/04\/DO02-002-10-1024x656.jpg 1024w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2025\/04\/DO02-002-10-300x192.jpg 300w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2025\/04\/DO02-002-10-768x492.jpg 768w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2025\/04\/DO02-002-10-1536x984.jpg 1536w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2025\/04\/DO02-002-10-2048x1311.jpg 2048w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure><div class=\"wp-block-media-text__content\">\n<p>Douglas Oliver left Masa with the task of editing and archiving his notes, including extensive comparative vocabularies of languages of Bougainville, written in tables in exercise books (as in this image). This collection contains the following items: <\/p>\n\n\n\n<p>\u2022. A collection of 40 recordings of oral tradition.<\/p>\n\n\n\n<p>\u2022 A comparative word list of Siwai (Motuna), Kunua (Konua), Nagovisi, Kieta (Naasioi) and Buin. Inculdes the cover (01), contents page (02), followed by 99 pages (03-100) of word lists arranged according to semantic categories.<\/p>\n<\/div><\/div>\n\n\n\n<p>\u2022 A comparative word list of Re&#8217;kona (Mono), Torao, Uruava, Banoni, Nasigo (Rotokas), Keriaka and Tapei (Eivo). Inculdes the cover (01), contents page (02), followed by 96 pages (03-98) of word lists arranged according to semantic categories.<br>\u2022  Phoneme inventories of Motuna (01, Nagovisi (02), Naasioi (03), Buin (04), Uruava (05), Torau (07), Konua (13), Keriaka (14), Rotokas (15), Eivo (16), Banoni (17) and Mono (18), with some comments (06, 11).<br>\u2022 28 pages of grammatical description, mainly of the morphology of the language (03-30).<br>\u2022 Notes on various aspects of Naasioi grammar.<br>\u2022 26 pages of grammatical description with exemplary short sentences.<br>\u2022 The paradigm of basic kinship terms of Motuna.<br>\u2022 Analysis of Siwai Verb-forms&#8217; in three pages.<\/p>\n\n\n\n<p>Over the past 2 years we have been redeveloping the PARADISEC system for ingesting new files. It now uses <a href=\"https:\/\/aws.amazon.com\/s3\/\" target=\"_blank\" rel=\"noreferrer noopener\">Amazon&#8217;s S3 <\/a>and its processing system (lambda) that starts as many processes as it needs, so there is no queue. Previously, we opened a sftp connection to a staging server that held the new items while they were processed: wav files have metadata added to a wrapper to make them BWF files, and an mp3 is generated for delivery; TIF files have a jpg for delivery; movie files are transcoded to mkv and mp4 for delivery. All filenames are checked against our accepted format. All this took time, and items were held in a queue until they had passed through each process. It all happens at once now, with large files (video for example) taking some time to process, but most files going straight in to the collection to be accessed according to the depositor&#8217;s intentions. <\/p>\n\n\n\n<p>Funding for this redevelopment work was provided by the <a href=\"https:\/\/www.grants.gov.au\/Ga\/Show\/a09260a0-0b4a-4c57-8d1a-cb32602e432e\" target=\"_blank\" rel=\"noreferrer noopener\">ARC LIEF grant <em>Modularised cultural heritage archives \u2013 future-proofing PARADISEC<\/em> (2022-2024).<\/a> Coding and implementation of the change to our systems is being done by John Ferlito.<\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This week I received a set of six collections from Masayuki Onishi. Three were from his fieldwork, mainly in Bougainville (with Baitsi, Naasioi, and Motuna (Siwai)), two were his reworking of Douglas Oliver&#8217;s records dating back to the 1930s, one in a range of languages, and another on Siwai , and the sixth was from &#8230; <a title=\"Redeveloped uploading for PARADISEC\" class=\"read-more\" href=\"https:\/\/www.paradisec.org.au\/blog\/2025\/04\/redeveloped-uploading-for-paradisec\/\" aria-label=\"Read more about Redeveloped uploading for PARADISEC\">Read more<\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[9,18,15,3],"tags":[],"class_list":["post-10042","post","type-post","status-publish","format-standard","hentry","category-archiving","category-news","category-png-linguistics","category-technology"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/10042","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/comments?post=10042"}],"version-history":[{"count":25,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/10042\/revisions"}],"predecessor-version":[{"id":10068,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/10042\/revisions\/10068"}],"wp:attachment":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/media?parent=10042"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/categories?post=10042"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/tags?post=10042"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}