{"id":3515,"date":"2006-12-11T15:35:19","date_gmt":"2006-12-11T15:35:19","guid":{"rendered":"http:\/\/www.paradisec.org.au\/blog\/2006\/12\/barking-up-the-same-tree-the-need-for-digital-archives\/"},"modified":"2011-02-05T07:47:05","modified_gmt":"2011-02-05T07:47:05","slug":"barking-up-the-same-tree-the-need-for-digital-archives","status":"publish","type":"post","link":"https:\/\/www.paradisec.org.au\/blog\/2006\/12\/barking-up-the-same-tree-the-need-for-digital-archives\/","title":{"rendered":"Barking up the same tree: the need for digital archives"},"content":{"rendered":"<p>The surprise for me from the <a href=\"http:\/\/conferences.arts.usyd.edu.au\/index.php?cf=11\">Sustainable Data from Digital Fieldwork<\/a> workshop (aka <em>Suzzy Data<\/em>..) was how much plant taxonomists and field linguists have in common.  And how much we need to work together with librarians and archivists.  We both have to look after records &#8211; the decaying recordings of the languages, and the dried specimens in the herbariums.  We both work with the living communities, the trees that will get logged and the communities that live with the trees, and the families and children who will switch to speaking another language.<\/p>\n<p><!--more--><br \/>\nBotanists and linguists both have overlapping communities of users and communities of creators and managers.  Local people create linguistic data in that they are speakers.  Local people manage trees and plants in their areas, and are creators of information through their traditional ecological knowledge.  Linguists and botanists create records and analyses of data.  And all of us are users &#8211; since we all want access to each others&#8217; information.<br \/>\nWe&#8217;re both interested in <em>sustainability<\/em> and <em>collaboration<\/em>.   I was struck by <a href=\"http:\/\/hdl.handle.net\/2123\/1298\">Murray Henwood&#8217;s<\/a> description of how Dr Carrot visits a herbarium,  is immediately taken to the shelves of long flat boxes to admire the specimens belonging to the carrot family, whereupon (s)he writes comments on the information sheets associated with those specimens.  That&#8217;s a good way of building knowledge and having quality control.<br \/>\nCollaboration is also involved in data collecting &#8211; <a href=\"http:\/\/hdl.handle.net\/2123\/1292\">Barry Conn<\/a> talked about how his and Damas Kipiro&#8217;s project on documenting trees of Papua New Guinea is based on collaboration, and the need for:<br \/>\n&#8226; an outcome <b>for<\/b> the people of Papua New Guinea who need to make environmental decisions such as where and how to log trees.<br \/>\n&#8226; work to be done <b>by<\/b> people in Papua New Guinea. You shouldn&#8217;t need a PhD in botany to be able to identify a commercially valuable species and present a case for how it should be managed.  But a lack of such skills is hampering local people&#8217;s abilities to make environmental decisions.  And what do you do if you have better access to a bush knife than to a microscope? There are obvious parallels for linguists.<br \/>\nBarry also talked about <em>controlled vocabulary<\/em> &#8211; something which botanists have long realised is essential for taxonomy,  and which is a mark of their collaboration.  It is also important for building software for learner taxonomists to enter data.   <a href=\"http:\/\/hdl.handle.net\/2123\/1297\">Ronald Schroeter and Nick Thieberger<\/a> raised this later when talking about linguistic software.  We really need templates with controlled tier labels in interlinearising software like <a href=\"http:\/\/www.mpi.nl\/tools\/elan.html\">ELAN<\/a> (The <a href=\"http:\/\/childes.psy.cmu.edu\/\"> CHILDES program CLAN<\/a> already has recommended tier labels) so that material can be translated into other software programmes without too much fuss (and see Bruce&#8217;s post on this <a href=\"\/blog\/2006\/12\/suzzy-data-workshop-guest-blogger-bruce-birch\/\">here<\/a>).  We also need more agreement on glossing conventions (<a href=\"http:\/\/linguistlist.org\/emeld\/gold-ns\/index.cfm\">E-MELD GOLD<\/a> and the <a href=\"http:\/\/www.eva.mpg.de\/lingua\/files\/morpheme.html\"> Leipzig glossing conventions<\/a> are a start).  Botanists are way way ahead of linguists on this one.<br \/>\nSo what is <em>sustainable digital data<\/em>?  It&#8217;s having a good way to keep the records of data (Murray Henwood mentioned the loss that befell botany on 1 March 1943 when the <a href=\"http:\/\/www.bgbm.org\/BGBM\/research\/colls\/herb\/hist2.htm\">Berlin Herbarium<\/a> and its 4 million specimens were bombed).  Digital objects need cataloguing but ideally also have linking from the catalogue entry direct to the digital object.  We need to collaborate with librarians and archivists on this, as they have been developing &#8220;Digital Assets Management Systems&#8221;.<br \/>\nOne such system is the  open source <a href=\"http:\/\/www.dspace.org\/\">DSpace<\/a>, developed at MIT, which provides a kind of permanent URL (&#8216;handle&#8217;) for digital material.   Interesting pilots of this were shown.  Su Hanfling of the University of Sydney showed the pilots of <a href=\"http:\/\/hdl.handle.net\/2123\/1298\"> <em>eBot<\/em> and  <em>eFlora<\/em><\/a>.  <em>eBot<\/em> is a &#8220;digital repository of botanical objects&#8221; &#8211; mostly photos and their descriptions &#8211; with an illustrated glossary of botanical terms.  <em>eFlora<\/em> is an &#8220;electronic compendium of the plants of the Sydney region&#8221;.  Kim Mackenzie and Murray Garde showed ANU&#8217;s  <em><a href=\"http:\/\/www.anu.edu.au\/bidwern\/\">Bidwern<\/em><\/a>  in which digital photographs, videos and text material related to communities in Western Arnhem Land are stored in DSpace.<br \/>\nSustainable digital data also requires having good ways of <em>accessing<\/em> the records of the data and the digital objects themselves.  If people (local people and researchers) don&#8217;t access the material, then sooner or later someone will decide it&#8217;s not worth keeping (Barry Conn&#8217;s sad story of how three years of plant records disappeared because of such a decision).  All the DSpace projects have interesting user interfaces   The <em>Bidwern<\/em> material will be linked and accessible through an interface involving Google maps as well as a thesaurus based on Murray Garde&#8217;s Bininj Gun-wok dictionary.  <em>Bidwern<\/em> and <em>eFlora<\/em> both have interfaces designed for browsing users.  <em>eBot<\/em> potentially is also an interface for creators.  Very soon Dr Carrot will be able to visit the virtual herbariums and look at images of specimens and add notes online.<br \/>\nThese DSPace projects are still in prototype stage.  A project which is actually up and working is the <a href=\"http:\/\/qenaga.org\/index.cfm\">Dena&#8217;ina Qenaga web site<\/a>, run as a collaboration between Dena&#8217;ina people, <a href=\"http:\/\/www.uaf.edu\/anlc\" target=\"new\">Alaska Native Language Center<\/a>, <a href=\"http:\/\/www.alaskanative.net\" target=\"new\">Alaska Native Heritage Center<\/a>, <a href=\"http:\/\/www.linguistlilst.org\" target=\"new\">The LINGUIST List<\/a>, and the <a href=\"http:\/\/www.arsc.edu\" target=\"new\">Arctic Region Supercomputing Center<\/a>.  It&#8217;s an elegant introduction to aspects of the Dena&#8217;ina language and people.  But behind it all is an archive &#8220;which provides digital access to more than five hundred documents and recordings relating to the Dena&#8217;ina language, including nearly everything written in or about Dena&#8217;ina language.&#8221;  Very nice!<br \/>\nThe Dena&#8217;ina Qenaga project was funded by the National Science Foundation of the USA.  The  importance of this kind of e-humanities\/e-social sciences work  is recognised by the US Government &#8211; there&#8217;s a <a href=\"http:\/\/www.neh.gov\/grants\/guidelines\/Digital_Partnership.html\">program<\/a> funded through the  National Endowment for Humanities and the Institute of Museum and Library Services to carry out this kind of e-humanities work (<em>thanks Kimberley!<\/em>.  They &#8220;encourage projects that explore new ways to share, examine, and interpret humanities collections in a digital environment and to develop new uses and audiences for existing digital resources&#8221;.  Lucky Americans!<br \/>\nWhat Australia needs is long-term funding for digital archives to collaborate with users and creators to make archivally stable digital objects, make them accessible, and preserve them. E-Science, E-Humanities, E-Social Sciences, we&#8217;re all after the same thing.  Please?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The surprise for me from the Sustainable Data from Digital Fieldwork workshop (aka Suzzy Data..) was how much plant taxonomists and field linguists have in common. And how much we need to work together with librarians and archivists. We both have to look after records &#8211; the decaying recordings of the languages, and the dried &#8230; <a title=\"Barking up the same tree: the need for digital archives\" class=\"read-more\" href=\"https:\/\/www.paradisec.org.au\/blog\/2006\/12\/barking-up-the-same-tree-the-need-for-digital-archives\/\" aria-label=\"Read more about Barking up the same tree: the need for digital archives\">Read more<\/a><\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[9],"tags":[],"class_list":["post-3515","post","type-post","status-publish","format-standard","hentry","category-archiving"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/3515","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/comments?post=3515"}],"version-history":[{"count":1,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/3515\/revisions"}],"predecessor-version":[{"id":4389,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/3515\/revisions\/4389"}],"wp:attachment":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/media?parent=3515"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/categories?post=3515"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/tags?post=3515"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}