{"id":3556,"date":"2007-04-20T17:47:05","date_gmt":"2007-04-20T17:47:05","guid":{"rendered":"http:\/\/www.paradisec.org.au\/blog\/2007\/04\/when-is-a-linguists-work-done-and-dusted\/"},"modified":"2011-02-05T07:49:19","modified_gmt":"2011-02-05T07:49:19","slug":"when-is-a-linguists-work-done-and-dusted","status":"publish","type":"post","link":"https:\/\/www.paradisec.org.au\/blog\/2007\/04\/when-is-a-linguists-work-done-and-dusted\/","title":{"rendered":"When is a linguist&#8217;s work done and dusted?"},"content":{"rendered":"<p>There has been an interesting discussion on the LINGTYP linguistic typology list over the past week about publishing fieldwork data (archived <a href=\"http:\/\/listserv.linguistlist.org\/cgi-bin\/wa?A1=ind0704c&#038;L=lingtyp\">here<\/a>). David Gil<a href=\"http:\/\/listserv.linguistlist.org\/cgi-bin\/wa?A2=ind0704c&#038;L=lingtyp&#038;D=1&#038;F=&#038;S=&#038;P=1267\"> argued that<\/a>:<\/p>\n<blockquote><p>One&#8217;s collection of transcribed texts constitutes a set of complete objects, each of which could (if there were a willing publisher) stand alone as an electronic or hardcopy publication.  Barring the discovery and correction of errata, once the text is transcribed, that&#8217;s it, it&#8217;s done.<\/p>\n<blockquote>\n<p><!--more--><br \/>\nI responded that:<\/p>\n<blockquote><p>From my experience, and that of other researchers I have spoken to, understanding\/analysis of a given &#8220;text&#8221; (in the sense of inscription of a particular linguistic performance) evolves over time and is not &#8220;fixed&#8221; at any point, not even the point where it is &#8220;published&#8221; (in whatever version). Secondly, textual annotation (of which the &#8216;traditional&#8217; interlinear format is but one particular type) is hypertextual and, these days, multimedia in nature &#8211; this is hardly a new insight &#8211; see this <a href=\"http:\/\/kairos.technorhetoric.net\/3.1\/coverweb\/porush\/contra4.html\"><em>Kairos<\/em><\/a> article [<em>JHS: apologies to earlier readers &#8211; I caused an  html oblitto cut<\/em>] for a discussion of the hypertextual nature of annotation in the Talmudic tradition. Developments in Web 2.0 publishing also mean that multiple annotations of texts by multiple (distributed) contributors is now possible.<\/p><\/blockquote>\n<p>I pointed out that there are two excellent papers dealing with these topics that will be coming out in <em>Language Documentation and Description<\/em> Vol 4 to be published at SOAS later this month:<\/p>\n<ul>\n<li>Nick Evans and Hans-J\u00fcrgen Sasse &#8216;Searching for meaning in the Library of Babel: field semantics and problems of digital archiving&#8217;<\/li>\n<li>Anthony C. Woodbury &#8216;On thick translation in linguistic documentation&#8217;<\/li>\n<\/ul>\n<p>Both papers emphasise the ongoing, contingent, interpretive, hermeneutical quality of the documentation of languages, especially meaning in texts.<br \/>\nDavid Gil&#8217;s <a href=\" http:\/\/listserv.linguistlist.org\/cgi-bin\/wa?A2=ind0704c&#038;L=lingtyp&#038;D=1&#038;F=&#038;S=&#038;P=1584 \">reply<\/a> to my intervention included the following:<\/p>\n<blockquote><p>My claim is merely that with respect to texts, there still exists a kind of basic intuitive level of transcription plus annotation &#8212; comprising things such as orthographic transcription, phonetic transcription, interlinear gloss, free translation into English (or some other language) &#8212; that, once accomplished, provides a natural point at which the text may be published. Even if one chooses to add or amend things later.<\/p><\/blockquote>\n<p>Now, maybe my experience doing fieldwork and analysis over the past 35 years is unusual, but I have real doubts about the existence of such a &#8216;basic intuitive level of transcription plus annotation&#8217;, and real reservations about there being &#8220;a natural point at which the text may be published&#8221;. Transcription and analysis of texts, and their publication, involves a range of decisions taken at a particular point in time about what to include or not include. And we often publish because of pragmatic reasons such as there being an opportunity to do so, or because we are moving on to other projects, or we need the publication for a job or promotion. In addition, my experience in preparing my Jiwarli text corpus for publication (eventually published by Tokyo University of Foreign Studies in 2006 \u2013 email <b>pa2  AT soas.ac.uk<\/b> me if you want a free copy) was that serious editing in collaboration with my native-speaker consultant had to take place before he would approve publication of my transcriptions and translations of the stories we had recorded together. This included: deletion of repetitions and false starts, revisions to word order, insertion of contextual material, and elimination of loan words.<br \/>\nLooking back 11 years later I can imagine a whole set of different decisions that I would now make at various points about all areas of the published texts, from transcription to morpheme-by-morpheme glossing to running translation. What I think I understand about Jiwarli has been constantly changing over the years as I do more work on it and other languages.<br \/>\nSome researchers also seem to believe that there is a kind of endpoint when a whole language documentation is &#8216;done and dusted&#8217; as they say here in the UK. A draft paper entitled &#8220;Adequacy of Documentation&#8221; was discussed at the meeting of the Linguistic Society of America Committee on Endangered Languages and their Preservation in Los Angeles in January. In that paper, the authors wonder about: &#8220;the conditions that must be met for a language to be considered adequately documented&#8221;. They suggest it is possible to measure &#8220;how far along one has come in documenting a language&#8221; and to determine &#8220;how far there is to go&#8221; by using an &#8220;accounting function of analysis&#8221;.  So:<\/p>\n<blockquote><p>How do we know when we&#8217;ve gotten all the phonology? When we&#8217;ve done the phonological analysis and our non-directed elicitation isn&#8217;t producing any new phonology.  How do we know when we&#8217;ve gotten all the morphology? When we&#8217;ve done the morphological analysis, when our non-directed elicitation isn&#8217;t producing any new forms, and when \u2013 crucially for inflected languages \u2013 we have elicited all the implicit inflected forms that didn&#8217;t happen to come up in non-directed elicitation.<\/p><\/blockquote>\n<p>They recognise, however, that work on the lexicon and texts is very different but still insist that:<\/p>\n<blockquote><p>even in the more open-ended aspects of syntax and lexicon, we know we are coming to an endpoint when new constructions and lexical items become rarer and rarer in non-directed elicitation<\/p><\/blockquote>\n<p>Unfortunately, this begs the question of what we mean by &#8220;rarer and rarer&#8221;: is 1 million words of annotated text like the <a href=\" http:\/\/khnt.hit.uib.no\/icame\/manuals\/brown\/INDEX.HTM\">Brown Corpus<\/a> enough to judge? What about 100 million like the <a href=\" http:\/\/www.natcorp.ox.ac.uk\/\">British National Corpus<\/a>? Or the billion word <a href=\" http:\/\/www.askoxford.com\/oec\/mainpage\/?view=uk \">Oxford English Corpus<\/a>?<br \/>\nNow researchers studying lesser known languages may never record and analyse textual corpora of this size but I wonder if it is only exhaustion or death that brings a &#8220;natural point&#8221; at which the linguist&#8217;s work is done.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>There has been an interesting discussion on the LINGTYP linguistic typology list over the past week about publishing fieldwork data (archived here). David Gil argued that: One&#8217;s collection of transcribed texts constitutes a set of complete objects, each of which could (if there were a willing publisher) stand alone as an electronic or hardcopy publication. &#8230; <a title=\"When is a linguist&#8217;s work done and dusted?\" class=\"read-more\" href=\"https:\/\/www.paradisec.org.au\/blog\/2007\/04\/when-is-a-linguists-work-done-and-dusted\/\" aria-label=\"Read more about When is a linguist&#8217;s work done and dusted?\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[5],"tags":[],"class_list":["post-3556","post","type-post","status-publish","format-standard","hentry","category-linguistics"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/3556","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/comments?post=3556"}],"version-history":[{"count":2,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/3556\/revisions"}],"predecessor-version":[{"id":4561,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/3556\/revisions\/4561"}],"wp:attachment":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/media?parent=3556"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/categories?post=3556"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/tags?post=3556"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}