{"id":5102,"date":"2011-04-04T01:00:06","date_gmt":"2011-04-03T14:00:06","guid":{"rendered":"http:\/\/www.paradisec.org.au\/blog\/?p=5102"},"modified":"2011-04-09T23:12:04","modified_gmt":"2011-04-09T12:12:04","slug":"theyre-out-to-get-you-or-your-data-at-least","status":"publish","type":"post","link":"https:\/\/www.paradisec.org.au\/blog\/2011\/04\/theyre-out-to-get-you-or-your-data-at-least\/","title":{"rendered":"They&#8217;re out to get you (or your data at least)"},"content":{"rendered":"<p>A couple of years ago I wrote a <a href=\"http:\/\/www.paradisec.org.au\/blog\/2008\/07\/copy-right\/\">blog post<\/a> about Professor Phillip M Parker PhD, a Professor of Marketing in France who had established a website called <i>Webster\u2019s Online Dictionary<\/i> that contained materials on endangered languages taken from copyrighted sources. ((After much discussion (see the 19 comments on my post), Professor Parker appeared to take on board feedback from a number of researchers about apparent violations of intellectual property rights and moral rights, and wrote that he was planning to write an \u201cOpen Letter to Field Linguists\u201d and a guide to \u201cCopyrights and Moral Rights for Languages and their Translations\u201d &#8212; neither has so far been published.)) Parker also published a set of books based on materials taken from copyrighted websites, such as <i>Webster&#8217;s Kamilaroi-English Thesaurus Dictionary<\/i>. ((This is now showing as &#8220;out of print&#8221; at <a href=\"http:\/\/www.amazon.com\/Webster%C2%92s-Kamilaroi-English-Thesaurus-Dictionary\/dp\/0497835398\">Amazon.com<\/a> but is still available as an e-Book from the <a href=\"http:\/\/www.bookdepository.co.uk\/book\/9780497835392\/Websters-Kamilaroi---English-Thesaurus-Dictionary\">Book Depository<\/a> among other sources.))<\/p>\n<p>Well, it looks like someone else is also harvesting data on languages from copyrighted sources without attribution. This is the <a href=\"http:\/\/panlex.org\/\">PanLex project<\/a> funded by the <a href=\"http:\/\/utilika.org\/info\/research.shtml\">Utilka Foundation<\/a> that:<\/p>\n<blockquote><p>\n&#8220;gathers knowledge about all the words in all the languages of the world, so that any word may be translated into any language, a step toward panlingual communication. For this work we consult multilingual, bilingual, and monolingual resources named \u201cdictionaries\u201d, \u201cthesauri\u201d, \u201clexical databases\u201d, \u201cwordnets\u201d, \u201cglossaries\u201d, \u201cterminologies\u201d, \u201cvocabularies\u201d, and \u201cword lists\u201d, as well as individuals.&#8221;\n<\/p><\/blockquote>\n<p>Although the website gives a <a href=\"http:\/\/utilika.org\/info\/plrefs.shtml\">list<\/a> of &#8220;the resources we are now consulting&#8221;, a simple search using the <a href=\"http:\/\/panlex.org\/demo\/treng.html\">TerraDict<\/a> tool shows that in fact unlisted materials are also being used. I searched for &#8220;left&#8221; in the Dieri (Diyari) language (which I have worked on for the past 35 years) and got the following result (click image to enlarge it):<\/p>\n<p><a href=\"http:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2011\/04\/TeraDict_Diyari.png\"><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2011\/04\/TeraDict_Diyari-300x158.png\" alt=\"\" title=\"TeraDict_Diyari\" width=\"300\" height=\"158\" class=\"aligncenter size-medium wp-image-5113\" srcset=\"https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2011\/04\/TeraDict_Diyari-300x158.png 300w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2011\/04\/TeraDict_Diyari-1024x540.png 1024w, https:\/\/www.paradisec.org.au\/blog\/wp-content\/uploads\/2011\/04\/TeraDict_Diyari.png 1261w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<p>This can only have come from the vocabulary list in my 1981 book <i>A Grammar of Diyari, South Australia<\/i> (published by Cambridge University Press) because it is only in that book that I used the letter <b>d<\/b> for the trill sound &#8212; in later publications I used <b>rrh<\/b>. This word would now be spelled as <b>warrangantyu<\/b> in the orthography that the Dieri Aboriginal community prefers. QED, the Diyari material has been nicked without attribution from my copyrighted book.<\/p>\n<p>Johnathan Poole, President of the <i>Utilika Foundation<\/i>, realises they are playing fast and loose here as the following statements from the <a href=\"http:\/\/utilika.org\/org\/minutes-2011.xhtml\">minutes<\/a> of their 2011 Annual Meeting held just last month make clear (note the last sentence in particular):<\/p>\n<blockquote><p>\n&#8220;intellectual-property obstacles to the expansion of PanLex have not yet been a major problem. If they prevented us from using one resource, we could move on to the next. The creators of many resources assert rights that, taken literally, would prohibit a person reading a resource from later making use of what he or she had learned from it. From the beginning of the project, I have considered such usage prohibitions unenforceable, and I have considered our use of any resource to be the recording of facts asserted by it, in a novel form, not the creation of a copy of it and thus not copyright infringement. &#8230; I believe that our normalization, structuring, and selective use of published data, combined with our provision of links to the original data, will satisfy most content creators. However, the inclusion of funds for legal services in the 2012 budget reflects an assumption that intellectual-property issues, as well as contractual issues more generally, will likely become more complex as resource deployment progresses.&#8221;\n<\/p><\/blockquote>\n<p>Well, as far as I can see there is no &#8220;complex[ity]&#8221; surrounding &#8220;intellectual-property issues&#8221; here &#8212; the Diyari materials (and possibly lots more on lots more languages) are copyright and subject to fair dealing. Anything else is theft.<\/p>\n<hr>\n<p><b>PS<\/b>: Thanks to David Nathan for passing on pointers to the PanLex project, including the Annual Meeting minutes quoted here. He bears no responsibility for the content of this blog post.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A couple of years ago I wrote a blog post about Professor Phillip M Parker PhD, a Professor of Marketing in France who had established a website called Webster\u2019s Online Dictionary that contained materials on endangered languages taken from copyrighted sources. ((After much discussion (see the 19 comments on my post), Professor Parker appeared to &#8230; <a title=\"They&#8217;re out to get you (or your data at least)\" class=\"read-more\" href=\"https:\/\/www.paradisec.org.au\/blog\/2011\/04\/theyre-out-to-get-you-or-your-data-at-least\/\" aria-label=\"Read more about They&#8217;re out to get you (or your data at least)\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[37,33,38],"tags":[],"class_list":["post-5102","post","type-post","status-publish","format-standard","hentry","category-copyright","category-endangered-languages","category-intellectual-property-rights"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/5102","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/comments?post=5102"}],"version-history":[{"count":31,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/5102\/revisions"}],"predecessor-version":[{"id":5240,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/posts\/5102\/revisions\/5240"}],"wp:attachment":[{"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/media?parent=5102"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/categories?post=5102"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.paradisec.org.au\/blog\/wp-json\/wp\/v2\/tags?post=5102"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}