The July edition of LIP was led by David Gil from the Max Planck Institute for the Science of Human History, Jena. The night was well attended with representatives from all the usual suspects: University of Melbourne, Monash, and La Trobe. Attendees this month also came from the University of New England, ANU, as well as from SOAS, London and NTNU, Norway. The evening’s discussion centred on issues related to Malay and Indonesian languages and varieties, but also included discussion of language documentation and description in general. Continue reading ‘The challenge that language variation poses to language description – a LIP recap’ »
Archive for the ‘Fieldwork’ Category.
Keeping track of what is recorded in the course of fieldwork is critical, both for your own future work and for longterm archiving. Recordings of dynamic performance (audio or video) are easy to misplace or misidentify and very difficult to locate once you forget what a file was named and what you recorded on a particular day. We ran a survey about how people record their metadata from January 21st to April 25th, 2016 and had 142 responses (see also the earlier blog post here). There were two multiple choice questions each allowing selection of more than one checkbox and the entry of free text responses. I can send the full results of the survey on request. This information will help inform the development of new tools for metadata entry. The responses are summarised below.
HyperCard (HC) was a brilliant program that came free with every Macintosh computer from 1987 and was in development until around 2004. It made it possible to create multimedia ‘stacks’ (of cards) and was very popular with linguists. For example, Peter Ladefoged produced an IPA HyperCard stack and SIL had a stacks for drawing syntactic trees or for exploring the history of Indo-European (see their listing here). Texas and FreeText created by Mark Zimmerman allowed you to create quick indexes of very large text files (maybe even into the megabytes! Remember this is the early 1990s). I used FreeText when I wrote Audiamus, a corpus exploration tool that let me link text and media and then cite the text/media in my research.
My favourite HC linguistic application was J.Randolph Valentine’s Rook that presented a speaker telling an Ojibwe story (with audio), with interlinear text linked to a grammar sketch of the language. I adapted that model for a story in Warnman, told by Waka Taylor, and produced as part of a set of HC stacks called ‘Australia’s languages’ and released in 1994. Continue reading ‘Reading HyperCard stacks in 2016’ »
In the spirit of solving small frustrations I offer my weekend experience of getting Toolbox files into Elan. I have over a hundred texts in Nafsan, most of which are time-aligned and interlinearised. I am working with Stefan Schnell on adding GRAID annotation to some of these texts and the preferred way of doing this is in Elan, with the GRAID annotation at the morphemic-level. I tried importing Toolbox files using the Elan ‘Import’ menu, and had listed all field markers in Toolbox, together with their internal dependencies (which should then map to Elan’s relationship between tiers). These settings are stored in an external file. Unfortunately, the import failed several times, despite changing the settings slightly after each attempt. Continue reading ‘Toolbox to Elan’ »
Jonathan Schlossberg recaps the April Linguistics in the Pub, a monthly informal gathering of linguists in Melbourne to discuss topical areas in our field.
Topic: Elicitation Methods
In 2011, LIP ran a discussion on techniques and activities used in the field by linguists to elicit particular grammatical phenomena, compare cognition across languages or simply record naturalistic talk-in-interaction. What is new today? We would like to follow on the same idea and give the opportunity to present activities which were successful or unsuccessful in the field. Of particular interest would be activities using grammaticality judgments or aimed at analysing semantic functions, such as aspect.
A small but dedicated cohort representing linguists from Melbourne’s three linguistics departments showed up at April’s LIP to discuss elicitation methodologies, moderated by Giordana Santosuosso.
We are in the process of identifying gaps in tools for fieldwork and data analysis that can be filled as part of the Centre of Excellence for the Dynamics of Language. I’d like to ask for your input into the requirements for a metadata entry tool. In part, this analysis asks for your opinions on the value of existing tools (listed below) and their relative strengths and weaknesses, and asks if it may be worth putting effort into developing any of them further, rather than starting from scratch.
The high-level requirement of this tool is to make it easy to describe files created in fieldwork, to be available both off- and on-line and to deliver the description as a text file for upload to an archive. This includes capturing as much metadata from the files themselves; providing controlled vocabularies of terms to select from (preferably via drag-and-drop rather than keyboard entry); allowing the metadata to be exported in a range of formats to suit whichever archive will host the collection; allowing the metadata to be imported to the tool for use by collaborative team members; allowing controlled vocabularies to be amended to suit the local situation. This tool could also allow users to visualise the state of a collection: which media files have been transcribed, which have been interlinearised, have text files been scanned, OCRed …. what other processes have been applied, which have been archived, what the rights are for each file, also allowing the user to specify what these criteria are for their own type of collection.
You can either add comments below, or else write to me separately (thien [at] unimelb.edu.au) with your ideas that can contribute to how we develop this tool.
Jonathan Schlossberg recaps last week’s Linguistics in the Pub, a monthly informal gathering of linguists in Melbourne to discuss topical areas in our field.
Linguistics in the Pub on Wednesday 29th of October, 2014 centred around the theme: Issues in the documentation of newer varieties. Felicity Meakins (University of Queensland) led the discussion. The announcement and short background reading are here.
This session marked the 5th anniversary of Linguistics in the Pub. Organiser Ruth Singer would like to extend a thank you very much to all participants, including ‘retired’ co-organiser Lauren Gawne. Lauren’s gap has been partly filled by the Monash PhD students coalition: Harriet Shepard, Jonathon Lum, Alan Ray and Jonathan Schlossberg (University of Newcastle) will be co-organising when they are not in the field. Interstate/international visitors – don’t forget let me know when you’re coming to Melbourne so we can have you along too!
Continue reading ‘Issues in the documentation of newer language varieties’ »
From Myfany Turpin
The names for ‘sand goanna’ (Varanus gouldii) in the languages of areas where they are found often correspond to two ethnospecies. Photographed here are the small arlewatyerre and the large aremaye, both from near Barrow Creek, NT, as they are called in Arandic languages (Arrernte, Kaytetye, Anmatyerr and Alyawarr). On this day my companions successfully hunted both in close proximity, so I thought I’d see if there were differences in the scientific taxonomy that could improve my translations of ‘small sand goanna’ and ‘large sand goanna’ respectively.
Continue reading ‘Myfany Turpin on Sand goannas in central Australian languages –’ »
David Nathan writes
EL Publishing is a new online publisher which was launched on 18th July and which will publish a journal, multimedia, and monographs, focussing on documentation and description of endangered languages. EL Publishing has an international editorial board and operates a fully double-blind peer-review process for all submitted materials.
Sharing the load? Problems with the ‘lone depositor’ model for the archiving of materials in endangered language archives
Ruth Singer recaps last week’s Linguistics in the Pub, a monthly informal gathering of linguists in Melbourne to discuss topical areas in our field.
Traditionally collections in endangered languages archives are identified with a single depositor. This depositor is typically a researcher, who is not a member of the community in which the recordings were made. This depositor decides on access restrictions to the materials, ideally in consultation with the community. There are a number of quite separate problems with this position, for those who manage archives and for those who find themselves in the position of lone depositor. In this era of collaborative fieldwork, we can also ask whether the lone depositor model is the best one for communities who speak endangered languages. One suggestion is to make collections open access so that the depositor does not need to be contacted. Another suggestion is to name a number of depositors for each collection, so that no single person has sole responsibility. In this LIP we will discuss potential solutions to the problems of the lone depositor model in the light of participants experiences as depositors and archivists.
Continue reading ‘Sharing the load? Problems with the ‘lone depositor’ model for the archiving of materials in endangered language archives’ »