Nick Thieberger

Pacific Manuscripts now in PARADISEC

22 June 2016 (updated 23 June 2016) by Nick Thieberger

After some discussion between PARADISEC and the Pacific Manuscripts Bureau (PAMBU) we now have access to linguistic records in the PAMBU microfilm collection, either for tagging in the PARADISEC catalog, or as digital versions of the microfilm in the PARADISEC collection.
Kylie Maloney at PAMBU kindly made available a list of about 70 items in PAMBU that have linguistic content. I sent this list to linguists interested in this field and got a priority list from them. PAMBU then entered into negotiations with their depositors to allow the microfilms to be digitised and produced as PDF files for distribution via PARADISEC’s repository.

Results of the metadata survey

6 June 2016 (updated 7 June 2016) by Nick Thieberger

Keeping track of what is recorded in the course of fieldwork is critical, both for your own future work and for long-term archiving. Recordings of dynamic performance (audio or video) are easy to misplace or misidentify, and very difficult to locate once you forget what a file was named and what you recorded on a particular day. From January 21st to April 25th, 2016, we ran a survey about how people record their metadata and had 142 responses (see also the earlier blog post here). There were two multiple-choice questions, each allowing selection of more than one checkbox and the entry of free-text responses. I can send the full results of the survey on request. This information will help inform the development of new tools for metadata entry. The responses are summarised below.

Chasing John Z’graggen’s records

28 February 2016 (updated 29 February 2016) by Nick Thieberger

This week a suitcase of audio tapes will arrive in Melbourne from Madang in PNG. While a lot of the effort of building collections in PARADISEC goes into finding tapes and encouraging people to deposit their recordings, there are some collections that stand out for the amount of work required. This is the story of one of them.

Reading HyperCard stacks in 2016

20 February 2016 (updated 4 November 2021) by Nick Thieberger

HyperCard (HC) was a brilliant program that came free with every Macintosh computer from 1987 and was in development until around 2004. It made it possible to create multimedia ‘stacks’ (of cards) and was very popular with linguists. For example, Peter Ladefoged produced an IPA HyperCard stack, and SIL had stacks for drawing syntactic trees or for exploring the history of Indo-European (see their listing here). Texas and FreeText, created by Mark Zimmerman, allowed you to create quick indexes of very large text files (maybe even into the megabytes! Remember this is the early 1990s). I used FreeText when I wrote Audiamus, a corpus exploration tool that let me link text and media and then cite the text/media in my research.

My favourite HC linguistic application was J. Randolph Valentine’s Rook, which presented a speaker telling an Ojibwe story (with audio), with interlinear text linked to a grammar sketch of the language. I adapted that model for a story in Warnman, told by Waka Taylor, which was produced as part of a set of HC stacks called ‘Australia’s languages’ and released in 1994.

Toolbox to Elan

1 February 2016 by Nick Thieberger

In the spirit of solving small frustrations, I offer my weekend experience of getting Toolbox files into Elan. I have over a hundred texts in Nafsan, most of which are time-aligned and interlinearised. I am working with Stefan Schnell on adding GRAID annotation to some of these texts, and the preferred way of doing this is in Elan, with the GRAID annotation at the morphemic level. I tried importing Toolbox files using the Elan ‘Import’ menu, and had listed all field markers in Toolbox, together with their internal dependencies (which should then map to Elan’s relationship between tiers). These settings are stored in an external file. Unfortunately, the import failed several times, despite changing the settings slightly after each attempt.
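For readers who have not looked inside a Toolbox file, here is a rough Python sketch of the idea of field markers and their dependencies. It is not the import I used and not Elan's own importer: the markers (\ref, \tx, \mb, \ge, \ft), the parent relations in PARENT, the function names and the sample text are all assumptions made up for illustration, and a real project would take them from its own Toolbox settings.

```python
from collections import defaultdict

# Hypothetical marker hierarchy: each marker maps to its parent marker
# (None marks the record-level marker). A real project's markers and
# dependencies come from its Toolbox settings, not from this table.
PARENT = {"ref": None, "tx": "ref", "mb": "tx", "ge": "mb", "ft": "ref"}

# Invented placeholder text, not real language data.
SAMPLE = """\\ref text01.001
\\tx invented words
\\mb invented word -s
\\ge made.up word -PL
\\ft invented words

\\ref text01.002
\\tx more
\\ft more
"""


def parse_toolbox(text):
    """Group backslash-marked lines into records, one per \\ref marker."""
    records = []
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("\\"):
            continue  # this sketch ignores blank and continuation lines
        marker, _, value = line[1:].partition(" ")
        if marker == "ref":
            current = defaultdict(list)
            records.append(current)
        if current is not None:
            current[marker].append(value.strip())
    return [dict(record) for record in records]


def tier_tree(parent_map):
    """Invert the child-to-parent table into parent-to-children lists."""
    children = defaultdict(list)
    for marker, parent in parent_map.items():
        if parent is not None:
            children[parent].append(marker)
    return dict(children)


if __name__ == "__main__":
    for record in parse_toolbox(SAMPLE):
        print(record["ref"][0], record.get("tx"))
    print(tier_tree(PARENT))
```

The parent-to-children table is the part that would correspond to tier relationships in Elan: each marker that depends on another becomes a child tier of it. Checking that every marker in a file appears in such a table is one cheap way to look for the kind of mismatch that can make an import fail.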

Songs of the Empty Place

14 July 2015 by Nick Thieberger

Jimmy Weiner and Don Niles have published Songs of the Empty Place: The Memorial Poetry of the Foi of the Southern Highlands Province of Papua New Guinea. This new book contains songs recorded by Weiner between 1979 and 1995 and can be downloaded from ANU E-Press here. All audio was digitised by PARADISEC and is available in the collection JW1. The songs are organised under three main categories: 7 Women’s Sago Songs (Obedobora), 44 Men’s Songs (Sorohabora), and 7 Women’s Songs (Sorohabora). They are accompanied by some 40 photographs.

Generating word forms

18 May 2015 by Nick Thieberger

Have you ever wanted to create a list of possible words in a language you are working on? Have you started creating a dictionary but now need to find words that are not yet recorded? This could be the app for you. Word Generator is a free web service that lets you upload a list …

Seeking your help with tool development

27 March 2015 (updated 1 May 2015) by Nick Thieberger

We are in the process of identifying gaps in tools for fieldwork and data analysis that can be filled as part of the Centre of Excellence for the Dynamics of Language. I’d like to ask for your input into the requirements for a metadata entry tool. In part, this analysis asks for your opinions on …

Grammar writing: where are we now?

27 February 2015 (updated 1 May 2015) by Nick Thieberger

Ruth Singer recaps last week’s Linguistics in the Pub, a monthly informal gathering of linguists in Melbourne to discuss topical areas in our field.

Linguistics in the Pub on Tuesday the 24th of February, 2015, centred on the theme of grammar writing. Harriet Sheppard (Monash University) led the discussion. The announcement and short background reading are here.

The descriptive grammar, although often reported to be dead, is a form of scholarship that is still very much alive. And although e-grammars are said to be the way of the future, most grammars still take the form of a hard copy, whether a PhD thesis or a published book. This session of Linguistics in the Pub was kicked off with a discussion of the article by Ulrike Mosel cited below, part of a special publication of LDC on grammar writing.

Playing texts and media—EOPAS again

19 July 2014 by Nick Thieberger

While I obviously like EOPAS as a model for corpus presentation (see the earlier blog post about it here), I found a renewed enthusiasm for it today as I was checking the meaning of a word in a text I was translating from South Efate. The word lunak does not appear in any of my notes nor in the dictionary, but appears a few times in a story told by the late Kalsarap Namaf. I wrote to Joel Kalpram, who is from Erakor village and speaks the language, and asked him if he knew the word.