Preferred deposit formats for PARADISEC
PARADISEC has so far focussed on digitising audio tapes, but we intend to build up a repository of digital objects including all the textual material that is currently being produced for theses, dictionaries, grammars, articles and so on.
We are able to digitise audio tapes (reel to reel and cassette). We can arrange for analog movie formats to be digitised.
For existing digital material these are the formats for deposit:
Text: txt, xml, pdf, rtf (we can undertake to convert from MS Word to these formats if necessary)
Audio: DAT, .WAV (as close to the archival standard of 24bit, 96kHz as possible). At present we are not able to ingest 16bit 32khz (LP mode) but depositors should contact us to discuss these options. We have the capacity to ingest minidisc and can assist in archiving minidisc files. We will not archive hundreds of small audio files, please concatenate them into a larger wav file before depositing.
Images: TIF/ TIFF. For scans of notes in colour: 300 dpi, scanning ratio: 100%, 8 bits per channel RGB, embedded ICC colour profile encouraged, OCR6 encouraged (PDF or PDF/A complies). For no colour then use grayscale (See the National Archives of Australia recommendations here)
For photographic negative or slide scans TIF, 600 dpi, 4000 pixels on the longest side, 48 bit colour RGB or 16 bit grayscale.
Video: mxf, mov, mp4, DV native format can be accessioned (and possibly other RAW formats). We use an external company, DamSmart, to transcode video to required formats, but this represents a cost to us that we need to have funding to carry out.
Files created in Elan are in an xml format and the .eaf file can be deposited as it is, ideally with the same name of the media file it is a transcript of.
Fieldworks (FLEx) files need to be exported to xml (one for the lexicon and one for the text). If you have bundled sound and image files into the FLEx project, they need to be separated as they are typically not in an archival format. We can archive a fwbackup file if it contains only text in xml format (its native format) , but not if it contains audio and images.