File formats

File formats 2018-10-21T17:35:38+00:00

Preferred deposit formats for PARADISEC

PARADISEC has so far focussed on digitising audio tapes, but we intend to build up a repository of digital objects including all the textual material that is currently being produced for theses, dictionaries, grammars, articles and so on.

We are able to digitise audio tapes (reel to reel and cassette). We can arrange for analog movie formats to be digitised.

For existing digital material these are the formats for deposit:
Text: txt, xml, pdf, rtf (we can undertake to convert from MS Word to these formats if necessary)
Audio: DAT, .WAV (as close to the archival standard of 24bit, 96kHz as possible). At present we are not able to ingest 16bit 32khz (LP mode) but depositors should contact us to discuss these options. We have the capacity to ingest minidisc and can assist in archiving minidisc files.
Images: TIF/ TIFF. For scans of notes in colour:  300 dpi, scanning ratio: 100%, 8 bits per channel RGB, embedded ICC colour profile encouraged, OCR6 encouraged (PDF or PDF/A complies). For no colour then use grayscale (See the National Archives of Australia recommendations here)
For photographic negative or slide scans TIF, 600 dpi, 4000 pixels on the longest side, 48 bit colour RGB or 16 bit grayscale.
Video: mxf, mov, DV native format can be accessioned. We use DamSmart to transcode video to required formats.

Files created in Elan are in an xml format and the .eaf file can be deposited as it is, ideally with the  same name of the media file it is a transcript of.
Fieldworks files need to be exported to xml (one for the lexicon and one for the text). If you have bundled sound and image files into the FLEx project, they need to be separated as they are typically not in an archival format. We can archive a fwbackup file if it contains only text, but not if it contains audio and images.