PARADISEC Activity Update – August 2021

PARADISEC operations are proceeding relatively normally around lockdowns and working from home. The collection now houses 135 TB of records, having averaged about 102 GB per day of data added to the archive throughout 2021 so far.

Continuing to dig into some figures, the 343,519 essence objects (i.e. files) have an average size of about 390 MB. Comparing back to May 2017, the average filesize then was about 163 MB, so our average filesize has grown by about 1 MB per week over the past 4.5 years.

Our colleagues at the Centre of Excellence for the Dynamics of Language (CoEDL) have been busy as usual, continuing to add to the many valuable collections they have contributed to the archive. In addition, CoEDL Senior Data Manager Julia Miller has recently created some fantastic technical guides for depsitors and other catalog users; some are still in progress, but all are available to view and use here: https://paradisec-archive.github.io/PARADISEC_workflows/

Happy Archiving!