Web Magazine for Information Professionals

Digitising an Archive: The Factory Approach

Duncan Burbidge describes a new approach to digitising an archive both as a future-proof substitute and for Web delivery.

The FP6 PrestoSpace Project [1] aims to develop systems that will permit quick, efficient and economically accessible preservation of analogue media [2].

Stream UK has built on the knowledge gained from three years of working on this project along with expertise from over seven years encoding within the industry to develop a complete encoding factory solution, based on the PrestoSpace project where the focus is on developing a semi-automated 'preservation factory' approach to preservation of audio-visual collections aimed at driving down the cost of digitising the archive below the 1€ per minute level.

The solution is housed in a stand-alone rack unit and includes barcode tracking of tapes through the system to allow archive owners to take an existing collection of off-line material and convert it into a high-quality archive and a Web-ready preview format including all existing metadata and maintaining an overview of the process via the barcode tracking system which produces a searchable interface that can be accessed via a LAN or Internet connection.

Each unit includes:

photo (312KB) : Figure 1: The Stream UK Encoding Factory

Figure 1: The Stream UK Encoding Factory

This article looks at:

Complete Tracking of the Process

During the encoding of an archive, maintaining control of the assets and an overview of the process is essential. Stream UK has developed an interface which allows:

Any tape that is introduced to the system goes through a five-step process:

  1. Any metadata that exists in electronic format is imported into the database. If this includes editing points then these will be recognised by the system.
  2. A barcode is added to the tape and scanned into the system.
  3. The tape is inserted into the player and the appropriate encoding profiles selected.
  4. When the tape has finished encoding (real-time) it is removed from the system.
  5. The status of the job is recorded on the board.

The Encoding Technology

The engine that powers the Stream UK Encoding Factory is based on the Digital Rapids [3] range of hardware encoders. These encoding cards are considered by experts to be the best solution for obtaining a digital format from a non-digital source and include the facility to ingest via SDI-capture [4].

screenshot (52KB) : Figure 2: The Encoding Administration Area

Figure 2: The Encoding Administration Area

The key video features of the hardware video processors include:

Different cards are available to support hardware MPEG-2 or MJPEG 2K encoding for the archive, with ingest being either via an in-built player or via inputs attached to the breakout panel.

All encoding is done in real-time, with deck-control allowing hands-free operation. The Factory will plug in directly to any robotics-based playout system.

screenshot (22KB) : Figure 3: Details of Individual Tapes

Figure 3: Details of Individual Tapes

Storage of the Archive

Storage of a multimedia archive needs to be appropriate to the use that media is put to. The increasing affordability of disk storage makes RAID5 an ideal option for both the high-quality archive and the Web-ready fomats.

According to the PrestoSpace Storage Media Quality Assurance report [5], RAID5 offers the ideal combination of access speed, price and reliability.

The Stream UK Encoding Factory comes with a standard 2 terabytes of storage included, enough for around 100 hours of archive footage encoding in both high-quality archive (MPEG-2 25Mpbps, for example) and preview formats.

Additional storage is available within the same chassis up to 1 petabyte.

Encoding Formats

Stream UK will work with the archive owner to develop a complete set of appropriate profiles for the encoding formats.

Any archive owner seeking to digitise will need to select formats that are appropriate to the end-user. Normally this means:

The Stream UK Encoding Factory supports all of the following formats, including support for Windows Media DRM. The table below provides a summary of the comparisons between the various formats:

Table 1 - Comparison of Encoding Formats

Format
Comments
Use
MPEG-1Part of the Motion Pictures Experts Group [6] set of formats, MPEG-1 has now been superceded by some of the later profiles.  It does still have advantages in terms of playability across a wide range of software clients, but it has been left behind in terms of quality - any preview material needs to be encoded, stored and delivered at a comparitively high bit-rate.MPEG-1 is good for preview encoding at bit-rates from 500Kbps to 2Mbps.
MPEG-2Still the benchmark for hig-quality archive encoding.  Although MPEG-2 is a lossy format (meaning that some of the original information is thrown away by the encoding process), data-rates over about 20Mbps result in excellent quality.  Parameters such as GOP need to be chosen with care. MPEG-2 is the standard for a wide range of applications across the entire A/V industry including all the transcoding software, playout over broadcast channels, editing suites etc.  Archive-quality footage can also be easily viewed on a standard PC.MPEG-2 is good for archive encoding at bit-rates from 8Mbps to 80Mbps.
MPEG-4This codec belongs to the next generation of encoding profiles, along with the common proprietory formats like Windows Media.  The quality is comparable and MPEG-4 is supported by a wide range of platforms. Digital rights management and copyright issues remains outstanding.MPEG-4 is good for preview encoding at bit-rates from 200Kbps to 2Mbps.
MJPEG 2KThis is a relatively new codec and it is only in 2006 that some of the major encoding card manufacturers began to implement the scheme within hardware encoding cards. Encoding in MJPEG 2K is lossless, which is attractive to many archives.  Playback is not yet possible on normal PCs but transcoding gives excellent results.MJPEG 2K results in bit-rates of around 80Mbps.
Windows Media 9Windows Media 9 is the proprietory codec with the highest market penetration, it can be viewed by around 73% of all Internet-connected computers.  This codec is also very useful for preview-quality encoding since the DRM capabilities are the most advanced and allow content owners to maintain control of assets even through open systems. Later versions of editing software can edit this format natively.Windows Media is good for preview encoding at bit-rates from 200Kbps to 2Mbps.
QuicktimeThere is no single QuickTime format, but the .mov file extension includes a range of codecs of which the Sorenson 3 video codec and the Qdesign2 audio codec are leading the way.  Quality is comparable with Windows Media and Mpeg-4 although a shade lower for most types of footage. QuickTime is easily viewable by those with Apple OS, something that is often attractive to the creative or educational sectors. Editing support is good from all major providers, as is transcoding support.Suitable for both preview quality and archive quality at a range of bit-rates from 200Kbps to 30Mbps. The QuickTime formats are well supported by the professional edit suites available. Quality of transcoding is not as high as for MPEG-2 and MJPEG 2K.

Conclusion

The conversion of an archive into digital format is one of the most important steps that the owner will ever take. Essential to this process is the correct choice of process and equipment. The Stream UK Encoding Factory allows archive owners to take control over the choices presented to them and to use an expert resource before and during the process. The solution as presented is most suitable for archives that have between 50 and 500 hours of footage to digitise, although multiple units can be integrated for larger archives.

References

  1. PrestoSpace Project http://www.prestospace.org
  2. Daniel Teruggi, "Can We Save Our Audio-visual Heritage?", April 2004, Ariadne Issue 39.
    http://www.ariadne.ac.uk/issue39/teruggi/
  3. Digital Rapids PCI cards http://www.digital-rapids.com/Products_PCICards.html
  4. Editor's note: SDI = Serial Digital Interface
  5. PrestoSpace Storage Media Quality Assurance report http://www.prestospace.org/project/deliverables/D12-6.pdf
  6. Motion Pictures Experts Group: http://www.mpeg.org

Author Details

Duncan Burbidge
Director
Stream UK Media Service Limited

Email: duncan@streamuk.com
Web site: http://www.streamuk.com

Return to top