HOME | ILLUSTRATIONS | HELP | CONTACT
 

Project Notes

 

Site Content

The Digital Bridges Web site consists of a collection thirty representative 19th century American bridge engineering monographs, manuals, and documents from the Lehigh University Libraries' Special Collections. Many of these items are relatively rare an in some cases quite fragile.

With the growing interest in material culture, technology studies, the history of engineering practices, and the renovation and restoration of historical structures, this site is designed to provide a rich, interactive research tool for students, historians, and engineering professionals.

The source documents have been scanned, converted to text, and partially-corrected to make all significant terms, personal, and proper names retrievable through the site search engine. In addition, all items in the collection have been structured for browsing in the manner of a book, with organization units such as chapters and subchapters randomly viewable, and with page-to-page forward and back links.

Project Background

Underwritten in part by Library Services & Technology Act (LSTA) funds administered through the Pennsylvania Commonwealth Library's NETShare program, the Digital Bridges Web site was developed by a team of librarians and technical staff within Lehigh University's Library and Technology Services organization, with significant contributions by several independent professionals with whom Lehigh University contracted for specific services.

Major development and production activities for the site occurred from May 2001 through March 2002, but a number of key site enhancements are still in process as of this writing (April, 2002). Members of the Digital Bridges project team include (individuals are affiliated with Lehigh University unless otherwise noted):

Marie Boltz, Library Assistant
Metadata Input

Ilhan Citak, Special Collections Assistant
Scanning & Production Management, Editorial Oversight

Jennifer Hanf and Andre Szejko of Hanf Design, Inc., Kutztown, Pennsylvania
Site Graphics and Interface Design

Joe Lucia, Director for Library Systems & Access
Project Planning & Administration, Technical Consulting

Tim McGeary, Library Systems Specialist
Server Adminsitration & Support, Ongoing Site Development

Judy McNally, Senior Cataloging Librarian
Metadata Development & Production, Site Quality Control

John McPherson, New Zealand Digital Library Project, University of Waikato
Greenstone Software Modification and Development

Philip Metzger, Curator of Special Collections
Project Planning, Materials Selection, Metadata Development, Documentation

John Misinco, Student Assistant
Document Scanning & Editing

George Motter, Senior Systems Specialist
System Administration, Programming, Technical Problem Solving

Svetlana Oshkai, Graduate Student Assistant
Production Process Development, Document Scanning & Editing

Christine Roysdon, Director for Collection Management
Project, Planning, Metadata Development & Production, Site Quality Control

Site Design & Development

Graphical and interface design for the site was provided by Hanf Design of Kutztown, Pennsylvania. Ornaments and illustrations on the site main page were drawn from volumes included in the online collection. The title pages from each volume in the collection include a representative illustration or "signature" design element from the printed book. Navigational graphics and menus were also developed to reflect the historical character of the collection. At the level of individual book and page views, the design goal was to preserve in the online form of each document as much as possible of the look and feel of the physical volumes themselves.

Production scanning of volumes for the site was completed using a Minolta PS 7000 overhead scanner. Archival scans of all items were made at 600 DPI and stored in TIFF format. From these sources files, 150 DPI (full page view) and 50 DPI (thumbnail) JPEG files were generated, along with PDF versions of these image files for facilitating print output by end-users. Much of the file conversion work was automated through use of a series of macro utilities. Optical character recognition (OCR) software was used to generate searchable text from the page scans. To facilitate text retrieval, rough edits (working from printed source documents) of the converted text were completed to make certain that all important words (personal names, bridge names, locations names, key engineering terms, etc.) were correct.

A series of Unix and Perl scripts were developed to "wrap" the raw page files in HTML. The Metabrowser software was used to apply descriptive and structural metadata to these HTML documents. Creating structural metadata to bring together internal organizational units from the printed volumes (chapters, subchapters, sections, pages, etc.) and render these usable by the site database engine proved to be an enormously complicated challenge. The converted text data from each scanned page was loaded into a metadata field in the source HTML to render it indexable by the site search engine.

Server Technical Information

The Digital Bridges site runs on a Dell Pentium III server class machine with 512MB of RAM and 30GB of redundant disk. The server operates under RedHat Linux version 7.2 and utilizes the Apache Web software for page service and the Greenstone Digital Library software for organization, indexing, retrieval, and dynamic page creation for materials in the online collection. Extensive local Perl and Unix shell scripts support generation of site pages within Greenstone. Information of the Greenstone software, and open source solution for digital library applications developed in New Zealand, can be found at http://www.greenstone.org/english/home.html

Ongoing Work

As of this writing, the site glossaries and the site list of illustrations are still under development. The organization and display of search results within the retrieval is also undergoing refinement.

Page information last updated: April 29, 2002 by J.Lucia