High Energy Physics Libraries Webzine |
|
HEP
Libraries Webzine
Issue 2 / October 2000
The First Shock
On a hot afternoon of August 1999 I retreat to the cool underground sector of CERN's library, to consult an issue of Physical Review D (PRD). On my way downstairs, I pay almost no attention to an apparently innocent sign posted on the wall, indicating that as of the beginning of August all issues after 1985 of Phys. Rev. and Phys. Rev. Letters (PRL) have been archived, as they are available from the Web site of the American Physical Society. Evidently my brain is not ready to receive this message, and presumably decodes its contents in a different way, as I continue my walk to the location where I confidently expect to find the PRD volumes. The surprise of not finding them there turns rapidly into anger, as I finally realize the meaning of the message upstairs. The following 20 minutes were spent in frustration, walking back upstairs to find an available terminal in the library, surfing my way through the PRD E-archives, trying (unsuccessfully) to print the paper on the local printer, browsing with mouse clicks through the paper in search of the relevant sections, and finally finding out that what I really needed was to befound in a Zeitschrift für Physik article quoted in the references. This implied logging out of the terminal and walking back downstairs, at the risk of having to repeat the whole exercise once more should the Zeitschrift für Physik reference have quoted some other interesting paper from PRD or PRL, as it surely did. A journal consultation which 2 weeks before would have taken 3 minutes, ended up taking over 20. Is this progress?
After that frustrating experience, I felt as if my car's maker had recalled all vehicles to replace the steering wheel with a mouse: click on the left button to steer left, click on the right one to go right. Memories of long afternoons spent in the library of my University surfaced: as a student, one of my most rewarding experiences was to randomly browse through old issues of physics journals, searching for to-me-unknown, but possibly interesting, articles. I then felt sorry for the future generations of students, who may not be given the pleasure of skimming through a full volume of PRD while holding it in their hands. How much time will be wasted consulting issues electronically, waiting for the PDF file to pop up so that we can gauge whether the promise issued in the title or in the abstract is held?
In
spite of my own personal frustration, it is quite clear even to me that
the electronic media revolution will affect publishing strategies,
the way people access scientific information, and the way
libraries function. What is, however, less clear to me is whether the changes
we are seeing right now in the commercial publishing arena are improving
our way of working, and justify an immediate transition away from the library
"as we know it''. It is important that library managers don't get
tempted by the potential for change which the new
technologies open to them, and maintain contact with the users' needs,
to ensure that the transition to the "virtual'' library willfulfil
everybody's dreams.
Potential Benefits
Neglecting
the advantages open to the library staff and administrators, let me consider
here the main potential benefits for the users
of a virtual library:
1. Scientists working in Institutes with limited resources and poor libraries may in principle have access to the same material as scientists working in richer environments.2. Electronic access to journal issues will allow us to consult articles from home or while travelling, connecting our laptops from the lounges of airports or from hotels.3. "Active'' documents, namely articles with internal links from text to equations, and external links from bibliographic items to their electronic counterpart, may make electronic consultation competitive with a live session in the library.
I should perhaps notice that points 1 and 2 are actually solved
by the fully-electronic journal JHEP (Journal of High Energy Physics),
managed and produced by SISSA. The presence of an outstanding editorial
board, and of a serious refereeing process, ensures a quality control equal
to that of the commercial journals.
On the other hand, the absence of subscription fees makes it universally
available. The implications of electronic-only journals go well beyond
the subject of this note, which is limited to the easyness of access and
of use, and I chose not to let them divert my main focus here.
One of the difficulties in making scientific papers easily readable on a computer screen is related to the way we read them. Very rarely does one go through them sequentially, line after line and page after page, as one would do with a normal written text. Forward and backward referencing to text, tables, figures and equations is very frequent. Several pages at once need to be under our eyes to compare equations, to follow the development of a calculation across pages, to compare the contents of different tables or figures. In spite of all bookmarking tools made available by the current software, nothing matches the power of our fingers to flip through real pages back and forth and to keep track of where we are. It will still be sometime before the temptation to send to the printer every article popping upon the computer screen will disappear. Until then, an enormous amount of paper is wasted every day. The time taken by an average printer to print e-versions of journal articles is also a bothersome limitation, at this stage.
With
the much-reduced burden of preparing papers for publication, given that
most of the typesetting is done by the authors, publishing companies should
at least earn their fees by producing more
versatile electronic versions of the articles. To the occasional
user, even locating the reference is an unnerving multi-step operation.
If one just logs on to a library computer with as only reference the standard
volume number, year, and page, it will be a while
before arriving at the paper, especially if useful bookmarks are lacking.
In
the case of papers stored in the Los Alamos archives,
access is instead immediate, as there is a standard, easy-to-remember URL,
to which it is sufficient to add the document number, given in a fixed,
standard format directly related to the reference itself (e.g.
http://arXiv.org/ps/hep-ph/0008001
for the first paper of Septbember 2000 in
the hep-ph archive).
The Reality
While preparing this article, I made some experiments from a terminal in the CERN library. I tried to access papers for which I had a precise reference, and I tried to follow a search path for works of which I knew only the author name and the journal. I started byusing the tool suggested by the CERN library, a search engine which points directly to a paper provided the exact reference is known (http://library.cern.ch/electronic_journals/ej.html). This turned out to be very effective in quickly reaching the documents. The problem came with looking at the documents. In the case of Nucl. Phys., B : 537 (1999) 443, it took 65 seconds for the 15-page document to appear. In the case of Nucl. Phys., B : 485 (1997) 291 I quit after a minute,with my browser indicating that 17 minutes and 29 seconds were still remaining to download the document (130 pages, admittedly long; however, when I look in a journal the time it takes to find the paper in the issue usually gets shorter the longer the paper!).
I repeated this exercise, with the help of the Library staff, over
the following days and at different times. In the case of the PRD server,
we noticed similar performances during the day times, but much faster response
after 7:30pm (CERN time). Phys. Rev., D : 50 (1995) 2966 was downloaded
in about 5 minutes at this time of the day. In the case of Nuclear Physics,
the behaviour was more random, with peak performances of one minute to
download Nucl. Phys., B : 485 (1997) 291.
Suggestions for Improvements
Aside
from the obvious advice to invest in better servers and better connection
lines, here are just some suggestions for featureswhich would make the
use of the e-versions of published articles competitive
with the hard copies, and with the versions available from theLANL arXiv:
1. Internal links to equations, tables, figures. The links would ideally generate a pop-up window with the required information, keeping the page from which the link originate savailable.2. Possibility to repaginate the document (e.g. to collect a series of equations or a set of figures into a single page) or to easily and quickly display multiple pages at the same time, so that cross-comparisons can be carried out by looking at a single screen.3. Ability to extend the above to several papers, so that one can work on more than one paper at the same time (as we often do with the paper versions).4. Fast location of a paper starting from its volume and page numbers. Each journal should have an easy-to-remember root for the URL of the paper archives, to which to add volume and page with a simple syntax (e.g. http://npb.com/500-111 for the article appearing on page 111 of volume 500 of Nucl Phys B.)
Michelangelo
Mangano
Theory Division
|