International Business Machines Corp.
The document file of this WALNUT information retrieval system contains the equivalent of 3,000 books. A punched-card inquiry system locates the desired filmstrip for viewing or photographic reproduction.
International Business Machines Corp.
This image converter of the WALNUT system optically reduces and transfers microfilm to filmstrips for storage. Each strip contains 99 document images. As a document image is transferred from microfilm to filmstrip, the image converter simultaneously assigns image file addresses and punches these addresses into punched cards controlling the conversion process.
The key to information retrieval lies in efficient abstracting. It has been customary to let people do this task in the past because there was no other way of getting it done. Unfortunately, man does not do a completely objective job of either preparing or using the abstract, and the result is a two-ended guessing game that wastes time and loses facts in the process. A machine abstracting system, devised by H. Peter Luhn of IBM, picks the words that appear most often and uses them as keys to reduce articles to usable, concise abstracts. A satisfactory solution seems near and will be a big step toward a completely computerized IR system.
For several years there has been a running battle between the computer IR enthusiast and the die-hard “librarian” type who claims that information retrieval is not amenable to anything but the human touch. It is true that adapting the computer to the task of information retrieval did not prove as simple as was hoped. But detractors are in much the same fix as the man with a shovel trying to build a dike against an angry rising sea, who scoffs at the scoop-shovel operator having trouble starting his engine. The wise thing to do is drop the shovel and help the machine. There will be a marriage of both types of retrieval, but Verner Clapp, president of the Washington, D.C., Council on Library Resources, stated at an IR symposium that computers offer the best chance of keeping up with the flood of information.
One sophisticated approach to IR uses symbolic logic, the forte of the digital computer. In a typical reductio ad logic, the following request for information:
An article in English concerning aircraft or spacecraft, written neither before 1937 or after 1957; should deal with laboratory tests leading to conclusions on an adhesive used to bond metal to rubber or plastic; the adhesive must not become brittle with age, must not absorb plasticizer from the rubber adherent, and must have a peel-strength of 20 lbs/in; it must have at least one of these properties—no appreciable solution in fuel and no absorption of solvent.
becomes the logical statement:
KKaVbcPdeCfg, and KAhiKKKNjNklSmn.
Armed with this symbolic abbreviation, the computer can dig quickly into its memory file and come up with the sought-for article or articles.
It has been suggested that the abstracting technique be applied at the opposite end of the cycle with a vengeance amounting to birth control of new articles. A Lockheed Electronics engineer proposes a technical library that not only accepts new material, but also rejects any that is not new. Here, of course, we may be skirting danger of the type risked by human birth control exponents—that of unwittingly depriving the world of a president, or a powerful scientific finding. Perhaps the screening, the function of “garbage disposal,” as one blunt worker puts it, should be left as an after-the-fact measure.
Despite early setbacks, the computer is making progress in the job of information retrieval. Figures of a 300 per cent improvement in efficiency in this new application are cited over the last several years. Operation HAYSTAQ, a Patent Office project in the chemical patent section accounting for one-fifth of all patents, showed a 50 per cent improvement in search speed and 100 per cent in accuracy as a result of using automated methods. Desk-size computer systems with solid-state circuits are being offered for information retrieval.
The number of scientific information centers in this country, starting with one in 1830, reached 59 in 1940 and now stands at 144. Significantly, of 2,000 scientists and engineers working at these centers, 381 are computer people.
Some representative information retrieval applications making good use of computer techniques are the selection of the seven astronauts for the Mercury Project from thousands of jet pilots, Procter & Gamble’s Technical Information Service, demonstration of an electronic law library to the American Bar Association, and Food Machinery and Chemical Corporation’s Central Research Laboratory. The National Science Foundation, the National Bureau of Standards, and the U.S. Patent Office are among the government agencies in addition to the military services that are interested in electronic information retrieval.
Summary
The impact of the computer on education, language and communication, and the handling of information is obviously already strongly felt. These inroads will be increased, and progress hastened in the years ahead of us. Perhaps of the greatest importance is the assigning to the machine functions closer to the roots of all these things. Rather than simply read or translate language, for example, the computer seems destined to improve on it. The same applies to the process of teaching and to the storage and retrieval of data. The electronic computer has shown that it is not a passive piece of equipment, but active and dynamic in nature. It will soon be as much a part of the classroom and library as books; one day it may take the place of books themselves.
Lichty, © Field Enterprises, Inc.
“How come they spend over a million on our new school, Miss Finch, and then forget to put in computer machines?”
“’Tis one and the same Nature that rolls on her course, and whoever has sufficiently considered the present state of things might certainly conclude as to both the future and the past.”
—Montaigne