Semi Automatic Indexing State of the Art - FTP Directory Listing - Nato

More documents

Recommendations

Info

6. CONCLUSIONS This report should have made evident the feasibility of mechanization in text processing. Although quality considerations were not discussed, it should be mentioned that most authors are satisfied by the results obtained with their methods. Thus automation in the indexing process should go ahead, particularly as other obstacles for automation in the IR process have also been largely overcome. These were primarily due to the lack of powerful programming languages and appropriate software either in the retrieval process, or in data base management. However, full advantage of indexing by computer assistance can only be taken when the textual information is in machine-readable form. As there is a trend towards automatic type-setting techniques, this condition may be fulfilled in the near future. Other techniques to transfer textual information in machine-readable form without re-writing and coding are already within the state-of-the-art. Automation in storage of information can then be achieved by using fully automatic and semi-automatic indexing techniques. The advantages of fully automatic indexing techniques are well known: — consistency is attained as the computer assigns index terms directly from the natural language text of the document, applying the same algorithm for each document; (In human indexing the indexer makes a separate judgement for each document.) — simplicity of re-indexing, which is important, because a scientific library is a living thing and classification schemes must always change according to either the aims of the library, or developments in science; — accuracy, which is guaranteed by the ability of the computer to select, transfer and re-arrange data reliably without making typographical errors; — economy, achieved by large-scale processing and computing speed; — facility for editing. From the quality point of view some fully automatic indexing techniques can be considered to have reached already the same level as purely intellectual indexing, at least in a production environment (37(1970), 100(1972)]. However, automatic indexing might not be fully satisfactory if high standard indexing is required. In general, one may state the better the index, the less intellectual effort is needed to search for information. Hence, the quality required for an index depends strictly upon the effort an average index user is willing to spend on retrieving information. A considerable amount of research is still required in order to have the machine do it well and efficiently especially in fully automatic text processing such as linguistic text analysis. Most approaches apply surrogates (statistical analysis) in order to overcome lack of knowledge in linguistics. Thus, semi-automatic computer controlled techniques are often preferred. (They do play a useful role in linguistic research also.) Machine assisted indexing can be thought of as a simulation of a manual process combined with some of the advantages of machine processing, such as accuracy, economy and facility of editing. For obvious reasons semi-automatic indexing is not fully consistent and re-indexing of the entire data base is costly. Economy will be achieved in a long-range period only since a better index can be produced which is the assumption of the success and effectiveness of any information system. Machine-aided indexing can also take full advantage of the preferences of intellectual indexing. Indexers (cited from [103(1969)]: — are able to make discriminations as to the relative importance of technical concepts as they appear in an abstract or document, — have access to the entire document and — can go beyond the document itself to reference books, to consultation with experts, or other sources as deemed appropriate, to aid in properly indexing the document at hand, — can apply inductive reasoning to formulate and index concepts which are implied by the document but not expressly stated (assignment indexing), — become familiar with the requirements of the users of the system by participating in search request analysis, search strategy formulation and search screening. Thus, semi-automatic indexing will be preferable to purely intellectual indexing. On the other hand it is believed that fully automatic indexing techniques can be developed in the near future to satisfy requirements. Machine-assisted methods may help in achieving this aim. Acknowledgements / wish to express my gratitude to Miss G. Pozzi. Head of the European Scientific Information Processing Centre. C.E.T.I.S., of the Commission of the European Community, who gave her support to the compilation of this report. In particular, I am grateful to Prof. F. W. Lancaster and my colleague W. Kolar for fruitful discussions and some useful suggestions. IMSI but not least, I acknowledge the helpful assistance that I received from the Library staff, the Publication and Typing Office of the EURATOM- Joint Research Center in Ispra.
REFERENCES 1 ACKERMANN H.J., HAGLIND J.B., LINDWALL H.G., MAIZELL R.E. SWIFT: Computerized Storage and Retrieval of Technical Information. J. of Chemical Documentation, 8, 1 (1968) 14-19 2 ARMITAGE JANET E., LYNCH MICHAEL F. Articulation in the Generation of Subject Indexes by Computer. J. of Chemical Documentation, 7, 3 (1967) 170-178 3 ARMITAGE JANET E., LYNCH MICHAEL F. Some Structural Characteristics of Articulated Subject Indexes. Inform. Stor. Retr., 4, 2 (1968) 101-111 4 ARMITAGE JANET E., LYNCH MICHAEL F., PETRIE J.H. Computer Generation of Articulated Subject Indexes. American Soc. of Inform. Science, Annual Meeting 3nd, 6 (1969) 5 ARMITAGE JANET E., LYNCH MICHAEL F., PETRIE J.H., BELTON M. Experimental Use of a Program for Computer-aided Subject Index Production. Inform. Stor. Retr., 6, 1 (1970) 79-87 6 ARTANDI SUSAN Automatic Book Indexing by Computer. Amer. Doc, 15,4(1964)250-257 7 ARTANDI SUSAN Book Indexing by Computer. Ph.D. Thesis. Graduate School of Library Service, Rutgers University (1963) 8 ARTANDI SUSAN Mechanical Indexing of Proper Nouns. J. Doc, 19,4(1963) 187-196 9 ARTANDI SUSAN, BAXENDALE STANLEY Project Medico (Model Experiment in Drug Indexing by Computer. First Progress Report LM-94 Grant). Graduate School of Library Service Rutgers, The State University, New Brunswick, New Jersey (1968) 10 AUSTIN D. PRECIS Indexing. The Information Scientist, (1971) 95-114 11 AXHAUSEN W.E.A., WESSEL A.E. Machine-aided Indexing and Analysis for Document and Fact Retrieval. Proceedings of ISLIC International Conference on Information Science, Tel Aviv, Israel, 29.8 - 3.9.71 12 BAXENDALE PHYLLIS B. Auto-Indexing and Indexing by Automatic Processes. Special Libraries (1965) 715-719 13 BELTON M. Computer-aided Production of the Subject Index to the SMRE Bibliography. The Indexer, 8, 1 (1972) 44-49 14 BENNETT JOHN L. On-Line Access to Information: NSF as an Aid to the Indexer/ Cataloger. Amer. Doc, 20, 3 (1969) 213-220 15 BENNETT JOHN L. On-Line Computer Aids for the Indexer. Presented at the 31st Ann. Meet, of the Amer. Soc. for Inform. Science, Columbus, Ohio 24.10.68. User Discussion Group VIII. Interactive Language Processing to the Working Inform. Scientist. 16 BENNETT JOHN L., CLARKE DAN C, MUSSON W.D. Observing and Evaluating an Interactive Process: A Pilot Experiment in Indexing IBM Research Report, RJ 1040 (1972) 17 BERNHARDT RUEDIGER Production of Indexes. Libri (Denmark), 21 (1971) 1-3, 215-25 18 BERNIER CHARLES L. Indexing and Thesauri. Spec. Libr., 59, 2(1968)98-103 19 BERNIER CHARLES L. Indexing Process Evaluation. Amer. Doc, 16,4 (1965) 323-328 20 BERNIER CHARLES L., CRANE E.J. Correlative Indexes VIII: Subject-Indexing vs Word-Indexing. J. of Chemical Documentation, 2, 2 (1962) 117-122 21 BORKOHARALD Experiments in Book Indexing by Computer. Inform. Stor. Retr., 6, 1 (1970) 5-16 22 BORKO HARALD Interactive Document Storage and Retrieval Systems Design Concepts. FID-IFIP Conference Rome, June 1967 23 CAMPEY LUCILLE H. Generating and Printing Indexes by Computer. ASLIB, Occasional Publication no. 11 (1972) 24 CARASGUSJ. Indexing from Abstracts of Documents. J. of Chemical Documentation, 8, 1 (1968) 20-22 25 CARNEY GERARD J. Computer-Assisted Index Preparation. Black, Donald V. (Ed.) Proceedings of the 1966 ADI Annual Meeting. Adreanne Press, Woodland Hills, Calif. (1966) 329-338 26 CARROLL JOHN M., FRASER WILLIAM, GILL GREGORY Automatic Content Analysis in On-Line Environment. Inform. Process. Letters, Amsterdam, 1 (1972) 134-140 27 CLARKE DAN C. Query Formulation for On-Line Reference Retrieval: Design Considerations from the Indexer/Searcher Viewpoint. Proceedings of the Amer. Soc. for Inform. Science, 7, 33rd Annual Meeting, Philadelphia, Oct. 11-15 (1970) 28 CLARKE DAN C, BENNETT JOHN L. An Experimental Framework for Observing the Indexing Process. J. of the Amer. Soc. for Inform. Science, 24, 1 (1973) 9-24 29 CLOUGH R., ROBINSON F., SAUNDERSON K.M. ASSASSIN - Ein System zur Deckung des oertlichen Datennachweisbedarfs und zur Versorgung zwischenbetrieblicher Netze. Paper presented at the 1st Europaeischen Tagung ueber Dokumentationssysteme und -Netze, Luxenburg, 16-18 May 1973 30 COATES EJ. Computer Assistance in the Production of BTI. Libr. Asso. Rec, 70, 10 (1968) 255-257 31 DAVIS CHARLES H., KEARNEY W. ROBERT, DAVIS BONNIE M. A Computer-based Procedure for Keyword Indexing of Newspaper. J. of the Amer. Soc. for Inform. Science, 22,4 (1971)348-351 32 DELHAYE LISE, FREDERIC ANNE-MARIE, HIRSCHBERG LYDIA, LENGER MARIE-THERESE, MORLET ERIC Projet de Construction Semi-Automatique d'Index pour la Bibliotheca Belgica'. T.A. Information, 2 (1966) 65-72 33 DOWELL N.G., MARSHALL J.W. Experience with Computer Produced Indexes. ASLIB Proceedings, 14 (1962) 323-332 34 ETZWEILER LARRY, MARTIN CARL Binary Cluster Division and its Application to a Modified Single Pass Clustering Algorithm. Report ISR-21 to the Nat. Sci. Foundation, Section XVII Cornell University, Department of Computer Science. Dec. 1972 19
Page 1: • Q < < P198486 N?07 AGARDograph
Page 4 and 5: THE MISSION OF AGARD The mission of
Page 7 and 8: Summary SEMI-AUTOMATIC INDEXING Sta
Page 9 and 10: — alphabetically adjacent index t
Page 11 and 12: Intellectual improvements of the KW
Page 13 and 14: ANNUAL REPORT EDITORIAL:* 1966= ANN
Page 15 and 16: (example contd.) 3.1.5. Proper Noun
Page 17 and 18: CHEMICAL ABSTRACT Journal Compound
Page 19 and 20: Some rules were established in orde
Page 21 and 22: The principle for the creation of s
Page 23: significance or non-significance ha
Page 27 and 28: 68 LOCKHEED MISSILES AND SPACE CO.
Page 31 and 32: AGARDograph No. 179 Advisory Group
Page 33 and 34: AGARDograph No. 179 Advisory Group
Page 36: DISTRIBUTION OF UNCLASSIFIED AGARD

Semi Automatic Indexing State of the Art - FTP Directory Listing - Nato

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?