skip to main content
article
Free Access

New Methods in Automatic Extracting

Published:01 April 1969Publication History
Skip Abstract Section

Abstract

This paper describes new methods of automatically extracting documents for screening purposes, i.e. the computer selection of sentences having the greatest potential for conveying to the reader the substance of the document. While previous work has focused on one component of sentence significance, namely, the presence of high-frequency content words (key words), the methods described here also treat three additional components: pragmatic words (cue words); title and heading words; and structural indicators (sentence location).

The research has resulted in an operating system and a research methodology. The extracting system is parameterized to control and vary the influence of the above four components. The research methodology includes procedures for the compilation of the required dictionaries, the setting of the control parameters, and the comparative evaluation of the automatic extracts with manually produced extracts. The results indicate that the three newly proposed components dominate the frequency component in the production of better extracts.

References

  1. 1 Automatic abstracting. RADC-TDR-63-93, TRW Computer Div., Thompsoa-Ramo- Wooldridge, Inc., Canoga Park, Calif., Feb. 1963.Google ScholarGoogle Scholar
  2. 2 EDMUNDSON, H. P. Problems in automatic abstracting. Comm. ACM 7, 4 (Apr. 1964), 259-263. Google ScholarGoogle Scholar
  3. 3 EnMUNDSON, H. P., AND WYLLYS, R. E. Automatic abstracting and indexing survey and recommendations. Comm. ACM 4, 5 (May 1961), 226-234. Google ScholarGoogle Scholar
  4. 4 Final report on the study for automtic abstracting. Cl07-1U12, Thompson-Ramo- Wooldridge, Inc., Canoga Park, Calif., Sept. 1961.Google ScholarGoogle Scholar
  5. 5 KUNs, J.L. An application of logical probability to problems in automatic abstracting and information retrieval. Joint Man-Computer Indexing and Abstracting, Sess. 13, First Congress on the Information System Sciences, Nov. 1962.Google ScholarGoogle Scholar
  6. 6 LUHN, H.P. The automatic creation of literature abstracts, iBM J. Res. Develop. 2, 2 (1959), 159-165.Google ScholarGoogle Scholar
  7. 7 RATH, G. J., RESNICK, A., AND SAVAGE, T. R. Comparisons of four types of lcxical indicators of content. Amer. Docum. 12, 2 (Apr. 1961), 126-130.Google ScholarGoogle Scholar

Index Terms

  1. New Methods in Automatic Extracting

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Journal of the ACM
          Journal of the ACM  Volume 16, Issue 2
          April 1969
          157 pages
          ISSN:0004-5411
          EISSN:1557-735X
          DOI:10.1145/321510
          Issue’s Table of Contents

          Copyright © 1969 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 April 1969
          Published in jacm Volume 16, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader