English for the Computer: The Susanne Corpus and Analytic Scheme Contributor(s): Sampson, Geoffrey (Author) |
|
![]() |
ISBN: 0198240236 ISBN-13: 9780198240235 Publisher: Clarendon Press OUR PRICE: $289.75 Product Type: Hardcover - Other Formats Published: March 1995 Annotation: Computer processing of natural language is a burgeoning field, but until now there has been no agreement of a standardized classification of the diverse structural elements that occur in real-life language material. This book attempts to define a 'Linnaean taxonomy' for the English language: an annotation scheme, the SUSANNE scheme, which yields a labelled constituency structure for any string of English, comprehensively identifying all of its surface and logical structural properties. The structure is specified with sufficient rigour that analysts working independently must produce identical annotations for a given example. The scheme is based on large samples of real-life use of British and American written and spoken English. The book also describes the SUSANNE electronic corpus of English which is annotated in accordance with the scheme. It is freely available as a research resource to anyone working at a computer connected to Internet, and since 1992 has come into widespread use in academic and commercial research environments on four continents. |
Additional Information |
BISAC Categories: - Language Arts & Disciplines | Grammar & Punctuation - Language Arts & Disciplines | Linguistics - General - Computers | Natural Language Processing |
Dewey: 425.012 |
LCCN: 94018387 |
Lexile Measure: 1570 |
Physical Information: 1.42" H x 6.47" W x 9.54" (1.53 lbs) 508 pages |
Descriptions, Reviews, Etc. |
Publisher Description: Computer processing of natural language is a burgeoning field, but until now there has been no agreement on a standardized classification of the diverse structural elements that occur in real-life language material. This book attempts to define a Linnaean taxonomy for the English language: an annotation scheme, the SUSANNE scheme, which yields a labelled constituency structure for any string of English, comprehensively identifying all of its surface and logical structural properties. The structure is specified with sufficient rigor that analysts working independently must produce identical annotations for a given example. The scheme is based on large sample of real-life use of British and American written and spoken English. The book also describes the SUSANNE electronic corpus of English which is annotated in accordance with the scheme. It is freely available as a research resource to anyone working at a computer connected to Internet, and since 1992 has come into widespread use in academic and commercial research environments on four continents. |