We present Corpus Pattern Analysis (CPA), a method of investigating corpora developed by linguist and lexicographer Patrick Hanks. CPA aims to find the normal patterns of usage in a language, that is, those patterns that represent how we use words to make meanings. Patterns are cognitively and socially relevant; they are recurrent structures we rely on in communication and can tell us much about our cognitive structure. In this chapter, we first outline the key features of CPA. Then, we illustrate the details of the methodology and introduce the inventories of patterns that have been developed so far by applying CPA to different languages. Finally, we report the ongoing attempts to automatize the CPA procedure, originally conceived as a manual technique, to speed up pattern acquisition from corpora using traditional and neural machine-learning techniques.
Corpus Pattern Analysis
Jezek, E.
2025-01-01
Abstract
We present Corpus Pattern Analysis (CPA), a method of investigating corpora developed by linguist and lexicographer Patrick Hanks. CPA aims to find the normal patterns of usage in a language, that is, those patterns that represent how we use words to make meanings. Patterns are cognitively and socially relevant; they are recurrent structures we rely on in communication and can tell us much about our cognitive structure. In this chapter, we first outline the key features of CPA. Then, we illustrate the details of the methodology and introduce the inventories of patterns that have been developed so far by applying CPA to different languages. Finally, we report the ongoing attempts to automatize the CPA procedure, originally conceived as a manual technique, to speed up pattern acquisition from corpora using traditional and neural machine-learning techniques.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


