Natural Language Processing is a range of computational techniques
for analyzing and representing naturally occurring texts at
one or more levels of linguistic analysis for the purpose of
achieving human-like language processing for a range of particular
tasks or applications. The levels of linguistic analysis are:
- Phonological: interpretation
of speech sounds within and across words
- Morphological: componential
analysis of words, including prefixes, suffixes and roots
- Lexical: word level
analysis including lexical meaning and part of speech analysis
- Syntactic: analysis
of words in a sentence in order to uncover the grammatical
structure of the sentence
- Semantic: determining
the possible meanings of a sentence, including disambiguation
of words in context
- Discourse: interpreting
structure and meaning conveyed by texts larger than a sentence
- Pragmatic: understanding
the purposeful use of language in situations, particularly
those aspects of language which require world knowledge
The above levels of linguistic processing reflect an increasing
size of unit of analysis as well as increasing complexity
and difficulty as we move from top to bottom. The larger the
unit of analysis becomes (i.e., from morpheme to word to sentence
to paragraph to full document), the less precise the language
phenomena and the greater the free choice and variability.
This decrease in precision results in fewer discernible rules
and more reliance on less predictable regularities as one
moves from the lowest to the highest levels. Additionally,
higher levels presume reliance on the lower levels of language
understanding, and the theories used to explain the data move
more into the areas of cognitive psychology and artificial
intelligence. As a result, the lower levels of language processing
have been more thoroughly investigated and incorporated into
NLP systems. The technologies developed at CNLP utilize all
these levels where appropriate, as well as relevant subsets
where appropriate.
|