Parsing is an important method used in both computer science and linguistics. The term is synonymous with ‘syntactic analysis’, and refers to the process of taking a sequence (e.g. of characters), determining its structure, and checking whether it is legal in a given language. This is done by checking the structure of the sequence against a given formal grammar.

Sentences consist of a tree structure, where the sentence itself is the ‘root’, which branches out into phrases and eventually symbols, which are the ‘leaf’ nodes.

‘Parsing’ can also refer to the diagramming of sentences - usually for inflected languages such as the Romance languages or Latin.

In computer science, this method is sometimes related to the process of system quality assurance and code testing.

Related methods include: System quality assurance and code testing.

tool: Lucene

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
Methods relating to this toolCategory
Cataloguing and indexingData structuring and enhancement
CollatingData analysis
CollocatingData analysis
Content analysisData analysis
Data miningData analysis
IndexingData analysis
ParsingData analysis
Text miningData analysis
Topic Detection and TrackingData analysis
Lifecycle stage:
Alternate tool(s):

InQuira, Verity, dtSearch, ISYS

Syndicate content