UiT > Divvun
 
Font size:      

Morphological disambiguation

Morphological disambiguation

The project uses Eckhard Bick and Tino Dideriksens vislcg3, a compiler for constraint grammar, a formalism originally developed by a.o. Fred Karlsson and Pasi Tapanainen at the University of Helsinki.

So far, we have a disambiguation file sme-dis.rle, for North Sámi, and a file smj-dis.rle for Lule Sámi. In addition to that, we are working on beta versions for Greenlandic and Faroese. The content of the North Saami file is documented here.

Other versions of constraint grammar

The philosophy behind constraint grammar is relatively simple, and the programming algorithms are found in several different version (the version we use at giellatekno is no. 5 on this list).

  1. The original CG formalism was developed by Fred Karlsson, at Dept. of Linguistics in Helsinki
  2. Bart Jongejan's C++ version of the original CG
  3. A new version of CG, called CG-2, was developed by Pasi Tapanainen, cf. Tapanainen 1996. It is maintained by Connexor
  4. vislcg, an open source implementation of CG-2, at http://visl.hum.sdu.dk/visl/. This parser is available for download ad sourceforge, here.
  5. vislcg3, a new generation version ov the open source vislcg is currently being developed at http://visl.hum.sdu.dk/visl/.

Other disambiguation tools

Grammar-based disambiguation tools

For various reasons, do not use these tools, but have chosen constraint grammar instead.

Statistically-based disambiguation tools

This is the dominating approach to disambiguation. Here, we think the grammar-based approach is on the right track, and we thus do not use statistically-based tools. In the future it still may be relevant to have a look at what statixtically-based tools may achieve when applied on the top of grammar-based disambiguation tools, in order to resolve on statistical grounds such open cases that cannot be dealt with by grammatical tools alone.

Thus, we should at some point start orientating ourself in the world of statistically-based disambiguation tools.

Links to old documentation

These links are kept for reference, but not relevant for present usage.

A disambiguation bibliography

Forthcoming...

Last modified: $Date: 2012-11-22 13:41:43 +0100 (tor, 22 nov 2012) $, by $Author: trond $