Documentation common to all languages
Linguistic issues
- Preprocessing
- Documentation of the Perl preprocessor file gt/script/preprocess
- Morphological tagging
- Disambiguation
Divvun/Giellatekno corpus documentation
- The raw corpus repository
- The corpus improvement project
- Corpus collector's manual
- Corpus conversion
- Language recognition
- ccat, the program used to list the text in corpus files.
- tca2, the corpus alignment program.
- ... with test results for the parallelisation
- The tagged corpus files
- The parallel corpus files

