meeting_2007-01-23
Meeting with Polderland 23.1.2007
Participants:
- Peter Beinema
- Sjur Moshagen
Agenda
- Since last time
- Possible issues
- Next meeting
Since last time
Polderland:
- did poll technical contacts, no answer yet
- mklex: beta test results:
- some things we can't / won't deal with (multiword expr,
- repaired for problem with initial non- A-Za-z, requires rerun of test
- investigating possibilities for 2 extra word/noun classes:
- proper noun (as second part of compound),
- word that can follow genitive stem (as second part of compound),
- words in lexicon starting with hyphen:
- is not even processed "correctly": hyphen is skipped prior to checking
Divvun:
PLX conversion made good progress last week as well: noun compounding is now OK, and all POSes except numbers should be OK. We hope to deliver the first large-scale PLX lexicon today or tomorrow. It will not contain derivations, though, and these will account for a large portion of the total size.
Possible issues
Name "prefixes"
Some nouns are common as prefixes to names, mainly words for North,
- davvi-Norgga -> Davvi-Norgga (= North(ern) Norway)
That is:
name-prefix + name => upper case + hyphen
Thus, to correctly handle these cases, we need to identify names as different from other nouns, such that we can direct the uppercased and hyphenated
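The rule "name-prefix + name => upper case + hyphen" can be illustrated with a minimal Python sketch. This is not how mklex or the PLX lexicon implements it. The function name, the two word sets, and the plain concatenation used for all other compounds are assumptions made for illustration only; only the davvi + Norgga example comes from the notes.

```python
# Illustrative sketch only; all names and word lists here are hypothetical.
NAME_PREFIXES = {"davvi"}   # prefix from the example in the notes
PROPER_NOUNS = {"Norgga"}   # stand-in for a separate proper-noun class


def join_compound(first: str, second: str) -> str:
    """Join two compound parts.

    name-prefix + name => upper-cased first letter + hyphen.
    Anything else is simply concatenated (an assumption of this sketch).
    """
    if first.lower() in NAME_PREFIXES and second in PROPER_NOUNS:
        # Upper-case only the first letter of the prefix, then hyphenate.
        return f"{first[0].upper()}{first[1:]}-{second}"
    return first + second


print(join_compound("davvi", "Norgga"))  # Davvi-Norgga
```

The sketch shows why names must be marked as distinct from other nouns: without a separate proper-noun class, the function cannot tell when to apply upper-casing and hyphenation.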
Compound where first part is genitive:
similar issue as with name-prefix. Proposed solution:
Hyphen as prefix
In constructions of coordinated compounds with common first part (YX and YZ =>
Decision: such constructions will be handled automatically, and should not
Next meeting
Next Tuesday (30.1.) at the usual time.
TODO:
- get back on linguistic issue regarding proper nouns vs. common nouns
- done, see above
- get back on linguistic issue re. hyphen as prefix
- hard-coded, the speller will accept both with and without hyphen
- check if North Sámi hyphenation can be disabled when processing Lule Sámi (PLD)
- can be done with a simple work-around
- make complete PLX data set (Tomi)
- approaching :-)
- get language codes to work with Mac Office 2004 (and check MacOffice 2007)
- deliver mklex + hyphen script
- try to find proper compiler version for Adobe Indesign (old version will
- try to get an answer to the language codes in MS Office for Mac question from
- done, responded via e-mail