Speller Test Results for: «sp-regression-hu-sme.txt»
Overview
Technical data
Language tested: sme
Document tested: sp-regression-hu-sme.txt
Speller tool: Hunspell, command line version
Speller tool version: @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2)
Speller lexicon version: 1.0beta10
Test Date: 20130516-2145
Test Type: regression
Processing time (user+system time): No data available
Speed: No data available
Max memory usage of the speller (max value of /bin/ps -o rss= PID sampled every 10 seconds): 0 Kb
Result summary
Nº of input words: 145
Nº of spelling errors: 80 (55,17% of all words)
| Reality \ Speller view | Speller Positive (Nº of flagged words): 0 | Speller Negative (Nº of accepted words): 0 | Wrong tokenisation (all tokenisation errors): 0 |
|---|---|---|---|
| Nº of real errors: 80 | Nº of true positives (detected real errors): 0 | Nº of false negatives (unflagged spelling errors): 0 | Nº of erroneously tokenised spelling errors: 0 |
| Nº of real correct words: 65 | Nº of false positives (incorrectly flagged words): 0 | Nº of true negatives (unflagged correct words): 0 | Nº of erroneously tokenised correct words: 0 |
Precision (tp/(tp+fp)):
NaN
Recall (tp/(tp+fn)):
NaN
Accuracy ((tp+tn)/words):
NaN
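The three ratios above are undefined when their denominator is zero, which is why this run reports NaN. The following is a minimal sketch of that arithmetic, not the report generator's actual code; the function names `ratio` and `speller_metrics` are hypothetical, and the word count used for accuracy is passed in explicitly because the report only calls it "words".

```python
import math


def ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def speller_metrics(tp: int, fp: int, fn: int, tn: int, words: int) -> dict:
    """Compute the three summary ratios using the formulas given in the report."""
    return {
        "precision": ratio(tp, tp + fp),   # tp / (tp + fp)
        "recall": ratio(tp, tp + fn),      # tp / (tp + fn)
        "accuracy": ratio(tp + tn, words), # (tp + tn) / words
    }


# All four counts are 0 in this run, so precision and recall are NaN.
# Passing words=0 is an assumption that also makes accuracy NaN.
metrics = speller_metrics(tp=0, fp=0, fn=0, tn=0, words=0)
```

In this sketch, passing `words=145` instead would give an accuracy of 0.0 and not NaN, so the NaN in the report suggests that no words were counted as processed.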
| Suggestion statistics for true positives (= 0 = 100%): | Spelling errors: 80 | Simple errors (edit dist. 1): 59 (73,75%) | Errors with edit distance 2: 19 (23,75%) | Errors with edit distance ≥3: 2 (2,5%) |
|---|---|---|---|---|
| Nº of detected spelling errors with correct suggestion in first position: | 0 (NaN %) | 0 | 0 | 0 |
| Nº of detected spelling errors with correct suggestion in top 5: | 0 (NaN %) | 0 | 0 | 0 |
| Nº of detected spelling errors with correct suggestion below top 5: | 0 (NaN %) | 0 | 0 | 0 |
| Nº of detected spelling errors with only wrong suggestions: | 0 (NaN %) | 0 | 0 | 0 |
| Nº of detected spelling errors with no suggestions at all: | 0 (NaN %) | 0 | 0 | 0 |
| Undetected spelling errors: | 0 | 0 | 0 | 0 |
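The split into edit distance 1, 2 and ≥3 can be illustrated with a small sketch. It assumes plain Levenshtein distance (insertions, deletions, substitutions) over NFC-normalised strings; the report does not say which distance measure the test tool actually uses, and the names `levenshtein` and `bucket` are hypothetical. The three word pairs are taken from the tables in this report.

```python
import unicodedata
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance between two NFC-normalised strings."""
    a = unicodedata.normalize("NFC", a)
    b = unicodedata.normalize("NFC", b)
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def bucket(distance: int) -> str:
    """Map a distance to the report's three columns: 1, 2, or >=3."""
    return "≥3" if distance >= 3 else str(distance)


# (input word, expected correction) pairs from this report.
pairs = [("okt", "okta"), ("Stno", "Sudno"), ("Marjašna", "Marjalan")]
histogram = Counter(bucket(levenshtein(word, expected)) for word, expected in pairs)
```

For these three pairs the sketch yields distances 1, 2 and 3, matching the "Editing distance" values listed for them in the tables.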
True negatives (0)
The correctly accepted words are listed here for easy copying and pasting into Word, to quickly check for regressions in new versions of the speller: no correctly spelled word accepted by an earlier speller should be rejected by a later one. To check, copy and paste these words into Word and see whether any of them get red underlines. Duplicate words are not repeated, unless one is followed by punctuation. The words are sorted in reverse order by Unicode code point.
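The regression rule described above can also be checked without Word. The following is a minimal sketch under stated assumptions: the two function names are hypothetical, the accepted-word lists are supplied by the caller, and the sample words are correct forms taken from this report purely for illustration (this run itself accepted none).

```python
def regression_word_list(accepted_words):
    """Deduplicate accepted words and sort them in reverse code point order.

    A word followed by punctuation (e.g. "okta.") is a different string
    from the bare word, so it survives deduplication, as the report describes.
    """
    return sorted(set(accepted_words), reverse=True)


def regressions(previously_accepted, now_accepted):
    """Words an earlier speller accepted that a later speller rejects."""
    return sorted(set(previously_accepted) - set(now_accepted), reverse=True)


# Illustrative input only, not results from an actual speller run.
earlier = ["okta", "guokte", "golbma", "okta"]
later = ["okta", "golbma"]

word_list = regression_word_list(earlier)
lost = regressions(earlier, later)
```

An empty `lost` list means the later speller has not regressed on any previously accepted word.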
Grouped by bug #
| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Avvila | | | | inflections not recognised |
| čuovvut | | | | inflections not recognised |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| hillgurralaš | hillágurralaš | 1 | | downcase derived proper |
| hillágurralaš | | | | downcase derived proper |
| perlaččat | perulaččat | 1 | | downcase derived proper |
| perulaččat | | | | downcase derived proper |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| vuostaš | vuosttaš | 1 | | still numerals missing words |
| moaddeloi | moaddelogi | 1 | | still numerals missing words |
| mánga | máŋga | 1 | | still numerals missing words |
| vuosttaš | | | | still numerals missing words |
| moaddelogi | | | | still numerals missing words |
| golma | golmma | 1 | | still numerals missing words |
| guovti | guovtti | 1 | | still numerals missing words |
| guvtiin | guvttiin | 1 | | still numerals missing words |
| okt | okta | 1 | | still numerals missing words |
| guokt | guokte | 1 | | still numerals missing words |
| golbna | golbma | 1 | | still numerals missing words |
| golmma | | | | still numerals missing words |
| guovtti | | | | still numerals missing words |
| guvttiin | | | | still numerals missing words |
| okta | | | | still numerals missing words |
| guokte | | | | still numerals missing words |
| golbma | | | | still numerals missing words |
| máŋga | | | | still numerals missing words |
| ovccičuođiovccilogiovcciduhátovccičuođiovccilogiovcci | | | | still numerals missing words |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| garraseabbu | garraseabbo | 1 | | eabbo/eamos Nom+Sg missing words |
| čuovgadeabbu | čuovgadeabbo | 1 | | eabbo/eamos Nom+Sg missing words |
| boarraseamos | boarráseamos | 1 | | eabbo/eamos Nom+Sg missing words |
| dábáleamus | dábáleamos | 1 | | eabbo/eamos Nom+Sg missing words |
| dehált | dehálet | 1 | | eabbo/eamos Nom+Sg missing words |
| Váddáseamus | Váddáseamos | 1 | | eabbo/eamos Nom+Sg missing words |
| boarrasamos | boarrásamos | 1 | | eabbo/eamos Nom+Sg missing words |
| garraseabbo | | | | eabbo/eamos Nom+Sg missing words |
| čuovgadeabbo | | | | eabbo/eamos Nom+Sg missing words |
| boarráseamos | | | | eabbo/eamos Nom+Sg missing words |
| dábáleamos | | | | eabbo/eamos Nom+Sg missing words |
| Váddáseamos | | | | eabbo/eamos Nom+Sg missing words |
| boarrásamos | | | | eabbo/eamos Nom+Sg missing words |
| dehálet | | | | eabbo/eamos Nom+Sg missing words |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| smat | mat | 1 | | suggestions:smat,sbat, sdat |
| sbat | bat | 1 | | suggestions:smat,sbat, sdat |
| sdat | dat | 1 | | suggestions:smat,sbat, sdat |
| tgen | gen | 1 | | suggestions:smat,sbat, sdat |
| valáštalaama | valáštallama | 1 | | suggestions:smat,sbat, sdat |
| valáštalana | valáštallama | 2 | | suggestions:smat,sbat, sdat |
| valáštalamat | valáštallamat | 1 | | suggestions:smat,sbat, sdat |
| ltnot | litnot | 1 | | suggestions:smat,sbat, sdat |
| Stno | Sudno | 2 | | suggestions:smat,sbat, sdat |
| Suitno | Sudno | 2 | | suggestions:smat,sbat, sdat |
| Sultno | Sudno | 2 | | suggestions:smat,sbat, sdat |
| Shutno | Sudno | 2 | | suggestions:smat,sbat, sdat |
| Suino | Sudno | 1 | | suggestions:smat,sbat, sdat |
| Søtno | Sudno | 2 | | suggestions:smat,sbat, sdat |
| Sunno | Sudno | 1 | | suggestions:smat,sbat, sdat |
| Wutno | Sudno | 2 | | suggestions:smat,sbat, sdat |
| lágádusna | lágádussan | 2 | | suggestions:smat,sbat, sdat |
| lágádusano | lágádussan | 2 | | suggestions:smat,sbat, sdat |
| lágádusana | lágádussan | 2 | | suggestions:smat,sbat, sdat |
| servona | servoma | 1 | | suggestions:smat,sbat, sdat |
| fuomama | fuomáša | 2 | | suggestions:smat,sbat, sdat |
| Marjasna | Mariana | 2 | | suggestions:smat,sbat, sdat |
| Marjaama | Marjamaa | 2 | | suggestions:smat,sbat, sdat |
| Marjana | Mariana | 1 | | suggestions:smat,sbat, sdat |
| Marjalana | Marjalan | 1 | | suggestions:smat,sbat, sdat |
| Marjasana | Marjalan | 2 | | suggestions:smat,sbat, sdat |
| Marjažana | Marjalan | 2 | | suggestions:smat,sbat, sdat |
| Marjašna | Marjalan | 3 | | suggestions:smat,sbat, sdat |
| Marjaina | Mariana | 2 | | suggestions:smat,sbat, sdat |
| Marjanna | Marjanen | 2 | | suggestions:smat,sbat, sda |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| hotella | hotealla | 1 | | hotealla is missing words |
| hotelli | hotellii | 1 | | hotealla is missing words |
| hotealla | | | | hotealla is missing words |
| hotellii | | | | hotealla is missing words |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| almmuhuvvojt | almmuhuvvojit | 1 | | some long passives not recognized |
| muitaluvvojt | muitaluvvojit | 1 | | some long passives not recognized |
| almmuhuvvojit | | | | some long passives not recognized |
| muitaluvvojit | | | | some long passives not recognized |
| rahčagoahtet | rahčagohtet | 1 | | not recognized |
| rahčagohtet | | | | not recognized |
| muitalivčii | muitalivččii | 1 | | not recognized |
| muitalivččii | | | | not recognized |
| dárbbašivčii | dárbbašivččii | 1 | | not recognized |
| dárbbašivččii | | | | not recognized |
| badjelass | badjelasas | 1 | | not recognized |
| badjelasas | | | | not recognized |
| duisska | duiska | 1 | | not recognized |
| duiska | | | | not recognized |
| beassaš | beassáš | 1 | | not recognized |
| beassáš | | | | not recognized |
| idjadallata | idjadallat | 1 | | not recognized |
| idjadallat | | | | not recognized |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| nai | | | | no particles in Hunspell |
| na | | | | no particles in Hunspell |
| goit | | | | no particles in Hunspell |
| ges | | | | no particles in Hunspell |
| ge | | | | no particles in Hunspell |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| le | lea | 1 | | single letter suggestions |
| NSR:ii | NSR:i | 1 | | single letter suggestions |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| beaggigođii | | | | gođii- |
| boahtigođii | | | | gođii- |
| bealkigođiime | | | | gođii- |
| ballagohten | | | | gođii- |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| Juovlamánno | Juovlamánno- | 1 | | compound-form accepted as is |
| Juovlamánno- | | | | compound-form accepted as is |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| jeagge | jeagge- | 1 | | compound patterns |
| jeagge- | | | | compound patterns |
| boazo-rodjon | boazorodjan | 2 | | compound patterns |
| Muitalusat-girjiin | Muitalusgirjiin | 3 | | compound patterns |
| Obadja-badjálagaid | bádja-badjálagaid | 2 | | compound patterns |
| vuodjenváldi | | | | compound patterns |
| boazorodjan | | | | compound patterns |
| Muitalusgirjiin | | | | compound patterns |
| bádja-badjálagaid | | | | compound patterns |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| luonddu- | | | | cmp-forms w fin hyph gets marked |
| giella- | | | | cmp-forms w fin hyph gets marked |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| kiinnálaččat | | | | no lowering |
| troanddinlaš | | | | no lowering |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| boahte | boahtte | 1 | | missing wordforms |
| gslbma | galbma | 1 | | missing wordforms |
| gehppa | geahppa | 1 | | missing wordforms |
| guovgga | guovga | 1 | | missing wordforms |
| boarres | boares | 1 | | missing wordforms |
| geargus | gearggus | 1 | | missing wordforms |
| presidenta | presideanta | 1 | | missing wordforms |
| boahtte | | | | missing wordforms |
| galbma | | | | missing wordforms |
| geahppa | | | | missing wordforms |
| guovga | | | | missing wordforms |
| boares | | | | missing wordforms |
| gearggus | | | | missing wordforms |
| presideanta | | | | missing wordsforms |

| Input word | Expected correction | Editing distance | Suggestions | Comment |
|---|---|---|---|---|
| sámedike | sámedikke | 1 | | no gen-allegroforms |
| dike | dikke | 1 | | no gen-allegroforms |
| giete | giette | 1 | | no gen-allegroforms |
| lávvde | lávdde | 1 | | no gen-allegroforms |
| mere | meare | 1 | | no gen-allegroforms |
| darrfe | darffe | 1 | | no gen-allegroforms |
| vuro | vuoro | 1 | | no gen-allegroforms |
| sámedikke | | | | no gen-allegroforms |
| dikke | | | | no gen-allegroforms |
| giette | | | | no gen-allegroforms |
| lávdde | | | | no gen-allegroforms |
| meare | | | | no gen-allegroforms |
| darffe | | | | no gen-allegroforms |
| vuoro | | | | no gen-allegroforms |

