Tracking misspellings

Authors

DOI:

https://doi.org/10.7557/sda.8321

Keywords:

North Saami, orthography, writing skills, misspellings, spellchecking program

Abstract

This article presents a study of North Saami texts published in Norway and Finland and collected from the Internet in order to see how native writers master the orthography, and what kind of misspellings are the most frequent ones. In the material, 4% of the words are not written according to the norm. The analysis of the mistakes reveals that it is not enough to know the relationship between pronunciation and the orthographic rules; knowledge of formal grammar is also necessary. The biggest problem is to distinguish between a and á, and to write the consonant centre of the word correctly. Finding the correct combination of stem and suffix is problematic, especially for i-stems. The writer also needs to know how to adapt loan words to the orthography. The existence of a wide range of possible variants of some frequent loan words only makes it worse. The biggest group of misspellings contains incorrect case suffixing of acronyms and numbers. These rules are complex and contra intuitive. The North Saami spell checking program discovers 78% of the misspellings and offers the correct spelling among the five first candidates in 82% of the cases.

References

Dutkanmateriála (Teavsttat leat vižžojuvvon interneahtas čuovvovaš čujuhusain):

http://klemetti.blogspot.com

http://naturvernforbundet.no

http://nsr.no

http://rauna.wordpress.com

http://se.wikipedia.org5

http://www.enontekionsanomat.fi

http://www.galdu.org

http://www.kafjord.com

http://www.kpk.no

http://www.lapinkansa.fi

http://www.lapinliitto.fi

http://www.marianpaivat.fi

http://www.metsa.fi

http://www.nesseby.kommune.no

http://www.nrk.no

http://www.samefolketsparti.no

http://www.saminissonforum.org

http://www.samisiida.org

http://www.samiskhs.no

http://www.siif.no

http://www.skabmagovat.fi

http://yle.fi

Girjjálašvuohta:

Antonsen, Lene & Trosterud, Trond 2010: Manne dihtor galgá máhttit grammatihka? – Sámi dieđalaš áigečála 1/2010: 3–28. https://doi.org/10.7557/sda.8605

Bergsland, Knut 1946: Røros-lappisk grammatikk – et forsøk på strukturell språkbeskrivelse. Instituttet for sammenlignende kulturforskning. Serie B. Skrifter XLIII. Oslo: Aschehoug.

Bergsland, Knut 1952: Hvordan den nye samiske rettskrivningen ble til. – Sameliv. Samisk selskaps årbok 1951–52: 28–50.

Čállinrávagirji 2003. Sámedikki giellaossodat ja Sámedikki oahpahusossodat. <http://www.samit.no/callinravvagat.pdf> (25.02.2014).

Damerau, Frederick J. 1964: A technique for computer detection and correction of spelling errors. – Communications of the ACM 7(3): 171–176. https://doi.org/10.1145/363958.363994

Deorowicz, Sebastian & Ciura, Marcin G. 2005: Correcting spelling errors by modelling their causes. – International Journal of Applied Mathematics and Computer Science 15(2): 275–285. <https://www.amcs.uz.zgora.pl/?action=paper&paper=246> (26.02.2014).

Divvun-čielggadeapmi 2009 = Asplan Viak: Sametinget – Permanent driftsorganisasjon for Divvun-prosjektet. Endelig utgave 2009-03-31. Asplan Viak AS. <http://www.divvun.no/adm/board/Permanent_driftsorganisasjon_for_Divvun-prosjektet.pdf> (26.02.2014).

Kåven, Brita & Jernsletten, Johan & Nordal, Ingrid & Eira, John Henrik & Solbakk, Aage 1995: Sámi-dáru sátnegirji. Kárášjohka: Davvi Girji.

Kukich, Karen 1992: Techniques for Automatically Correcting Words in Text. – ACM Compiling Surveys 24(4): 377–439. https://doi.org/10.1145/146370.146380

Levenshtein, V. I. 1966: Binary codes capable of correcting deletions, insertions and reversals. – Soviet Physics Doklady 10(8): 707–710 [Jorgaluvvon publikašuvnnas Doklady Akademii Nauk SSSR 163(4): 845–848.] <http://profs.sci.univr.it/~liptak/ALBioinfo/files/levenshtein66.pdf> (24.02.2014).

Länsman, Outi 2009: Oahppiid feaillat čállosiin. – Johanna Ijäs & Nils Øivind Helander (doaimm.), Sáhkavuoruin sáhkan. Sámegiela ja sámi girjjálašvuođa muhtin áigeguovdilis dutkanfáttát. Dieđut 1/2009. [Guovdageaidnu]: Sámi allaskuvla. 70–85.

Magga, Ole Henrik 1985: Davvisámegiela čállinvuogi gielalaš vuođđu. – Giela dutkan, dikšun ja oahpaheapmi. Dieđut 2/1985. [Guovdageaidnu]: Sámi Instituhtta. 32–41.

Magga, Ole Henrik 1998: Noen normeringsproblem i samisk. – Ruth Vatvedt Fjeld & Boye Wangensteen (doaimm.), Normer og regler. Festskrift til Dag Gundersen 15. januar 1998. [Oslo]: Nordisk forening for leksikografi. 90–101.

Magga, Ole Henrik 2010: Fonetihka vuođđu. [Oahppomateriála čoakkáldagas Teakstačoakkáldat – Sám 108. Sámi allaskuvla.]

Mitton, Roger 2009: Ordering the suggestions of a spellchecker without using context. – Natural Language Engineering 15(02): 173–192. https://doi.org/10.1017/S1351324908004804

Moshagen, Sjur Nørstebø 2008: A language technology test bench – automatized testing in the Divvun project. – Rickard Domeij, Sofie Johansson Kokkinakis, Ola Knutsson & Sylvana Sofkova Hashemi (doaimm.), Proceedings of the Workshop on NLP for Reading and Writing – Resources, Algorithms and Tools. NEALT Proceeding Series 3. Stockholm: SLTC. 19–21. <http://www.academia.edu/3117605/Proceedings_of_the_Workshop_on_NLP_for_Reading_and_Writing_-_Resources_Algorithms_and_Tools> (26.02.2014).

Nickel, Klaus Peter 1994: Samisk grammatikk. Karasjok: Davvi Girji.

Nickel, Klaus Peter & Sammallahti, Pekka 2008: Duiskka-sámi sátnegirji. Karasjok: Davvi Girji.

Nickel, Klaus Peter & Sammallahti, Pekka 2011: Nordsamisk grammatikk. Karasjok: Davvi Girji.

Nielsen, Konrad 1979 (1926–1929): Lærebok i lappisk (samisk). Bind I: Grammatikk. Oslo: Universitetsforlaget.

Nielsen, Konrad 1979 (1932–1962): Lappisk (samisk) ordbok grunnet på dialektene i Polmak, Karasjok og Kautokeino. Oslo: Universitetsforlaget.

Pieski, Tomi & Moshagen, Sjur & Omma, Thomas 2008: Goallosteapmi Divvunreaidduin. <http://giellatekno.uit.no/background/Goallosteapmi_Divvun.pdf> (29.10.2013).

Sammallahti, Pekka 1985: Muhtin fuomášahttimat nuppi stávvala vokálaid čállima birra oktasaš čállinvuogis. – Giela dutkan, dikšun ja oahpaheapmi. Dieđut 2/1985. [Guovdageaidnu]: Sámi Instituhtta. 68–74.

Sammallahti, Pekka 1989: Sámi-suoma sátnegirji. Ohcejohka: Jorgaleaddji.

Sammallahti, Pekka 1993: Sámi-suoma-sámi sátnegirji. Ohcejohka: Girjegiisá.

Sammallahti, Pekka 2006: Jietnadatoahpa vuođđokursa. Logaldatmáŋggus. [Oulu]: Oulu universitehta.

Sammallahti, Pekka & Nickel, Klaus Peter 2006: Sámi-duiskka sátnegirji. Karasjok: Davvi Girji.

SG ruovttusiidu = Sámi Giellagáldu ruovttusiidu. <http://samigiellagaldu.org> (26.01.2014).

SGL 2005 = Sámi giellalávdegoddi, Davvisámegiela juogus Čoahkkingirji 2/05. 13.12.2005. Kárášjohka.

SGR 1993 = Sámi Giellaráđđi – čoahkkingirjeoassi. Čoahkkin geassemánu 14.–15. b. 1993. Ášši 22/93 Mo merket gehčosiid loguide. https://doi.org/10.1108/EUM0000000000983

SNSO 2000 = Dáru-sámi sátnegirji. Kárášjohka: Davvi Girji.

Svonni, Mikael 1990: Sámi-ruoŧa, ruoŧa-sámi sátnegirji. Jokkmokk: Sámi Girjjit.

Svonni, Mikael 2013: Davvisámegiela-ruoŧagiela, ruoŧagiela-davvisámegiela sátnegirji. Karasjok: ČálliidLágádus.

Wiechetek, Linda 2012: Constraint Grammar based Correction of Grammatical Errors for North Sámi. – Guy De Pauw, Gilles-Maurice de Schryver, Mikel L. Forcada, Kepa Sarasola, Francis M. Tyers & Peter Waiganjo Wagacha (doaimm.), Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages. SaLTMiL8 – AfLaT 2012. European Language Resources Association (ELRA). 35–40.

Published

2025-09-26

Issue

Section

Articles

How to Cite

Antonsen, L. (2025). Tracking misspellings. Sámi dieđalaš áigečála, (2), 7–32. https://doi.org/10.7557/sda.8321