As Scots doesn't have a standardised spelling orthography, various spellings of words are used on the Scots wikipedia. Some of these spellings are common to many dialects, some are relatively rare and used in only a few dialects, some are archaic spellings and some were clearly made up for the wiki where no Scots spelling existed before.
As a non-native speaker, it is difficult to pop out into a Scottish street or pub and chat to the fellows there to establish which category the spellings fall into. Instead it is necessary to research each spelling, checking different online sources, this can be repetitive and time consuming.
I would like to propose three classes of spelling
This is a list of lemmas (in English) with Scots spelling variants, their word frequency on the scowiki corpus, the Scottish Corpus (Scots language) and other corpi as available.
Lemma | Spelling | Scowiki occurrences (2020-09-01) | Scottish Corpus occurrences | Scots Pairlament Corpus occurrences | Notes |
---|---|---|---|---|---|
addition | addeetion | 292 | 0 | 0 | Type C |
addition | addition | 387 | 10 | 3 | Type A |
addition | addeetional | 101 | 0 | 0 | Type C |
addition | additional | 126 | 3 | 1 | Type A |
addition | addeitional | 1 | 0 | 0 | Type C |
administrative | admeenistrative | 4204 | 0 | 0 | Type C |
administrative | administrative | 177 | 1 | 2 | Type A |
area | aurie | 10281 | 0 | 0 | |
area | area | 390 | 500 | 3 | |
british | breetish | 2326 | 1 | 1 | [a] |
british | british | 417 | 150 | 4 | |
city | ceety | 17840 | 0 | 0 | |
city | ceetie | 43 | 0 | 0 | |
city | city | 844 | 150 | 4 | |
city | ceity | 80 | 5 | 0 | |
city | ceities | 21 | 3 | 0 | |
city | citie | 0 | 5 | 0 | |
civil | ceevil | 835 | 16 | 0 | |
civil | civil | 128 | 100 | 0 | |
discover | diskivert | 44 | 0 | 0 | |
discover | discovered | 315 | 2 | 0 | |
discover | discovert | 7 | 4 | 0 | |
discover | fund | 267 | 600 | 0 | [b] |
independent | unthirldom | 933 | 0 | 0 | |
independent | independent | 726 | 50 | 1 | |
independent | unthirlt | 165 | 0 | 0 | |
independent | thirled | 0 | 15 | 3 | |
independent | sequestrate | 4 | 0 | 0 | |
independent | sequestratit | 0 | 1 | 0 | |
independent | unthirled | 0 | 0 | 0 | |
system | seestem | 1970 | 8 | 0 | |
system | system | 292 | 800 | 26 | |
symbol | seembol | 586 | 0 | 0 | |
symbol | symbol | 23 | 6 | 1 |
Let me know if there are other words, spellings and corpi to consider.
The English spelling is used as the lemma here because I'm English and one of the main points of the exercise is to establish which Scots spelling should represent the lemma, and which are secondary.