Difference between revisions of "Australian-English Spelling Dictionary"

From Apache OpenOffice Wiki
Jump to: navigation, search
(Objectives: Added note about Phonetics)
(Proposed Australian Word list Categories)
 
(28 intermediate revisions by the same user not shown)
Line 1: Line 1:
== Objectives ==
+
== History ==
  
 
The Australian-English Spelling dictionary is currently packaged with the English dictionaries in the OpenOffice 3.1 distribution.  
 
The Australian-English Spelling dictionary is currently packaged with the English dictionaries in the OpenOffice 3.1 distribution.  
It is available as an add-on extension for OpenOffice 3.0 at [http://extensions.services.openoffice.org/project/AustralianDictionary]. And can be added to OpenOffice 2.0 [http://wiki.services.openoffice.org/wiki/Dictionaries#English_.28AU.2CCA.2CGB.2CNZ.2CUS.2CZA.29].
+
 
 +
It is available as an [http://extensions.services.openoffice.org/project/AustralianDictionary add-on extension] for OpenOffice 3.0. And can be added to OpenOffice 2.X [http://wiki.services.openoffice.org/wiki/Dictionaries#English_.28AU.2CCA.2CGB.2CNZ.2CUS.2CZA.29].
 +
 
 +
The current dictionary is based on the the United Kingdom-English word list from [http://wordlist.sourceforge.net/scowl-readme SCOWL] (see links below) with an ad hoc collection of 'Australian' words added. I ([[User:Dnw|David Wilson]]) have built up this list from a variety of public sources over several years, so that it includes place names, botanical and animal names, common personal names and 'Australianisms'. These have been added to an Australian word list over time.
 +
 
 +
[http://en.wikipedia.org/wiki/Australian_English Australian-English] generally follows the conventions of [http://en.wikipedia.org/wiki/British_English British-English] rather than [http://en.wikipedia.org/wiki/American_English American English]. It may useful to see [http://en.wikipedia.org/wiki/American_and_British_English_differences American and British English differences].
 +
 
 +
The [http://en.wikipedia.org/wiki/Macquarie_Dictionary Macquarie Dictionary] is generally used by publishers, schools, universities and governments as the standard spelling reference. " The dictionary records standard Australian English spelling, which is closer to British and Canadian  English  than American English, with spellings like ''colour'', ''centre'', ''defence'' and  ''practice/practise'' (noun/verb).  It also gives ''-ise'' spellings first, listing ''-ize'' spellings as acceptable  variants, unlike the  [http://en.wikipedia.org/wiki/Oxford_English_Dictionary Oxford English Dictionary] and some other dictionaries of British English, that continue to  prefer ''-ize'' to ''-ise'' in spite of the opposite tendency amongst the British general public (see [http://en.wikipedia.org/wiki/Oxford_spelling Oxford spelling])." [http://en.wikipedia.org/Macquarie_Dictionary]
 +
 
 +
I suggest using this dictionary as the project's authoritative reference.
 +
 
 +
== Objectives ==
  
 
The object of this project is improve the organisation and maintenance of the Australian-English Spelling Dictionary.
 
The object of this project is improve the organisation and maintenance of the Australian-English Spelling Dictionary.
  
The current dictionary is based on the the United Kingdom English word list with an ad hoc collection of 'Australian' words added. I have build up this list from a variety of public sources over several years, so that it includes place names, botanical and animal names, common personal names and 'Australianisms'. These have been added to an Australian word list over time.
+
To improve the maintenance of the word list I propose to add structure to the project by maintaining separate word lists that are combined to produce the new versions of the Australian-English Spelling Dictionary. The separate word lists should help us to ensure we have the best coverage possible in each category and to check each category for adequacy and completeness.
  
To improve the maintenance of the word list I propose to add structure to the project by maintaining separate word lists that are combined to produce the new versions of the Australian-English Spelling Dictionary. The separate word lists should help us to ensure we have the best coverage possible in each category.
+
The Australian-English Spelling Dictionary currently uses [[Australian-English_Spelling_Dictionary/phonetic_hints | phonetic hints]] from the US-English Spelling Dictionary. These may need to be adjusted for Australian usage.
  
The Australian-English Spelling Dictionary currently uses phonetic hints from the US-English Spelling Dictionary. These may need to be adjusted for Australian usage.
+
The OpenOffice spelling checker now supports hyphenated words, so a new version needs to be created with appropriate hyphenated words.
  
== Proposed Australian Word list Categories ==
+
'''If you would like to assist in this project please contact me''' [[User:Dnw|David Wilson]] 06:15, 29 October 2009 (UTC)
  
* Place Names
+
== Issues ==
* Botanical Names (flora)
+
 
* Animal names (fauna)  
+
We also need to work out how to deal with the ''-ise'' vs ''-ize'' word endings. Currently only the ''-ise'' words are included in the dictionary and ''-ize'' words are shown as errors.
* Australianism  
+
 
* Common personal and Family names
+
We have at least three options:
 +
 
 +
# Include only ''-ise'' words.
 +
# Include both ''-ise'' and ''-ize'' words.
 +
# It may be possible to include both ''-ise'' and ''-ize'' words but only suggest ''-ise'' spellings when suggesting spelling alternatives. This may be possible Using the NOSUGGEST option which is currently used to suppress a couple of obscene words from being displayed in the suggested spellings list.
 +
 
 +
'''Personal Names:''' Currently the dictionary includes common personal and surnames. I am inclined NOT to include these in the next edition due the large range of spelling variations in names abd difficulties in building any definitive list. For people who want to include names in the spelling checker can use my add-on spelling checker dictionary- [http://extensions.services.openoffice.org/project/en_AU_Names Australian English Names Dictionary]
 +
 
 +
== Proposed Australian Word list Categories and Sources ==
 +
 
 +
* Place Names use [http://www1.auspost.com.au/postcodes/index.asp?sub=2 Australian Postcode list], possibly use the Australian [http://www.geonames.org/ GeoNames] list ?
 +
* Botanical Names (flora). Botanical Family names list derived from data on the Integrated Botanical Information System (IBIS), Australian National Botanic Gardens, Australian National Herbarium [http://www.anbg.gov.au/apni/index.html website]
 +
* Animal names (fauna) [http://www.environment.gov.au/cgi-bin/sprat/public/publicthreatenedlist.pl?wanted=fauna EPBC Act List of Threatened Fauna], [http://en.wikipedia.org/wiki/Fauna_of_Australia Wikipedia, Fauna of Australia]
 +
* Australianism (drongo, bludger, reffo, etc.) [http://en.wiktionary.org/wiki/Appendix:Australian_English_terms_for_people wiktionary: Australian English terms for people]; [http://en.wiktionary.org/wiki/Appendix:Australian_English_vocabulary Australian English vocabulary], and others
 +
* Medical and scientific terms. (i.e. haematology vs hematology)
 +
* Australian Aboriginal language [http://www.dnathan.com/VL/eMUcat_20.htm names], Aboriginal regional and tribal names (i.e. Anjumarla, Arabana, Arrernte). [http://en.wikipedia.org/wiki/List_of_English_words_of_Australian_Aboriginal_origin Aboriginal words] used in Australian English.
 +
 
 +
== Hyphenated Words Rules ==
 +
 
 +
* Hyphenate words prefixed by ex-, self-, or all-, and some words prefixed by cross-.
 +
Examples:  ex-wife; self-evident; all-inclusive; cross-reference.
 +
* When written as words, fractions and cardinal numbers consisting of two words are hyphenated.
 +
Examples:  twenty-three, twenty-fifth, one-fourth, two-thirds.
 +
* Do not hyphenate words prefixed by non, un, in, dis, co, anti, hyper, pre, re, post, out, bi, counter, de, semi, mis, mega, micro, inter, over, and under (among others).
 +
Examples:  nonaffiliated, nonemergency, uninfected, inpatient, disorder, disbar, coworker, copayment, antismoking,
 +
antimanagement, hyperactive, hyperrealism, preoperative, prejudge, reoccur, readjust, resubmit, postoperative, 
 +
posttraumatic, outpatient, outmoded, bimonthly, biannual, counterrevolutionary, counterculture, decompress, semifinal,
 +
semiannual, misinformed, misprint, megabyte, microcircuit, interconnected, interoffice, overempha­size, override,
 +
underrepresent, underestimated.
 +
 +
EXCEPTIONS:  When the second element is capitalized, as in Un-American and non-English, a hyphen is used.  Also, occasional
 +
exceptions exist where the prefix and the second element have not (yet) "grown together," such as de-emphasize,  pre-owned,
 +
co-op (to distinguish from coop) and anti-inflammatory (and all words with anti- prefix and second element beginning with i).
 +
 
 +
[http://www.grammarmudge.cityslide.com/articles/article/426348/2805.htm source]
  
 
== Links ==
 
== Links ==
Line 28: Line 73:
 
* The [http://www.geonames.org/ GeoNames] geographical database covers all countries and contains over eight million place names that are available for download free of charge. The Australian place names list - [http://download.geonames.org/export/dump/AU.zip AU.zip]
 
* The [http://www.geonames.org/ GeoNames] geographical database covers all countries and contains over eight million place names that are available for download free of charge. The Australian place names list - [http://download.geonames.org/export/dump/AU.zip AU.zip]
  
 +
* [http://en.wikipedia.org/wiki/Australian_English_vocabulary Australian English vocabulary] and [http://en.wiktionary.org/wiki/Appendix:Australian_English_terms_for_people Australian English terms for people] on Wikipedia.
 +
 +
* Australian National Dictionary Centre, Australian National University, [http://www.anu.edu.au/ANDC/res/aus_words/index.php Australian words]
 +
 +
* [http://www.grammarmudge.cityslide.com/articles/article/426348/2805.htm Hyphenated Words: A Guide]
  
 
[[Category:Localization]]
 
[[Category:Localization]]

Latest revision as of 00:49, 27 January 2010

History

The Australian-English Spelling dictionary is currently packaged with the English dictionaries in the OpenOffice 3.1 distribution.

It is available as an add-on extension for OpenOffice 3.0. And can be added to OpenOffice 2.X [1].

The current dictionary is based on the the United Kingdom-English word list from SCOWL (see links below) with an ad hoc collection of 'Australian' words added. I (David Wilson) have built up this list from a variety of public sources over several years, so that it includes place names, botanical and animal names, common personal names and 'Australianisms'. These have been added to an Australian word list over time.

Australian-English generally follows the conventions of British-English rather than American English. It may useful to see American and British English differences.

The Macquarie Dictionary is generally used by publishers, schools, universities and governments as the standard spelling reference. " The dictionary records standard Australian English spelling, which is closer to British and Canadian English than American English, with spellings like colour, centre, defence and practice/practise (noun/verb). It also gives -ise spellings first, listing -ize spellings as acceptable variants, unlike the Oxford English Dictionary and some other dictionaries of British English, that continue to prefer -ize to -ise in spite of the opposite tendency amongst the British general public (see Oxford spelling)." [2]

I suggest using this dictionary as the project's authoritative reference.

Objectives

The object of this project is improve the organisation and maintenance of the Australian-English Spelling Dictionary.

To improve the maintenance of the word list I propose to add structure to the project by maintaining separate word lists that are combined to produce the new versions of the Australian-English Spelling Dictionary. The separate word lists should help us to ensure we have the best coverage possible in each category and to check each category for adequacy and completeness.

The Australian-English Spelling Dictionary currently uses phonetic hints from the US-English Spelling Dictionary. These may need to be adjusted for Australian usage.

The OpenOffice spelling checker now supports hyphenated words, so a new version needs to be created with appropriate hyphenated words.

If you would like to assist in this project please contact me David Wilson 06:15, 29 October 2009 (UTC)

Issues

We also need to work out how to deal with the -ise vs -ize word endings. Currently only the -ise words are included in the dictionary and -ize words are shown as errors.

We have at least three options:

  1. Include only -ise words.
  2. Include both -ise and -ize words.
  3. It may be possible to include both -ise and -ize words but only suggest -ise spellings when suggesting spelling alternatives. This may be possible Using the NOSUGGEST option which is currently used to suppress a couple of obscene words from being displayed in the suggested spellings list.

Personal Names: Currently the dictionary includes common personal and surnames. I am inclined NOT to include these in the next edition due the large range of spelling variations in names abd difficulties in building any definitive list. For people who want to include names in the spelling checker can use my add-on spelling checker dictionary- Australian English Names Dictionary

Proposed Australian Word list Categories and Sources

Hyphenated Words Rules

  • Hyphenate words prefixed by ex-, self-, or all-, and some words prefixed by cross-.
Examples:  ex-wife; self-evident; all-inclusive; cross-reference.
  • When written as words, fractions and cardinal numbers consisting of two words are hyphenated.
Examples:  twenty-three, twenty-fifth, one-fourth, two-thirds.
  • Do not hyphenate words prefixed by non, un, in, dis, co, anti, hyper, pre, re, post, out, bi, counter, de, semi, mis, mega, micro, inter, over, and under (among others).
Examples:  nonaffiliated, nonemergency, uninfected, inpatient, disorder, disbar, coworker, copayment, antismoking,
antimanagement, hyperactive, hyperrealism, preoperative, prejudge, reoccur, readjust, resubmit, postoperative,   
posttraumatic, outpatient, outmoded, bimonthly, biannual, counterrevolutionary, counterculture, decompress, semifinal,
semiannual, misinformed, misprint, megabyte, microcircuit, interconnected, interoffice, overempha­size, override, 
underrepresent, underestimated.

EXCEPTIONS:  When the second element is capitalized, as in Un-American and non-English, a hyphen is used.  Also, occasional
exceptions exist where the prefix and the second element have not (yet) "grown together," such as de-emphasize,  pre-owned, 
co-op (to distinguish from coop) and anti-inflammatory (and all words with anti- prefix and second element beginning with i).

source

Links

  • Kevin's Word List Page, including the official Hunspell en_UK, en_US and en_CA dictionaries, created from SCOWL (Spell Checker Oriented Word Lists). Revision 6 August 10, 2004 by Kevin Atkinson
  • OpenOffice spelling checker is based on Hunspell. Which is a spell checker and morphological analyzer library and program designed for languages with rich morphology and complex compounding or character encoding.
  • The GeoNames geographical database covers all countries and contains over eight million place names that are available for download free of charge. The Australian place names list - AU.zip
  • Australian National Dictionary Centre, Australian National University, Australian words
Personal tools