# Copyright 2002,2007 by Eric House (xwords@eehouse.org). All rights # reserved. # # This program is free software; you can redistribute it and/or # modify it under the terms of the GNU General Public License # as published by the Free Software Foundation; either version 2 # of the License, or (at your option) any later version. # # This program is distributed in the hope that it will be useful, # but WITHOUT ANY WARRANTY; without even the implied warranty of # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the # GNU General Public License for more details. # # You should have received a copy of the GNU General Public License # along with this program; if not, write to the Free Software # Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA. LANGCODE:ru_RU CHARSET:windows-1251 # deal with DOS files LANGFILTER: tr -d '\r' # uppercase all LANGFILTER: | tr [àáâãäåæçèéêëìíîïðñòóôõö÷øùÚûüýþÿ] [ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞß] # LANGFILTER: | tr -s '\n' '\000' # note: don't turn off sorting! Can't do it with GNU 'sort' without # setting LANG D2DARGS: -r -term 10 LANGINFO:

Russian wordlists must be in the Windows-1251 LANGINFO: codepage. Lower-case letters are converted to upper case and LANGINFO: any words that contain letters not listed below are LANGINFO: removed.

# High bit means "official". Next 7 bits are an enum where # Russian==0x0F. Low byte is padding. XLOC_HEADER:0x8F00 8 1 'À' 2 3 'Á' 4 1 'Â' 2 3 'Ã' 2 2 'Ä' 7 1 'Å' 1 4 'Æ' 1 3 'Ç' 7 1 'È' 1 2 'É' 4 2 'Ê' 4 2 'Ë' 2 3 'Ì' 4 1 'Í' 9 1 'Î' 4 2 'Ï' 5 1 'Ð' 5 1 'Ñ' 7 1 'Ò' 4 2 'Ó' 1 5 'Ô' 1 4 'Õ' 1 4 'Ö' 1 3 '×' 1 4 'Ø' 1 5 'Ù' 1 10 'Ú' 2 2 'Û' 4 1 'Ü' 1 8 'Ý' 1 5 'Þ' 2 2 'ß' 2 0 {"_"} # should ignore all after the above