170 commits

Author SHA1 Message Date
Eric House
6f9ba42e21 iso-8859-1 -> utf8 for files and the dicts they build 2010-10-14 05:58:49 -07:00
Andy2
73bd9be80a add Makefile for CollegeEng dict 2010-08-10 21:44:53 -07:00
Eric House
75cb1a7bcb ignore lots; new Makefile for TWL06 2010-06-15 19:24:25 -07:00
Eric House
e5c8720593 first shot at building dicts for Slovak. Result seems to work. 2010-06-06 07:14:15 -07:00
ehouse
8abdd729e5 Czech files for utf-8-encoded input 2009-11-27 19:22:03 +00:00
ehouse
5bf8258190 remove _ from langname 2009-10-28 03:04:50 +00:00
ehouse
c0040616b1 add option to output dict formatted as json 2009-09-24 04:23:30 +00:00
ehouse
57890381e3 first cut at Greek. Seems to work via Makefile, but need counts and values info still. 2009-04-23 12:10:10 +00:00
ehouse
1ac648f3e6 add umlaut to filenames as additional test 2009-03-29 15:38:17 +00:00
ehouse
f66fa25e93 hide files; fix path to dict. 2009-03-23 00:08:11 +00:00
ehouse
b7f32f28be allocate 4x the memory so biggest Polish dictionary fits; fix backwards map declaration (these should be moved to trunk). Always assume utf8 input; fix size calculations. 2009-03-22 23:47:12 +00:00
ehouse
5b78c4183a U rather than W to pack 2009-03-22 23:45:15 +00:00
ehouse
b0ae16c5e4 Replace A and E with umlaut'd unicode chars for a better test case 2009-03-20 03:45:36 +00:00
ehouse
eae332764d dictionary built on this branch, with utf-8 tile faces 2009-03-19 03:16:05 +00:00
ehouse
92b8a0b694 fix for unicode: don't assume all chars have value <= 255 2009-03-19 03:08:25 +00:00
ehouse
eb3d97a334 First changes for building dictionaries (with tile faces in UTF-8). Test cases are English and Polish, and with these changes English seems to build ok and decode via dawg2dict.pl while Polish does too but a lot of words are missing. Building using Makefiles only so far, not BYOD. 2009-03-18 04:16:26 +00:00
ehouse
83a6970573 Changes to fix BYOD (though still need native speaker confirmation) 2009-03-14 22:33:53 +00:00
ehouse
f0ccaa92bc Fix so can build iso-8859-2 Polish dicts using make (won't work on BYOD yet): add encoding to emacs mode line and fix the letters, including hard-coding them as decimal numbers until I can figure out how to get perl (in xloc.pm) to emit iso-8859-2 instead of utf8. 2009-03-14 19:27:29 +00:00
ehouse
121b8253cf accept ISO-8859-2; remove unused param; add assert that EOF/EOL aren't part of a multibyte char 2009-03-14 19:22:15 +00:00
ehouse
b8a0da5366 build dict2dawg with debug symbols 2009-03-14 19:21:09 +00:00
ehouse
d46152b5f6 if setlocale doesn't work, try again with en_US -- works around problem on my ISP. 2009-01-28 03:32:21 +00:00
ehouse
74c5f79338 Set locale based on params passed in, only on ENV if not specified. 2009-01-25 20:13:36 +00:00
ehouse
4661ac8227 Cleanup to run on a machine that's utf8: specify iso-8859-1 when needed. 2009-01-25 18:57:05 +00:00
ehouse
64407edd26 make tarball ready to be dropped into byod 2009-01-25 18:48:29 +00:00
ehouse
c551f6b372 Need to escape period in grep pattern to get literal dot! 2009-01-22 04:30:35 +00:00
ehouse
ad9bc8b5cc change default dictionary 2009-01-21 05:36:43 +00:00
ehouse
8043829ece smartphone-size small bitmaps 2009-01-21 05:25:43 +00:00
ehouse
70371d12f5 Make smaller bitmaps 8x8 since that's the smallest size that can be required and StretchBlt to smaller can't work for letters. 2009-01-18 18:25:33 +00:00
ehouse
7201ef2fa8 Tweaks to bitmaps; build for wince by default 2009-01-17 18:39:08 +00:00
ehouse
74c6457f83 add emacs mode line 2009-01-14 13:41:25 +00:00
ehouse
b45fc82771 Allow language Makefile to specify encoding. Pass to perl and c++ dict builders, using it to open files and to determine whether to do multi-to-wide conversion. 2009-01-13 13:32:07 +00:00
ehouse
2691b00dc9 Add target to build all languages. Stops on Swedish at the moment. 2009-01-13 13:19:15 +00:00
ehouse
dd81444a45 To support Catalan, add Makefile and bitmaps for three special tiles. The first of these, L-high-dot-L, requires Unicode to be properly drawn, but the current dict format doesn't support it so it'll be L-L for now. Bitmaps are still rough. 2009-01-13 13:17:58 +00:00
ehouse
a719d6fc50 Add type Letter to represent what are Tiles in Crosswords: lang-independent indices into the set of letters in use. Should be no change in functionality or code generated. 2009-01-07 05:13:45 +00:00
ehouse
0ba80fa89f Fix compiler warnings. Should be no change in generated code. 2009-01-07 05:03:13 +00:00
ehouse
d695420b95 Compress user-visible name so more likely to fit on-device widgets 2008-10-29 08:47:12 +00:00
ehouse
5d7a82a73f Don't choke when words are longer than 15 letters. 2008-10-08 04:37:44 +00:00
ehouse
3ac39b1290 Make new FAA 4.1 the default Spanish dictionary source; build three dicts (8, 9 and 15) by default (all: target). 2008-09-18 03:55:04 +00:00
ehouse
78c5574bce Fix to compile with newer g++; increase size of buffer to handle largest Spanish wordlist. 2008-09-18 03:44:43 +00:00
ehouse
147f8a4357 Accept words that contain no vowels. 2008-07-12 19:37:27 +00:00
ehouse
d29968a3c1 update in accordance with current Dutch practice (says an informant) 2008-07-10 03:13:33 +00:00
ehouse
265a2bcc28 Break Czech into two "languages" as a way to support the two encodings in common use. 2008-03-19 04:47:03 +00:00
ehouse
835a0ae45e Remove windows LF chars just in case; take SOURCEDICT via cmdline; add emacs modeline. 2008-03-15 15:00:46 +00:00
ehouse
15a2154720 Compile dict2dawg by default since dict2dawg.pl has problems; fix warnings. 2008-03-15 14:52:23 +00:00
ehouse
e06b2d502b line column heads up correctly again 2008-02-23 22:00:40 +00:00
ehouse
0ab471e8c3 Fix to work with BYOD: pass -r rather than use grep to pull illegal words; fix language code; include charset. 2008-02-23 21:59:38 +00:00
ehouse
e6f5ab9d01 First cut at handling Czech. Correspondent says the Palm dict looks right. Still need to test on Windows and on BYOD. 2008-02-20 03:50:32 +00:00
ehouse
8ade36c336 cleanup; add lineno so number of letters is apparent 2008-02-20 03:44:31 +00:00
ehouse
c8ead8477e add target for dict2dawg 2008-01-02 01:44:12 +00:00
ehouse
b527110379 replace all __FUNCTION__ with __func__ 2007-12-02 19:13:25 +00:00