Eric House
07e93971d3
makefile for latest CSW
2012-01-17 18:19:57 -08:00
Eric House
cfa4c96d22
just for grins: japanese dict-building files. There are too many kana
...
for the current format so this can only be for demos, but I might as
well record it.
2011-08-29 20:42:27 -07:00
Andy2
332767105c
express size in K (rounding up)
2011-05-15 07:37:29 -07:00
Andy2
7ccacdc26d
switch size and wordcount columns
2011-05-15 07:28:10 -07:00
Andy2
deeb2f3cba
fix compile-command
2011-04-29 06:24:41 -07:00
Eric House
1ab5aa02b9
Makefile for new dict containing 4288 words: good for the robot.
2011-04-14 22:09:44 -07:00
Andy2
4272686034
Makefile for new smaller Dutch wordlist
2011-04-08 22:13:31 -07:00
Andy2
ce61427bba
generate md5 sum file optionally. Later I'll want to download these
...
to check that the file arrived safely.
2011-03-02 19:00:25 -08:00
Eric House
beaa7ba5a5
assume dict is utf8-encoded but check and fail if it isn't
2011-02-08 20:57:41 -08:00
Eric House
481a533e58
ignore uncompressed dicts too
2011-01-24 22:21:44 -08:00
Eric House
c7b6d799f0
switch to utf8
2011-01-07 18:05:57 -08:00
Andy2
5459631c76
No need for empty .dict when creating empty .dict.gz
2011-01-06 18:20:56 -08:00
Andy2
6f2cde1304
create an index at the top of page; indent dict lines; drop ".xwd"
2011-01-06 18:09:10 -08:00
Andy2
2cc46d8a69
get rid of unused but oft-included file
2010-12-17 19:02:01 -08:00
Andy2
0ee156c9f0
add empty: case for WINCE type too
2010-12-17 18:55:44 -08:00
Andy2
c0bec75fd8
fix crash when input wordlist is empty by not counting zero-length
...
word as a word.
2010-12-17 18:55:25 -08:00
Andy2
c5e0955460
simplify build rule
2010-12-17 17:39:33 -08:00
Andy2
7e46163988
add counts and values -- from wikipedia article, as are Arabic and
...
Turkish files just checked in.
2010-12-17 17:38:47 -08:00
Andy2
18f8b0d4e4
switch to utf-8, adding an iconv call to translate the wordlists.
2010-12-17 17:37:57 -08:00
Andy2
32fccca995
Turkish. As with Arabic, untested.
2010-12-17 17:36:38 -08:00
Andy2
71559e27c6
add Arabic. I have no wordlist but this should still allow play
...
between humans, even over the net. Untested, though, as my phone
doesn't have any Arabic glyphs.
2010-12-17 17:36:03 -08:00
Andy2
d1605c4493
fix: convert to utf8 and replace grep that didn't work (presuambly
...
because ranges have different meanings in utf-8) with one that does.
2010-12-13 20:39:04 -08:00
Andy2
d78584fddf
remove obsolete, pre-utf8 files
2010-12-13 20:09:26 -08:00
Andy2
bb0a79914b
add conversion from ISO88591 since the default dict's in that format.
2010-12-13 20:09:09 -08:00
Andy2
dc807c948a
use sed instead of tr since as with Slovak a letter was getting
...
dropped. Same one in fact.
2010-12-13 19:58:37 -08:00
Andy2
299c84bb2b
use sed rather than tr to uppercase letters. tr was dropping the Á
...
letter for some reason. The sed feature I'm using is a gnu extension
but has the advantage of working. Should probably do this for all
languages and in the info files.
2010-12-13 18:16:22 -08:00
Eric House
894afdc0cb
take words up to 15 letters long. This makes no difference with any
...
dict I've tried as there just aren't any words over 7 letters long
made up of only a-f.
2010-12-12 20:02:28 -08:00
Eric House
e8e0b25fad
go back to old dict -- correcting a change I didn't mean to check in.
2010-12-12 20:01:33 -08:00
Eric House
9c5b2c9f4f
add for current French list
2010-12-09 21:22:37 -08:00
Eric House
98456dd652
fix to build dicts, wince/android format by default
2010-12-09 21:22:14 -08:00
Eric House
6b58c9031f
script to build html page for downloading dicts
2010-12-09 21:21:41 -08:00
Andy2
39b40a9a3d
build with a header giving word count
2010-12-06 18:31:12 -08:00
Andy2
12508b7cd5
cleanup stderr output
2010-12-06 07:23:22 -08:00
Andy2
0072112b5a
fix syntax for including newheader so only one gets included. Fixes
...
bug building multiple dicts where headers would accumulate.
2010-12-06 07:23:05 -08:00
Eric House
c4cdc24b78
initial changes to add a header to xwd format so that stuff like
...
number of words can be included. Changed to build dicts and linux to
open them. Android still needs to learn. Also, some of the tools in
dawg/ need to be fixed to read old-format (pre-utf8) .xwd files.
2010-12-05 19:33:10 -08:00
Eric House
eff2324950
fix compile command
2010-12-05 19:30:00 -08:00
Eric House
bef1e125bf
ignore .pdb files
2010-12-05 19:29:15 -08:00
Andy2
e89feb62d8
second part of manual merge of unicode_branch's dawg/ directory into
...
this one. This adds the directories and their files created inside
dawg.
2010-11-30 18:38:05 -08:00
Andy2
79990bc7b1
first set of changes formed by applyinig diff of android_branch's
...
dawg/ directory against unicode_branch's. The two branches seem to
have to common ancestor -- probably didn't survive translation from
svn -- so this is the best I can do.
This checkin is all the files that were modified by the patch plus a
couple of simple additions. Next I'll be adding directories that the
patch created. It also reintroduced a bunch of .cvsignore files; I
won't check those in.
2010-11-30 18:35:11 -08:00
Eric House
2a2f4d4395
been a while since cvs...
2010-11-09 05:53:49 -08:00
Eric House
3716218a1d
ignore files in dawg/
2010-07-07 23:18:14 -07:00
Eric House
48946996b8
ignore file in dawg/
2010-07-07 23:17:13 -07:00
ehouse
8dca48b3ea
Useful ftell, commented out.
2009-03-29 18:13:09 +00:00
ehouse
9e5b3f8f29
Changes to fix BYOD (though still need native speaker confirmation)
2009-03-14 22:33:53 +00:00
ehouse
690bf80b7b
Fix so can build iso-8859-2 Polish dicts using make (won't work on
...
BYOD yet): add encoding to emacs mode line and fix the letters,
including hard-coding them as decimal numbers until I can figure out
how to get perl (in xloc.pm) to emit iso-8859-2 instead of utf8.
2009-03-14 19:27:29 +00:00
ehouse
0b0bf96cd5
accept ISO-8859-2; remove unused param; add assert that EOF/EOL aren't
...
part of a multibyte char
2009-03-14 19:22:15 +00:00
ehouse
b16a07d0ba
build dict2dawg with debug symbols
2009-03-14 19:21:09 +00:00
ehouse
b9dce19a93
if setlocale doesn't work, try again with en_US -- works around
...
problem on my ISP.
2009-01-28 03:32:21 +00:00
ehouse
b7fa674c28
Set locale based on params passed in, only on ENV if not specified.
2009-01-25 20:13:36 +00:00
ehouse
90f8a276e1
Cleanup to run on a machine that's utf8: specify iso-8859-1 when needed.
2009-01-25 18:57:05 +00:00