Copyright (c) 1987 Regents of the University of California.
All rights reserved. The Berkeley software License Agreement
specifies the terms and conditions for redistribution.

        @(#)README 1.5 (Berkeley) %G%

All files and subdirectories of /usr/share/dict are recommended for
rdisting except web2 and web2a (because of their size), and some of
the files hlist* depending on the needs of your machine (details below).
Descriptions of most of these files are given under FILES below.

The new subdirectory "special" contains lists of words in specialized
fields, which may be hashed in with the regular lists on machines having
many users working in these fields. As of this writing, there are two
such specialized wordlists.

It is advised that system managers also create a directory
/usr/local/dict. This can be used to maintain files of particular
interest to users of each machine (e.g., surnames of members of the
department on a departmental machine).

The hashed wordlists hlista and hlistb in this distribution include
the words in the file special/4bsd, comprising current 4bsd
commands, system calls, etc. (from "abs" to "zcat"). Machines
whose primary users are programmers should take these files by
rdist. For machines with other user populations, a file "hlist" is
provided which contains only the contents of /usr/share/dict/words.
Managers of such machines should rdist this file, and use "spellin" to
produce files hlist{a,b} which contain the words from
/usr/share/dict/{american,british} respectively, plus any other files
appropriate to the needs of the majority of their users. (Some basic
unix commands and terms that general users are likely to encounter,
e.g. troff, emacs, tty, have been included in /usr/share/dict/words.
More may be added as suggestions are received.) Here, for instance, is a
script that might be used to create the hashlists on a particular
machine, so as to include the words in /usr/share/dict/special/math, as
well as two local lists which we will assume are called
/usr/local/dict/surnames and /usr/local/dict/acronyms.

        #
        cd /usr/share/dict
        cat american special/math /usr/local/dict/{surnames,acronyms} | \
                spellin hlist > hlista
        cat british special/math /usr/local/dict/{surnames,acronyms} | \
                spellin hlist > hlistb

   Hashlists can also be created from scratch using
/usr/src/usr.bin/spell/Makefile. This is now written so that if "make"
is run with no options it will produce the hashed files as presently
distributed, but so that the extra wordlists used can be controlled with
variables LOCAL and SPECIAL. For instance, the results given by the
above script can be obtained by doing:

        cd /usr/src/usr.bin/spell
        make LOCAL='/usr/local/dict/surnames /usr/local/dict/acronyms' \
                SPECIAL=special.math
        make install

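Before rebuilding the hashlists by either route, it may be worth
confirming that every local wordlist is actually present: in a pipeline,
cat reports a missing file on standard error, but the pipeline still goes
on to write a hashed list. What follows is only a sketch in POSIX sh. The
function name check_wordlists is hypothetical and is not part of the
spell distribution.

```shell
#!/bin/sh
# Hypothetical helper (not part of the spell distribution): report each
# named wordlist that is missing or unreadable, and return non-zero if
# at least one such file was found.
check_wordlists() {
    status=0
    for f in "$@"; do
        if [ ! -r "$f" ]; then
            echo "missing or unreadable wordlist: $f" >&2
            status=1
        fi
    done
    return $status
}

# Example use, with the two local lists assumed in this README:
#   check_wordlists /usr/local/dict/surnames /usr/local/dict/acronyms \
#       && echo "local wordlists look usable"
```

The check only tests readability; it says nothing about whether the
contents of a list are sensible.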
   Returning to the subject of the wordlists in /usr/share/dict, these are,
in general, ordered as in sort -df. This makes no difference for
spell's hashing process, but makes a difference for other commands,
such as "look", that perform binary searches on the unhashed lists.

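A local list kept in /usr/local/dict can be put into the same order
before it is installed. The sketch below assumes a POSIX sort. The
function name sort_wordlist is hypothetical, and forcing LC_ALL=C is an
assumption made here so that the ordering does not vary with the locale
of the machine.

```shell
#!/bin/sh
# Hypothetical helper: print a wordlist in "sort -df" order.
#   -d  compare using only blanks and alphanumeric characters
#   -f  fold lower case to upper case when comparing
# Reads the named files, or standard input if none are given.
sort_wordlist() {
    LC_ALL=C sort -df "$@"
}

# Example use (the .new suffix is only illustrative):
#   sort_wordlist /usr/local/dict/surnames > /usr/local/dict/surnames.new
```

With folding in effect, a capitalized entry such as "Zcat" sorts after
"troff" rather than ahead of every lower-case word, which is the ordering
a folded binary search expects.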
Complaints, and any additional suggestions for words or wordlists,
should be sent to me. I cannot fix bugs involving the code of "spell",
but I am maintaining a list of these bugs, and of other ideas for
improvement.
                        George Bergman, gbergman@cartan.Berkeley.Edu
                        18 March, 1987

--------------------------------------------------------------------
FILES and subdirectories of /usr/share/dict:

  words -- common words, and important technical terms from all
      fields, that are spelled the same in British and American usage.
  american -- spellings preferred in American but not British usage.
  british -- spellings preferred in British but not American usage.
  stop -- forms that would otherwise be derivable by "spell" from
      words in one of the above files, but should not be accepted.
  hlist -- hashed list, formed from the file "words" only.
  hlista -- hashed list, formed from files {words,american,special/4bsd}.
  hlistb -- hashed list, formed from files {words,british,special/4bsd}.
  hstop -- hashed list, formed from file "stop".
  web2 -- words from Webster's 2nd International (see WEB below).
  web2a -- compounds and phrases from same source.
  README -- this file
  papers/ -- an (out-of-date specialized) bibliographical database,
      used as the default by the program "refer".
  special/ -- directory of less common terms from specialized fields.
      It presently contains:

    special/4bsd -- commands and system calls (from filenames in
      /usr/share/man/man[1238n]), and builtin csh commands (named in
      /usr/share/man/man1/csh.1) of the current version of 4bsd Unix.
      (Supersedes old "/usr/src/usr.bin/spell/local".)
    special/math -- some mathematical terms not in /usr/share/dict/words.

WEB ---- (introduction provided by jaw@riacs) -------------------------

Welcome to Webster's Second International, all 234,936 words worth.
The 1934 copyright has elapsed, according to the supplier. The
supplemental 'web2a' list contains hyphenated terms as well as assorted
noun and adverbial phrases. The wordlist makes a dandy 'grep' victim.

     -- James A. Woods    {ihnp4,hplabs}!ames!jaw    (or jaw@riacs)
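
As a small illustration of that last remark, the following sketch lists
the words that begin with "q" followed by some character other than "u".
It assumes a POSIX grep. The function name q_without_u is hypothetical,
and the path in the usage comment is the web2 location given under FILES.

```shell
#!/bin/sh
# Hypothetical helper: print the words that start with "q" or "Q" and
# whose second character is not "u" or "U".
# Reads the named file, or standard input if none is given.
q_without_u() {
    grep '^[qQ][^uU]' "$@"
}

# Example use:
#   q_without_u /usr/share/dict/web2
```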