The URL of this file is http://www.umich.edu/~jlawler/README.TXT This is the README file for the Lawler/Rhodes Simplex Word Database which contains over 5000 English monosyllables and near-monosyllables. It is placed in the public domain and available on the Web in ZIPfile format. Use Winzip, PKunzip, or some other utility to unpack it. The URL of the database is http://www.umich.edu/~jlawler/monosyl.zip ---------- To whom it may concern: This is DOS only. It's a database with source code in TurboPascal 3. I built it as a tool for my own research, and anybody who can type can use it. But the interface is rather sparse. It's completely character-driven. I did it a long time ago, on a sabbatical, and then froze development, except for data changes. It'll run in a DOS box on any Windows machine. Unload all the files in the same directory with PKUNZIP, then type "MONOSYL", press RETURN, and follow the directions. (Postscript. Twelve years after I froze it, I thawed it briefly to change the database to lowercase from its Heritage Mainframe Upper Case format. This turned out to be very simple, to my surprise.) You can ask for displays of Rimes, Assonances, or Suffixes (one letter responses are enough), then you get asked to enter the Rime, Assonance, or Suffix. You'll find lists below of all of them below in the file descriptions. For details, consult the source code (MONOSYL.PAS). It does not give semantics, only data. And the data is probably not complete, since the database was constructed in the first instance by going through the entire list of possible Assonances and Rimes and entering all combinations that were meaningful to me. I have not parsed in any dictionaries, and have probably missed a few words. Feel free to add them to MONOSYL.DAT, using any ASCII editor. See below for details of format. I built it to show me lists of words with assonances and rimes, so I could see what kinds (if any) of semantic relations they had in common. For comparison purposes, you can generate lists of the same size, chosen randomly from the same database. You're prompted every time you display any Rime or Assonance. You can also save the lists to file, or print them, and you're prompted for that, too. (Since the compiler -- Borland's Turbo Pascal V.3 -- is totally obsolete, and Borland gives it away free anyway, I've felt free to include it with this. It was widely used and was very fast even on an 8088; on a modern PC it's much faster. It's also small and easily configurable. Put the .MSG files in C:\BIN. The TINST files are the installation and configuration program; you can use them to configure the built-in editor. It's quirky but it works. If you want details about the compiler, haunt used bookstores to find a manual for TurboPascal Version 3. There are lots around. TURBO.COM will produce a .COM file, not a .EXE file. They both work under Windows, no problem. If you want to convert it to a .EXE file, I've also included a freeware utility package, LZEXE, that will accomplish this.) A general bibliography of phonosemantics can be found at http://www.conknet.com/~mmagnus/Bibliography.html and further URLs of relevant materials can be found at http://www.conknet.com/~mmagnus/Links.html I have also included in this archive a bunch of the relevant papers, which are also available in the Web. The links are in the bibliography MONOBIB.PDF, also included here. Enjoy. -John Lawler http://www.umich.edu/~jlawler/ U Michigan Linguistics Dept --------------------------------------------------------------------------- "Language is the most massive and inclusive art we know, a - Edward Sapir mountainous and anonymous work of unconscious generations." Language (1921) ---------------------- There are 28 files in the MONOSYL.ZIP archive. ============================================================================ 1) README.TXT is this file. ---------------------------------------------------------------------------- 2) MONOSYL.EXE is the database program. Run it with MONOSYL at the DOS prompt, and then follow the directions. It is compressed with LZEXE. ------------------------------------------------------------------- 3) MONOSYL.PAS is the TurboPascal source code for this program. The compiler version is TurboPascal 3.01 for DOS. ---------------------------------------------------------------------------- 4) INDEX.EXE is the indexing and update program. Run it if you have modi- fied or damaged the database file or an index file. It is an LZEXE file. ------------------------------------------------------------------- 5) INDEX.PAS is the source code (in TurboPascal V.3.01) for the program. ------------------------------------------------------------------- 6) INDEX.T55 is the source code (in TurboPascal V.5.5) for an earlier version of the program. I have not bothered to update it for later versions, which are compiled on V.3.01. ---------------------------------------------------------------------------- 7) MONOSYL.DAT is the data file, containing the words in the following format (which I call "T/S ASCII" or "T-S ASCII" or "T.S.ASCII"): +---------------------------+ | Spelling....Ass Rime Suf | The columns that the various +---------------------------+ parts start on are important, | fragile.....fr aej @l | so don't modify the file unless | fraction....fr aeksh @n | you're *very* careful to keep | fractious...fr aeksh @s | them straight. There are no tabs +---------------------------+ in this file, only blanks. The words are sorted in order of Assonance, then Rime, then Suffix. The order is ASCII order, so " " (blank) precedes "@" (representing schwa). The rest of the letters are all in alphabetic order. The dialect is Midwestern American. The phonemic representation is a modified Trager-Smith (T/S) system: Consonants Vowels +----+------+------+----+ +-------+---------+-------+ | p | t | ch | k | | iy | | uw | | b | d | j | g | | i | | u | +----+----+-+-+----+----+ +-------+-+ @ +-+-------+ | f | th | s | sh | h | | ey | | ow | | v | dh | z | zh | | | e | | o | +----+----+---+----+----+ +---------+--+--+---------+ | m | n | ng | | ae | a | +----+----+---+----+----+ +-------+----+---+--------+ | w | l | r | y | w | | ay | oy | aw | +----+----+---+----+----+ +-------+--------+--------+ === != != != != != != != != != ! WARNING ! =! =! =! =! =! =! =! =! =! === MONOSYL.DAT is *THE* master file. Don't fuck around with it. If you want to change it, to add or subtract words, be *SURE* to keep them in the same format and in the same columns, and keep them sorted *in order of assonance*. You are urged to make a backup copy of MONOSYL.DAT before modifying it. After you change anything in this file, be sure to run the INDEX.EXE program to generate new indices. It's a drag, I know, but the only alternative is a full-blown database system, and I judge that to be overkill (not to mention an unnecessarily restrictive requirement) for this kind of research. === != != != != != != != != != ! DANGER! ! =! =! =! =! =! =! =! =! =! === ---------------------------------------------------------------------------- Danger off :-) The other files below are automatically generated by running the INDEX program. Do anything you like with them, but be sure to make a copy first. If you *do* change them, destroy them and run INDEX to make you a new copy. If the program works but the displays aren't correct, delete the .NDI files and run INDEX. ----------------------------------- 8) ASSON.NDX is an index of the assonances in ASCII format. (see below) List of Assonances (67): +--------------+---------------+--------------+ | Ass. No. | Ass. No. | Ass. No.| +--------------+---------------+--------------+ | (zero) 273 | hy 7 | sl 75 | | sh 94 | j 96 | sm 25 | | shr 14 | k 253 | sn 44 | | shw 1 | kl 84 | sp 75 | | b 275 | kr 89 | spl 14 | | bl 71 | kw 39 | spr 21 | | br 93 | ky 10 | spy 2 | | by 4 | l 222 | st 108 | | ch 77 | m 252 | str 38 | | d 208 | my 4 | sw 47 | | dh 16 | n 140 | t 230 | | dr 50 | p 254 | th 32 | | dw 3 | pl 47 | thr 21 | | f 202 | pr 78 | thw 2 | | fl 82 | py 7 | thy 1 | | fr 61 | r 225 | tr 83 | | fy 9 | s 241 | tw 25 | | g 140 | sf 2 | v 82 | | gl 42 | sk 88 | vy 1 | | gr 78 | skr 35 | w 151 | | h 231 | skw 30 | y 50 | | hw 37 | sky 1 | z 22 | | | | zh 1 | +--------------+---------------+--------------+ ------------------------------------------------------------------- There is no corresponding ASSON.NDI because the words are already sorted in order of assonance and therefore need no index. Which is why you have to keep them that way (see "Danger" above). ---------------------------------------------------------------------------- 9) SUFFIX.NDX is an index of the suffixes in ASCII format. (see below) List of Suffixes (21): +-------------+-----------+-----------+-----------+ | Suffix No. | Suff. No. | Suff. No. | Suff. No. | +-------------+-----------+-----------+-----------+ | (zero) 3582 | @t 133 | @d 27 | @b 3 | | @r 407 | iy 272 | ow 83 | @ns 2 | | li 2 | @s 89 | @ng 10 | @rn 2 | | @l 468 | @k 56 | @ 24 | @z 1 | | @n 173 | @m 46 | @sh 28 | @j 1 | | | | | @nt 1 | +-------------+-----------+-----------+-----------+ ------------------------------------------------------------------- 10) SUFFIX.NDI is a binary file index of suffixes (see above for definitions). ----------------------------------------------------------------------------- 11) RIME.NDX is an index of the rimes in ASCII format. ------------------------------------------------------------------ Rimes, in T/S ASCII...Number of occurrences in database (483 Rimes total) ------- ------- ------- ------- ------- ------- ------- ------- er...76 @k...26 ek...17 ift..11 eynt..7 uwf...4 ang...2 ilm...1 uw...60 eyz..26 uwl..17 orn..11 olt...7 @lk...4 ayst..2 apt...1 or...53 et...26 ist..17 ash..11 amp...7 idh...4 @rz...2 ilj...1 aet..53 eyv..25 ij...17 @ng..11 ayz...7 orj...4 @ns...2 alm...1 ik...48 uwp..25 @j...16 ach..11 ars...7 iyld..4 ath...2 @kt...1 it...47 aesh.25 uwd..16 awl..10 uwth..7 org...4 orp...2 ayth..1 aek..47 @d...25 ent..16 inj..10 eynj..7 orf...4 esp...2 iksh..1 il...47 iyz..24 iych.16 arl..10 oth...7 iysh..4 @ks...2 ug....1 ay...46 uwm..24 @sh..16 eyj..10 @rp...7 ant...4 oyng..2 @dh...1 ey...44 iyp..24 eym..15 ot...10 owf...6 ilk...4 oynt..2 eyg...1 ow...44 owt..23 iyv..15 @rv..10 awnd..6 orb...4 iyzh..2 eych..1 ir...43 aen..23 @l...15 od...10 eg....6 ank...4 imf...2 ond...1 iy...42 own..23 ayv..15 awz..10 oyst..6 ith...4 anz...2 iynd..1 eyn..42 awt..23 ays..15 ayb..10 ud....6 imb...4 @ft...2 @lkt..1 ip...41 iyl..23 ong..15 ech..10 @sk...6 aenth.3 awth..2 @nth..1 owk..40 aemp.22 uws..15 @z...10 arp...6 aench.3 ength.2 aensh.1 aeg..39 @nk..22 @rj..15 uk...10 arch..6 as....3 elch..2 @sp...1 ayt..38 iym..22 ayp..14 aez..10 eft...6 empt..3 enth..2 ensh..1 @r...38 @rn..22 owd..14 aeks.10 @rst..6 emb...3 ipt...2 owlm..1 owl..38 est..22 owv..14 @nch.10 owlt..6 aept..3 its...2 eyzh..1 eyt..37 art..22 @v...14 ish..10 @nj...6 awst..3 owdh..2 @th...1 @t...37 aes..21 @rm..14 iyf..10 iks...6 aelp..3 owks..2 aynth.1 eyl..37 iv...21 aech.14 uwb..10 ept...6 elk...3 ism...2 aemf..1 aem..35 ayd..21 owg..13 ins..10 @rg...6 oft...3 oyd...2 @mpk..1 ink..34 o....21 em...13 iys...9 oz....6 old...3 ird...2 aynt..1 iyk..34 iz...21 @p...13 orm...9 ul....6 ild...3 av....2 iln...1 iyt..34 end..21 eyb..13 ork...9 @rf...5 ayld..3 osh...2 angst.1 aeb..34 uwn..21 aev..13 arm...9 eks...5 @lp...3 ob....2 us....1 @b...34 @rk..21 @mb..13 a.....9 ekt...5 @.....3 anch..2 inth..1 ab...34 @s...20 ilt..13 inch..9 aedh..5 arf...3 uwdh..2 ampt..1 @m...34 ev...20 oy...13 aeft..9 aj....5 esk...3 ansh..2 ets...1 ing..33 ark..20 os...13 aynd..9 arv...5 uf....3 ilf...2 epth..1 at...33 es...20 @rs..13 and...9 eth...5 elth..3 awnj..2 uwzh..1 ak...32 id...20 elt..13 aws...9 awr...5 orth..3 emp...2 aedz..1 aw...31 eyd..20 ord..13 iyg...9 ans...5 ekst..3 uwst..2 alv...1 @f...31 ayr..20 arn..12 ur....9 ef....5 amb...3 oyz...2 eksh..1 uwt..31 int..20 ayf..12 @rch..9 aeth..5 orch..3 @rdh..1 aezh..1 in...31 @rt..19 aef..12 @nd...9 onch..5 ezh...3 aft...1 olm...1 aend.31 eyp..19 ens..12 owsh..8 iyth..5 ast...3 @lj...1 owlj..1 @g...31 @rl..19 ind..12 awns..8 eyf...5 enz...3 arth..1 enj...1 aed..31 ar...19 on...12 awn...8 iyb...5 isp...3 @lb...1 ers...1 aenk.31 of...19 @nt..12 aesp..8 owch..5 uwsh..3 eytth.1 @mpsh.1 ap...30 is...19 ard..12 aekt..8 ush...5 az....3 awj...1 ikt...1 ayn..30 ort..19 ok...12 @ch...8 aeksh.5 uwj...3 edth..1 op....1 aep..30 uwz..18 ag...12 esh...8 ost...5 oyt...3 @lt...1 adh...1 el...29 aest.18 oyl..12 ont...8 elm...5 owth..3 uz....1 ardh..1 ad...29 owp..18 ors..12 awnt..8 uwg...5 @lf...3 uwy...1 @rps..1 ig...29 imp..18 owld.12 arb...8 oyn...5 eydh..3 aenj..1 aelk..1 iyn..29 @rd..18 ep...11 @rth..8 arg...4 ut....3 aepch.1 ayg...1 al...28 ayk..18 ej...11 owb...8 arj...4 @lch..3 aelb..1 aelks.1 ol...28 if...18 @rb..11 eb....8 elf...4 ilch..3 olk...1 eyps..1 iyd..28 ib...18 aemb.11 owst..8 arsh..4 eysh..3 imps..1 ikst..1 @n...27 ich..18 isk..11 iydh..8 uwv...4 eld...3 ern...1 @rzh..1 aeng.27 aym..18 ench.11 uwch..8 elv...4 asp...3 olf...1 aelv..1 ed...27 aens.18 og...11 iyst..8 oys...4 aps...2 ifth..1 @rld..1 @mp..26 aent.18 aks..11 awch..8 inks..4 ols...2 orps..1 idth..1 im...26 eys..18 owm..11 uwk...7 af....4 ows...2 @mz...1 olts..1 ael..26 ayl..18 an...11 eyst..7 elp...4 eyth..2 @ts...1 ormth.1 eyk..26 am...17 awd..11 aesk..7 aydh..4 aeps..2 oyb...1 ulf...1 en...26 @st..17 owz..11 aej...7 iyj...4 ilth..2 imz...1 uwnd..1 irs...2 om....1 awndz.1 ------------------------------------------------------------------- 12) RIME.NDI is a binary file index of rimes (see above for definitions). ----------------------------------------------------------------------------- 13) TURBO.COM is the TurboPascal 3.01a compiler. ------------------------------------------------------------------- 14) TURBO.MSG is the error message file for the compiler. Put it in the directory C:\BIN; this version of TURBO.COM is patched to look for it there. ------------------------------------------------------------------- 15) TINST.COM is the installation program for TURBO.COM. Use it to configure the compiler and its editor if you want; it's optional. If you do use it, it will look for TURBO.COM in the same directory. ------------------------------------------------------------------- 16) TINST.MSG is the message file for TINST.COM. ----------------------------------------------------------------------------- 17) LZEXE.ZIP is the zipfile for LZEXE, a DOS freeware utility package that compresses executable .EXE files to save space. It includes a program (COMTOEXE.EXE) that converts .COM files (like those generated by TurboPascal) to .EXE files, which can then be compressed). This is unnecessary given modern HD and RAM sizes, but it's convenient to have the database in .EXE format so it will run cleanly with MONOSYL.PIF under Windows. If you want to use LZEXE, unpack LZEXE.ZIP separately, read the documentation, and follow the directions. It's simple and reliable. ------------------------------------------------------------------- 18) MONOSYL.PIF is a Windows program information file that allows MONOSYL.EXE to run under Windows with some reasonable default values for screen preferences and icon. Leave it in the same directory as all the other files, then put an alias of it where you want to access it from. You may need to tell it where the other files are, once, if you do that. ----------------------------------------------------------------------------- 19) assonances.pdf is the most recent list of phonosemantic assonances with some information from the appendices of some of my papers. ----------------------------------------------------------------------------- 20) kl-chart.pdf is a Venn diagram of the phonosemantics of the KL- assonance. ----------------------------------------------------------------------------- 21) MONOBIB.PDF is a short bibliography of the studies that have appeared so far based on this resource, including the ones in this zipfile (see below). ------------------------------------------------------------------- 22) Bergen-Dissertation.pdf is Benjamin Bergen's Berkeley dissertation (see MONOBIB.PDF for citation) ------------------------------------------------------------------- 23) Bergen-Phonaesthemes-Lg.pdf is his article in Language (ditto) ------------------------------------------------------------------- 24) Rhodes-Lawler-1981-cls17.pdf is my first article, with Rich Rhodes (ditto) ------------------------------------------------------------------- 25) WomenMen&BristlyThings.pdf is my second article (ditto) ------------------------------------------------------------------- 26) STYLE.PDF is my third article (ditto) ------------------------------------------------------------------- 27) rimecoherence.pdf is my fourth article (ditto) ------------------------------------------------------------------- 28) RIMES.ZIP is an archive containing all the individual analyses for each vigesimal rime treated in rimecoherence.pdf, suitable for student projects. ============================================================================ Last update 20 June 2008