Scrabbling for Mastery

a patzer's journey

Lists of words grouped by number of anagrams in two lexicons
zyzzyva
boshvark
magratheazaphod asked me whether it is possible to create lists of words that have M anagrams in one lexicon and N anagrams in another. For example, is it possible to create a list of CSW07 words that have 2 anagrams, only 1 of which is acceptable in OWL2?

I thought through the problem and got pretty close to a Zyzzyva-only solution, but it wasn't complete. To complete my original idea, Zyzzyva will need an "In Anagram List" search condition similar to the current "In Word List" condition. I'm planning to implement that eventually, but not today.

Instead, I wrote a script to generate some word lists, which you can use with In Word List conditions. I've placed the word lists here:

http://zyzzyva.net/misc/grouped-by-num-anagrams/

Under the "words" directory, there is a zip file for each of these lexicon pairs: CSW07/OWL2, CSW12/OWL2, and CSW12/CSW07. Inside each zip file are several files named like "Lex1-M-Lex2-N.txt", which means that each word in that list has M anagrams in Lex1 and N anagrams in Lex2. For example, the file "CSW07-02-OWL2-01.txt" contains all the CSW07 words that have 2 anagrams in CSW07 but only a single anagram in OWL2.
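For anyone curious how lists like these could be generated, here is a minimal Python sketch. It is not the script I actually used; the function names and the toy lexicons "LexA" and "LexB" are made up for illustration, and it assumes that "number of anagrams" means the number of valid words sharing an alphagram, counting the word itself.

```python
from collections import defaultdict

def alphagram(word):
    """Return the word's letters in alphabetical order, e.g. TEA -> AET."""
    return "".join(sorted(word.upper()))

def count_by_alphagram(words):
    """Map each alphagram to the number of distinct words that share it."""
    counts = defaultdict(int)
    for w in {w.upper() for w in words}:
        counts[alphagram(w)] += 1
    return counts

def group_by_anagram_counts(lex1_name, lex1_words, lex2_name, lex2_words):
    """Group the words of lex1 under names like 'Lex1-MM-Lex2-NN.txt'.

    MM is the number of anagrams in lex1, NN the number in lex2
    (assumption: both counts include the word itself when it is valid).
    """
    counts1 = count_by_alphagram(lex1_words)
    counts2 = count_by_alphagram(lex2_words)
    groups = defaultdict(list)
    for w in sorted({w.upper() for w in lex1_words}):
        a = alphagram(w)
        m = counts1[a]
        n = counts2.get(a, 0)
        groups[f"{lex1_name}-{m:02d}-{lex2_name}-{n:02d}.txt"].append(w)
    return dict(groups)

# Toy word lists, NOT real lexicon contents.
lex_a = ["TEA", "EAT", "ATE", "DOG", "GOD"]
lex_b = ["TEA", "EAT", "DOG"]
groups = group_by_anagram_counts("LexA", lex_a, "LexB", lex_b)
```

With the toy lists above, ATE, EAT and TEA land in the "LexA-03-LexB-02.txt" group, and DOG and GOD land in "LexA-02-LexB-01.txt". Writing each group to a file of that name would give one word list per M/N combination.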

One interesting thing I discovered while creating these lists is that AIKMS has the greatest number of CSW anagrams (5) with no anagrams in OWL2! You can see these words in the file called "CSW07-05-OWL2-00.txt".

Anyway, feel free to use these word lists if you find them useful. Let me know if you have any questions.


awesome!! thank you so much! you are the man

Heh, thanks. It was an interesting problem and a good excuse to write a little Perl.

OMG i was just thinking that i wanted exactly this! thank you

Cool! You're welcome. I think I'll find them useful myself when I get back to studying Collins.

Below is some more interesting info... also, I've further broken down the anagrams by word length and placed them here:

http://zyzzyva.net/misc/grouped-by-length-and-num-anagrams/

Darryl Francis asked me a couple interesting questions, and it wasn't too much extra trouble to find the answers. First, for each different word length 2-15, what is the greatest number of CSW12 anagrams with no anagrams in OWL2? Second, for each different word length 2-15, what is the greatest difference between the number of CSW12 anagrams and the number of anagrams in OWL2?

I'll start with the second question. Here are the alphagrams, by word length, with the greatest difference between the number of CSW12 anagrams and OWL2 anagrams. Note that the greatest difference at any word length belongs to AERST, which has 13 CSW12 anagrams but only 6 in OWL2, a difference of 7. The next greatest difference belongs to EORSTW, AEGINST, and AEINORST, each of which has a difference of 5.

Length: CSW12 - OWL2 = Diff, Alphagrams
02: 2 - 0 = 2, GU
03: 3 - 0 = 3, ANS, GSU, IOS
04: 4 - 0 = 4, EKST
05: 13 - 6 = 7, AERST
06: 7 - 2 = 5, EORSTW
07: 11 - 6 = 5, AEGINST
08: 7 - 2 = 5, AEINORST
09: 3 - 0 = 3, AGINOSSTT, EEIIMNOST, EIMNNOPST, GINORSTTU
    4 - 1 = 3, ACEEINRSS
    5 - 2 = 3, AEINORSST, EEGINRSTT
10: 4 - 0 = 4, AEEILNPRST
11: 3 - 0 = 3, ACDEEIMORST, ACEEILNORST, CEEELPRSSST
    4 - 1 = 3, ADEGIILNOST
    6 - 3 = 3, ACEIIMNORST
12: 3 - 0 = 3, ABDEEIILRSST
13: 2 - 0 = 2, (20 alphagrams)
14: 2 - 0 = 2, (8 alphagrams)
15: 3 - 0 = 3, AACDEIIILMNOSST
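A table like this can be computed with a short pass over the alphagram counts. The sketch below is only an illustration, not my actual script; the function name is hypothetical, and it again assumes that the anagram count for an alphagram includes every valid word with those letters.

```python
from collections import Counter

def alphagram(word):
    """Return the word's letters in alphabetical order."""
    return "".join(sorted(word.upper()))

def greatest_differences(lex1_words, lex2_words):
    """For each word length, find the alphagrams with the largest
    (lex1 anagram count - lex2 anagram count).

    Returns {length: (diff, [(alphagram, lex1_count, lex2_count), ...])}.
    """
    c1 = Counter(alphagram(w) for w in {w.upper() for w in lex1_words})
    c2 = Counter(alphagram(w) for w in {w.upper() for w in lex2_words})
    best = {}
    for a in sorted(c1):
        m, n = c1[a], c2.get(a, 0)
        diff = m - n
        length = len(a)
        if length not in best or diff > best[length][0]:
            # New maximum for this length: start a fresh list.
            best[length] = (diff, [(a, m, n)])
        elif diff == best[length][0]:
            # Tie with the current maximum: keep both alphagrams.
            best[length][1].append((a, m, n))
    return best

# Toy word lists, NOT real lexicon contents.
lex_a = ["TEA", "EAT", "ATE", "DOG", "GOD", "AB", "BA"]
lex_b = ["TEA", "DOG", "GOD"]
result = greatest_differences(lex_a, lex_b)
```

Ties are kept together, which is why some lengths in the table list several alphagrams on one line and why a length can have more than one line when the same difference comes from different counts.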

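For most lengths, the answer to the first question is already in the table above: it's the line with 0 OWL2 anagrams. The exceptions are lengths 5 through 8, where the greatest number of CSW12 anagrams with no OWL2 anagrams is: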

05: 5, AIKMS
06: 4, AEKNRS, CHIRST, ELLORS, IMSSTU
07: 3, (24 alphagrams)
08: 3, (8 alphagrams)
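The first question only looks at alphagrams that have no valid words at all in the second lexicon. A rough sketch of that filter follows; the function name is hypothetical and the word lists are toy data, so treat it as an illustration and not the script I ran.

```python
from collections import Counter

def alphagram(word):
    """Return the word's letters in alphabetical order."""
    return "".join(sorted(word.upper()))

def most_anagrams_absent_from_lex2(lex1_words, lex2_words):
    """For each word length, find the alphagrams with the most lex1
    anagrams among those with zero anagrams in lex2.

    Returns {length: (lex1_count, [alphagram, ...])}.
    """
    c1 = Counter(alphagram(w) for w in {w.upper() for w in lex1_words})
    lex2_alphagrams = {alphagram(w) for w in lex2_words}
    best = {}
    for a in sorted(c1):
        if a in lex2_alphagrams:
            continue  # at least one anagram is valid in lex2, so skip it
        m = c1[a]
        length = len(a)
        if length not in best or m > best[length][0]:
            best[length] = (m, [a])
        elif m == best[length][0]:
            best[length][1].append(a)
    return best

# Toy word lists, NOT real lexicon contents.
lex_a = ["TEA", "EAT", "ATE", "DOG", "GOD", "ACT", "CAT"]
lex_b = ["TEA", "DOG"]
result = most_anagrams_absent_from_lex2(lex_a, lex_b)
```

In the toy data, AET and DGO are skipped because each has at least one word in the second list, leaving ACT as the best length-3 alphagram with 2 anagrams.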
