Discussion:
Word counts
(too old to reply)
George Francis
2005-09-26 02:48:21 UTC
Permalink
Hi,
Just for fun I did some word counts on the text for the first 10 books.
Some of the results I got were interesting;

for character names:
'rand' - 8500
'nynaeve' - 4700
'egwene' - 4200
'mat' - 3800
'perrin' - 3200

Interesting that both Mat & Perrin score lower than either Nyn or Eg...

for other items:
'eyes' - 4582
'hair' - 3000
'dress' - 780
'trolloc' - 360
'tea' - 300
'breasts' - 93

I guess it gives you some idea of the relative importance RJ places on
hair, mentioned on average 300 times per book..!

--
I wish I had a kryptonite cross, because then you could keep Dracula
_and_ Superman away.
-Jack Handey
Matt Schroeder
2005-09-26 03:12:01 UTC
Permalink
Post by George Francis
Hi,
Just for fun I did some word counts on the text for the first 10 books.
Some of the results I got were interesting;
'dress' - 780
'trolloc' - 360
'tea' - 300
These three numbers say so very, very much. Any chance you can do a
pre-LOC and post-LOC count? Just curious.

cheers,
matt.
g***@yahoo.com
2005-09-26 10:45:04 UTC
Permalink
George Francis wrote:> for character names:> 'rand' - 8500> 'nynaeve' -
4700> 'egwene' - 4200> 'mat' - 3800> 'perrin' - 3200Funny, I find
slightly more for all those words, but the relativesvalues are kept.>
'hair' - 3000For this one, I find less than 1800.> 'trolloc' -
360Remember that trollocs act in bands: I find 450 "trolloc", but
almost1600 "trolloc(s)" (and almost 1500 in the first sox books).
g***@yahoo.com
2005-09-26 10:57:13 UTC
Permalink
Sorry. This time I previewed it, and it was ok. I'll have to find a
real usenet account and do not let it go stale.
ataha
2005-09-29 22:08:27 UTC
Permalink
Post by George Francis
Hi,
Just for fun I did some word counts on the text for the first 10 books.
Some of the results I got were interesting;
'rand' - 8500
'nynaeve' - 4700
'egwene' - 4200
'mat' - 3800
'perrin' - 3200
Interesting that both Mat & Perrin score lower than either Nyn or Eg...
'eyes' - 4582
'hair' - 3000
'dress' - 780
'trolloc' - 360
'tea' - 300
'breasts' - 93
I guess it gives you some idea of the relative importance RJ places on
hair, mentioned on average 300 times per book..!
Can you check these ones out too ?

'sniff'
'braid'
'tug'
'bath'
'smooth'

Let's all get a good laugh!
Taha
ireadRJordan
2005-09-30 02:15:01 UTC
Permalink
Don't forget Hooked Nosed and Snort.
George Francis
2005-09-30 14:37:08 UTC
Permalink
'sniff' 66
'braid' 271
'tug' 49
'bath' 65
'smoothed' 90
Post by ataha
Post by George Francis
Hi,
Just for fun I did some word counts on the text for the first 10 books.
Some of the results I got were interesting;
'rand' - 8500
'nynaeve' - 4700
'egwene' - 4200
'mat' - 3800
'perrin' - 3200
Interesting that both Mat & Perrin score lower than either Nyn or Eg...
'eyes' - 4582
'hair' - 3000
'dress' - 780
'trolloc' - 360
'tea' - 300
'breasts' - 93
I guess it gives you some idea of the relative importance RJ places on
hair, mentioned on average 300 times per book..!
Can you check these ones out too ?
'sniff'
'braid'
'tug'
'bath'
'smooth'
Let's all get a good laugh!
Taha
steveo
2005-10-01 02:28:35 UTC
Permalink
Post by George Francis
'sniff' 66
'braid' 271
'tug' 49
'bath' 65
'smoothed' 90
No way. Can you do word searches with wildcards to catch all the variations
(e.g. "tug*" for tug, tugged, tugging)?

steveo
Christer Jacobsson
2005-11-13 22:49:37 UTC
Permalink
Post by steveo
Post by George Francis
'sniff' 66
'braid' 271
'tug' 49
'bath' 65
'smoothed' 90
No way. Can you do word searches with wildcards to catch all the variations
(e.g. "tug*" for tug, tugged, tugging)?
steveo
Now I must put in my 5x10**(-2) SEK question. To do word searches, one
must have a online copy of the books. How does one go about obtaining
one? Scanning in a book and process the scanned images so they become
e-documents, e.g. .txt files or are there better ways?

Assume that I have a legally bought hardcopy so I'm not accused
of piracy.

--
/GAIA (Insulin User - 9th Anniversary & 25th Wedding Anniversary! :-))
Team OS/2 e-mail: ***@gaea.se
Chunkawakan
Will Frank
2005-11-13 22:58:41 UTC
Permalink
Post by Christer Jacobsson
Now I must put in my 5x10**(-2) SEK question. To do word searches, one
must have a online copy of the books. How does one go about obtaining
one?
There's a web site that's done the work for you...
http://idealseek.no-ip.com/

- --
Will "scifantasy" Frank - ***@stwing.upenn.edu
"Batman to all points. I could use some air support. Since I can't
fly. At all. Now would be good." --Batman (Bruce Wayne), /Dark Heart/
Tim Bruening
2010-04-25 20:12:57 UTC
Permalink
Post by ataha
Post by George Francis
Hi,
Just for fun I did some word counts on the text for the first 10 books.
Some of the results I got were interesting;
'rand' - 8500
'nynaeve' - 4700
'egwene' - 4200
'mat' - 3800
'perrin' - 3200
Interesting that both Mat & Perrin score lower than either Nyn or Eg...
'eyes' - 4582
'hair' - 3000
'dress' - 780
'trolloc' - 360
'tea' - 300
'breasts' - 93
I guess it gives you some idea of the relative importance RJ places on
hair, mentioned on average 300 times per book..!
Can you check these ones out too ?
'sniff'
'braid'
'tug'
'bath'
'smooth'
How did you count the words, and how loog did it take you?

Rajiv Mote
2005-10-01 03:32:56 UTC
Permalink
Post by George Francis
Hi,
Just for fun I did some word counts on the text for the first 10 books.
Some of the results I got were interesting;
'rand' - 8500
'nynaeve' - 4700
'egwene' - 4200
'mat' - 3800
'perrin' - 3200
Interesting that both Mat & Perrin score lower than either Nyn or Eg...
'eyes' - 4582
'hair' - 3000
'dress' - 780
'trolloc' - 360
'tea' - 300
'breasts' - 93
I guess it gives you some idea of the relative importance RJ places on
hair, mentioned on average 300 times per book..!
The results are indeed amusing, but I'm more interested in the
electronic copy you're apparently grepping of the entire Wheel of
Time... As a combined data-mining and WoT geek, the kinds of indexing
that would make possible makes me salivate just a little.

Let's see how Google does on the legality of their "online library,"
and perhaps we'll talk...
Frank van Schie
2005-10-01 17:54:20 UTC
Permalink
Post by Rajiv Mote
Let's see how Google does on the legality of their "online library,"
and perhaps we'll talk...
Google's library idea has been in practice for the WoT books a long time
now. It's called IdealSeek:
http://idealseek.no-ip.com/

Yeah.
Christophe Choumert
2005-10-04 05:18:53 UTC
Permalink
Post by Frank van Schie
Post by Rajiv Mote
Let's see how Google does on the legality of their "online library,"
and perhaps we'll talk...
Google's library idea has been in practice for the WoT books a long time
http://idealseek.no-ip.com/
Thanks for the link !

I tried this :
http://idealseek.no-ip.com/IdealSeek.cgi?q=The+Wheel+weaves (82 matches)
The frequence decreases sharply when Moiraine gets out of the picture. :)

But I haven't found a good measure for 'chin' : 'chin' alone is too wide,
'Elayne+chin' too narrow. :(
NicolasC
2005-10-05 15:31:36 UTC
Permalink
Post by Christophe Choumert
Post by Frank van Schie
Post by Rajiv Mote
Let's see how Google does on the legality of their "online library,"
and perhaps we'll talk...
Google's library idea has been in practice for the WoT books a long time
http://idealseek.no-ip.com/
Thanks for the link !
http://idealseek.no-ip.com/IdealSeek.cgi?q=The+Wheel+weaves (82 matches)
The frequence decreases sharply when Moiraine gets out of the picture. :)
But I haven't found a good measure for 'chin' : 'chin' alone is too wide,
'Elayne+chin' too narrow. :(
I found "one power" 666 times... :)
Loading...