ON 20050606@2:55:54 PM at page:
http://www.piclist.org/method/compress/etxtfreq.htm#38509.0104398148
James Newton[JMN-EFP-786] Published and replied to post 38509.0104398148 by lloydod
|Insert 'ISBNs for two separate books are listed above. Do you need more than that?' at: ''
lloydod@yahoo.com.au asks:
Hi,
I would like to know who I can reference for the work listed here on the most common English letters and the most common English words (or letter combinations). Surely this is part of a published academic work?
Thanks
lloyd
|Delete 'P-' before: '' but after: '
http://millikeys.sourceforge.net/freqanalysis.html
A nice piece of software on SourceForge will analyze *your* files for letter frequencies and pair frequencies.
Its author ran the analysis on a set of English works of fiction and reports the statistics he found. Naturally, the space is the most frequent "letter" at 18.74%, followed by "e" at 9.60%. Many of the most frequent "letter" pairs include a space character. The most common pairs, in order of frequency, were "e ", " t", "th", and "he"; each was more common than the 13th most common single letter but less common than the 12th.
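The kind of count described above can be sketched in a few lines of Python. This is only an illustration, not the linked tool's code: the function names `char_and_pair_frequencies` and `top` are made up here, and lowercasing the input while counting the space as an ordinary symbol is an assumption about how such an analysis might normalize text.

```python
from collections import Counter


def char_and_pair_frequencies(text):
    """Return two dicts mapping each symbol (or adjacent pair) to its share in percent.

    Assumption: the text is lowercased and every character, including the
    space, counts as a symbol. The linked tool may normalize differently.
    """
    text = text.lower()
    singles = Counter(text)
    # Overlapping adjacent pairs: "the" yields "th" and "he".
    pairs = Counter(text[i:i + 2] for i in range(len(text) - 1))
    single_total = sum(singles.values())
    pair_total = sum(pairs.values())
    single_pct = {s: 100.0 * n / single_total for s, n in singles.items()}
    pair_pct = {p: 100.0 * n / pair_total for p, n in pairs.items()}
    return single_pct, pair_pct


def top(freqs, n):
    """Return the n most frequent entries, ties broken alphabetically."""
    return sorted(freqs.items(), key=lambda kv: (-kv[1], kv[0]))[:n]
```

To try it on your own file, read the file into a string, pass it to `char_and_pair_frequencies`, and print `top(single_pct, 15)` and `top(pair_pct, 15)`. Percentages for singles and for pairs are computed against separate totals, so comparing a pair's rank with a single letter's rank, as the figures above do, only makes sense if both are measured against the same denominator.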
|Delete 'P-' before: '' but after: '
/techref/method/compress/embedded.htm
Knowledge of text frequencies can be used to compress data.
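As a rough illustration of why frequencies help, the ideal code length for a symbol that occurs with probability p is -log2(p) bits, so frequent symbols can be given codes much shorter than a fixed 8 bits. The sketch below applies that formula to the two figures quoted in this log (space at 18.74%, "e" at 9.60%). The helper name `ideal_bits` is hypothetical, and this is the general information-theoretic bound, not necessarily the scheme used on the linked page.

```python
import math


def ideal_bits(probability):
    """Ideal code length in bits for a symbol with the given probability."""
    if not 0.0 < probability <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(probability)


# Frequencies quoted in this log, expressed as probabilities.
quoted = {"space": 0.1874, "e": 0.0960}

for name, p in quoted.items():
    print(f"{name}: about {ideal_bits(p):.2f} bits instead of 8")
```

A practical coder such as Huffman coding rounds these lengths to whole bits, so the real saving is somewhat smaller than the ideal figure suggests.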
|Delete 'P-' before: '' but after: '
/techref/method/compress/etxtfreq.htm
English text frequencies (letter frequencies, pair frequencies, most common words, etc.)
|Delete 'P-' before: '' but after: '