Forget tcl & python banana tricks wouter, Russel has specifically said that he wants support for non ascii chars. That means you'll have to read either RTF or DOC or HTML formats. So the plain old text way of doing it is out of question. If it was text only, anyone could implement a solution to this problem in C++ withn 50 lines of code using the STL set container. Possibly it could be done with VBA macros in word. Maybe it could be done in EMACS. Scott is right with the right OS (*nix) its a free banana trick. Jeethu Rao > -----Original Message----- > From: pic microcontroller discussion list > [mailto:PICLIST@MITVMA.MIT.EDU]On Behalf Of wouter van ooijen & floortje > hanneman > Sent: Monday, July 23, 2001 2:11 AM > To: PICLIST@MITVMA.MIT.EDU > Subject: Re: [OT]: Word counter program wanted > > > > > I need a program to count the occurrence of each separate word in a > > > document. > > Using the right tool (tcl, python) this should be a one-banana trick. Even > in plain C (using hashing) it should not be that difficult, provided that > the total number of different words is not too large (let's say less than > 100k). > > Wouter > > -- > http://www.piclist.com hint: The list server can filter out subtopics > (like ads or off topics) for you. See http://www.piclist.com/#topics > > -- http://www.piclist.com hint: The PICList is archived three different ways. See http://www.piclist.com/#archives for details.