|
C Experiments:
![]()
Text Processing
![]() |
1999-05-15:
Back in 1999, I put together this page which serves as a jumping-off place for a number of C Experiments I performed in the general category of Text Processing, e.g. splitting and assembling text files; parsing; word counting and extraction; array, vector, list, stack, queue, and tree representations; sorting; searching; transforming; compressing; etc.
Half the fun of working with computers is searching out and discovering new ways to make them do stuff better, i.e. faster, more efficiently, WITHOUT excessive code bloat, etc. So I'm usually playing around with some new idea all the time. If my reporting on those experiments here also helps you implement a new method (or avoid a pitfall) I've encountered, so much the better!
I've chosen to use the text of the King James Version of the Bible in these experiments. The KJV is in the public domain and therefore readily accessible, it provides over 4MB of diverse text to work with, and I can make it do double duty for some Biblical Research in some other venues (See, for example, The Ten Commandments).
![]()
Experiment No. 1: C Display of HTML and Text Files
![]()
Experiment No. 2: C Text Files Character Counts
![]()
Experiment No. 3: C Text File Common Word Counts
![]()
Experiment No. 4: C Text File Individual Word Counts
![]()
Experiment No. 5: (In Progress)