<HTML><BODY style="word-wrap: break-word; -khtml-nbsp-mode: space; -khtml-line-break: after-white-space; "><DIV><SPAN class="Apple-style-span" style="border-collapse: separate; border-spacing: 0px 0px; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; text-align: auto; -khtml-text-decorations-in-effect: none; text-indent: 0px; -apple-text-size-adjust: auto; text-transform: none; orphans: 2; white-space: normal; widows: 2; word-spacing: 0px; "><SPAN class="Apple-style-span" style="border-collapse: separate; border-spacing: 0px 0px; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; text-align: auto; -khtml-text-decorations-in-effect: none; text-indent: 0px; -apple-text-size-adjust: auto; text-transform: none; orphans: 2; white-space: normal; widows: 2; word-spacing: 0px; "><SPAN class="Apple-style-span" style="border-collapse: separate; border-spacing: 0px 0px; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; text-align: auto; -khtml-text-decorations-in-effect: none; text-indent: 0px; -apple-text-size-adjust: auto; text-transform: none; orphans: 2; white-space: normal; widows: 2; word-spacing: 0px; "> <BR class="Apple-interchange-newline"></SPAN></SPAN></SPAN> </DIV><BR><DIV><DIV>On Nov 16, 2006, at 5:55 AM, Jacob Rus wrote:</DIV><BR class="Apple-interchange-newline"><BLOCKQUOTE type="cite"><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Mark Eli Kalderon wrote:</DIV> <BLOCKQUOTE type="cite"><BLOCKQUOTE type="cite"><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">5. Some commands for getting better statistics than the current document statistics count.<SPAN class="Apple-converted-space"> </SPAN>It would be nice to have a word count which knows how to ignore stuff in html/latex/markdown tags, etc. etc. (or maybe bundles can provide overrides to this command, and all call out to a single script), but it would also be nice to be able to get some readability statistics, such as counts of average word length, average sentence length, and maybe metrics like Flesch-Kincaid, etc.</DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">6. It might even be nice to add some tools for checking grammar (flagging things like wordy sentences, etc.).<SPAN class="Apple-converted-space"> </SPAN>There are some decent open-source programs for this, I believe.</DIV> </BLOCKQUOTE><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Jacob, you might have a look at diction, a descendant of the Writer's Workbench:</DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><A href="http://directory.fsf.org/GNU/diction.html">http://directory.fsf.org/GNU/diction.html</A></DIV> </BLOCKQUOTE><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Yes indeed, I was looking at that page when I wrote the post :).<SPAN class="Apple-converted-space"> </SPAN>When I actually have some time to make this thing, I think I will indeed use style/diction.</DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV> <BLOCKQUOTE type="cite"><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">and `diction` checks for grammar (although it perpetuates some of the unfortunate aspects of Strunk and White's prescriptions--- useful nonetheless).</DIV> </BLOCKQUOTE><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">I didn't look at exactly what it did, but I assume it can't be worse than MS Office's grammar check.</DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV> <BLOCKQUOTE type="cite"><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">As for filtering LaTeX commands, I run the generated pdf through ps2ascii before running these. Won't work for html documents, but you could run them through a text browser like lynx or links with the dump option instead.</DIV> </BLOCKQUOTE><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Well, I'm still thinking about the best way to get things to work for multiple document types.<SPAN class="Apple-converted-space"> </SPAN>It has to do a few things: a) strip out extraneous junk, and b) figure out how to get back to the specific places in the document where there are points of interest.</DIV><DIV style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><BR></DIV></BLOCKQUOTE><BR><BLOCKQUOTE type="cite"></BLOCKQUOTE></DIV> As a Latex user, I would like to encourage this bundle development. Diction, as it is, gets tripped up on all the math output.<BR><DIV><BR class="khtml-block-placeholder"></DIV><DIV>Jenny</DIV></BODY></HTML>