catdoc : Converts Microsoft Word, Excel, PowerPoint and RTF files to text ( http://www.wagner.pp.ru/~vitus/software/catdoc/ )

htmltidy : The granddaddy of HTML tools, with support for modern standards ( http://www.html-tidy.org/ )

paps : Unicode-aware text to PostScript converter ( http://paps.sourceforge.net/ )

uniutils : A set of programs for manipulating and analyzing Unicode text. The analysis utilities are useful when working with Unicode files whose writing system is unknown, when the necessary font is unavailable, or when one needs to inspect invisible characters, find out whether characters have been combined or in what order they occur, or gather statistics on which characters occur. ( http://billposer.org/Software/unidesc.html )

