TEXTractor provides the functionality to extract text and/or metadata from a total of 33 distinct file types.
Please find the full list of supported file types here.
Built using a modified version of the Toxy library (https://github.com/bmlpg/toxy).
Several improvements on emails content extraction.