Open Office files are zipped collections of xml files. The text/data is usually contained in a sub-file called content.xml. This program accesses that file and extracts the text/data. It can also attempt a full recovery with another method.
This software is not reviewed yet.
This program will extract and convert worksheets to CSV even from damaged or corrupted Excel 2007 xlsx files. It succeeds at doing so where Excel 2007 itself fails to salvage data. The program is coded in Perl/Tk with a GUI interface.
This program will extract the text even from damaged or corrupted Word 2007 docx files. It succeeds at doing so where Word 2007 itself fails to salvage text. The program is coded in Perl/Tk with a GUI interface.
Savvy Repair for Microsoft Office, tries four methods for repair or recovery from corruption of Word DOCX, Excel XLSX and PowerPoint PPTX files. Each method starts with the repair of the zip structure.
Command-Line Corrupt Office 2007 Text Extractor extracts text from corrupted docx, xlsx and pptx files where the respective Office 2007 or 2010 programs fail to make this basic recovery. It also works on non-corrupt files.
Finds first XML error, then truncates a configurable number of characters before the error and using the great xmllint, recovers the salvageable part of the file by automatically adding the correct end tags.