2007-10-29 Markus Schwab * ChangeLog, NEWS: Changes of V0.9.02 documented * po/de.po, po/en.po, po/es.po: New start-of-value for PDF-docs added * configure.in: Version to 0.9.02 * src/ParsePDF.cpp, src/ParsePDF.h: Fixed parsing values containing a closing parenthesis ')' * src/Properties.h: Removed unneeded method clear() 2007-03-08 Markus Schwab * ChangeLog, NEWS: Changes of V0.9.01 documented * configure.in: Version to 0.9.01 * doc/IExtract.html.en, doc/IExtract.html.es: Minor changes 2007-03-05 Markus Schwab * INSTALL: Minor fix * src/Writer.cpp: Fixed compile-error; Improvements * src/IExtract.cpp, src/Writer.cpp: Can be compiled under x86_64 2007-01-30 Markus Schwab * ChangeLog, NEWS: Changes of V0.9.00 documented * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Copyright til 2007; Added plugins * src/IExtract.cpp: Copyright in info-txt to 2007; Improved plug-in errormsg * po/de.po, po/en.po, po/es.po: Added messages of V0.9 2007-01-27 Markus Schwab * INSTALL: Documented --enable-plugins * configure.in: Needs libYGP 0.8.20; Version to 0.9.00; Added --enable-plugins * src/Makefile.am: Plugins are only in distribution; not in build 2007-01-26 Markus Schwab * README: Added plug-ins * src/IExtract.cpp: Supporting plug-ins * src/ParseJPG.cpp, src/ParseMP3.cpp, src/ParseMSOffice.cpp, src/ParseOGG.cpp, src/ParseOOffice.cpp, src/ParsePNG.cpp, src/ParseSOffice.cpp: Using utility-functions from libYGP * src/Properties.h: Formatting changed * src/Makefile.am: Moved Utiltiy.h, FileTypeChk.{cpp,h} to libYGP; Installing Properties.h; Fixes * src/FileTypeChk.cpp, src/FileTypeChk.h, src/Utility.h: Deleted (moved to libYGP) * src/Plugins/Makefile.am, src/Plugins/Text.cpp: New file. 2007-01-24 gettextize * m4/codeset.m4: New file, from gettext-0.16.1. * m4/glibc2.m4: New file, from gettext-0.16.1. * m4/glibc21.m4: New file, from gettext-0.16.1. * m4/intdiv0.m4: New file, from gettext-0.16.1. * m4/intl.m4: New file, from gettext-0.16.1. * m4/intldir.m4: New file, from gettext-0.16.1. * m4/intmax.m4: New file, from gettext-0.16.1. * m4/inttypes_h.m4: New file, from gettext-0.16.1. * m4/inttypes-pri.m4: New file, from gettext-0.16.1. * m4/lcmessage.m4: New file, from gettext-0.16.1. * m4/lock.m4: New file, from gettext-0.16.1. * m4/longdouble.m4: New file, from gettext-0.16.1. * m4/longlong.m4: New file, from gettext-0.16.1. * m4/printf-posix.m4: New file, from gettext-0.16.1. * m4/size_max.m4: New file, from gettext-0.16.1. * m4/stdint_h.m4: New file, from gettext-0.16.1. * m4/uintmax_t.m4: New file, from gettext-0.16.1. * m4/ulonglong.m4: New file, from gettext-0.16.1. * m4/visibility.m4: New file, from gettext-0.16.1. * m4/wchar_t.m4: New file, from gettext-0.16.1. * m4/wint_t.m4: New file, from gettext-0.16.1. * m4/xsize.m4: New file, from gettext-0.16.1. * configure.in (AC_OUTPUT): Add intl/Makefile. 2007-01-17 Markus Schwab * doc/IExtract.html.en: Typo fixed 2007-01-06 Markus Schwab * src/IExtract.cpp: Adapted to new FileTypeChecker-interface; First draft of extending checkers * src/FileTypeChk.cpp, src/FileTypeChk.h: First draft of extendible checkers 2006-12-30 Markus Schwab * src/IExtract.cpp: First draft supporting dynamic handlers 2006-10-01 Markus Schwab * ChangeLog, NEWS: Changes of V0.8.11 documented * Makefile.am: Removed m4 subdirectory * configure.in: Version to 0.8.11; Using AC_GNU_SOURE; Removed m4/Makefile * po/de.po, po/en.po, po/es.po: Actualised 2006-09-11 Markus Schwab * doc/IExtract.html.de: Adapted to new grammar * src/IExtract.cpp: Fixed trap when passing an invalid option to -M 2006-08-02 Markus Schwab * ChangeLog, NEWS: Changes of V0.8.10 documented * po/de.po, po/en.po, po/es.po: Added message for -M EXT/AllEXT and the new INI-file options 2006-07-30 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added info about new INI-file value [FileType] * src/IExtract.cpp: Mode to determine the file-type can be read from an INI-file 2006-07-27 Markus Schwab * configure.in: Version to 0.8.10 * README: Minor fix (grammar) * po/de.po, po/en.po, po/es.po, doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added info about -M EXT and -M AllEXT * src/IExtract.cpp: Added values EXT and AllEXT to option -M * src/FileTypeChk.cpp, src/FileTypeChk.h: Added FileTypeCheckerByCaseExt; Renamed FileTypeCheckerByName to -Extension 2006-07-26 Markus Schwab * src/FileTypeChk.cpp, src/FileTypeChk.h: Fixed check for (content of) HTML docs; Optimised check for OpenOffice (content) 2006-07-22 Markus Schwab * ChangeLog: Changes of V0.8.00 documented * po/de.po, po/en.po: Last actualizations for 0.8.00 * po/es.po: Changed explication of -M AllExt * README: Copyright to 2006; Removed info about extension defining file-type 2006-07-21 Markus Schwab * src/ParseMP3.cpp: Length of frames is MSB; if reading ID3 v2 info fails, fall back to ID3 v1 2006-07-20 Markus Schwab * src/ParseMP3.cpp: Special handling of GEOB ID3-tags; Fixed size of buffer * src/ParseMP3.cpp: Handling also ID3v2.2 tags 2006-07-19 Markus Schwab * src/ParseMP3.cpp, src/ParseMP3.h: Skipping extended ID3 headers; Taking care of type of values in ID3 v2 tags 2006-07-18 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Adapted to the changes of V0.8.00 * src/ParseAbiword.cpp, src/ParseOOffice.cpp: Fixed messages * po/de.po, po/en.po, po/es.po: Translations of messages in V0.8.00 * NEWS: Changes of V0.8.00 documented * configure.in: Version to 0.8.00 * src/IExtract.cpp: Processing StarOffice documents with StarOffice parser * src/ParseMSOffice.cpp: Uppercased hex-constants * src/ParseGIF.cpp: Changed the formatting * src/ParseMP3.cpp, src/ParseMP3.h: Added check if file is an MP3-file * src/ParseOOffice.cpp, src/FileTypeChk.cpp, src/ParseSOffice.cpp: Made constants local to file * src/ParseAbiword.cpp, src/ParseAbiword.h: Added check if the file is an AbiWord document * src/ParseOOffice.cpp, src/ParseOOffice.h: Parsing documents according to ZIP-structure; added checks if file is correct * src/FileTypeChk.cpp: Test for OpenOffice checks zipfile-contents; Using constants for IDs 2006-07-17 Markus Schwab * src/ParseMP3.cpp: Also parsing files with ID3v2 tag * src/ParsePDF.cpp: Checking file-type; Fixed parsing docinfo dictionary without spaces * src/ParseRTF.cpp, src/ParseRTF.h: Checking if file is RTF-document * src/ParseSOffice.cpp, src/ParseSOffice.h: Checking if file is office-document; Optimised 2006-07-16 Markus Schwab * src/FileTypeChk.cpp: Added check for content of PDF, RTF and Star/OpenOffice documents 2006-07-15 Markus Schwab * src/FileTypeChk.cpp, src/FileTypeChk.h: FileTypeCheckerByContent handles HTML-documents * src/IExtract.cpp: Combined -X and -c to -M (modus operandi); Passing file + path for check 2006-07-14 Markus Schwab * src/IExtract.cpp: Added option to get file-type from content * src/FileTypeChk.cpp, src/FileTypeChk.h: Added FileClassCheckByName; 1st draft of FileClassCheckByContents 2006-07-13 Markus Schwab * src/IExtract.cpp: Using FileTyeChecker to get type of files * src/Makefile.am: Added FileTypeChk * src/FileTypeChk.cpp, src/FileTypeChk.h: Initial revision; with FileTypeCheckerbyName implemented 2006-07-12 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.14 documented * configure.in: Version to 0.7.14 * po/de.po, po/en.po, po/es.po: Added description of segment-nr in OGG-parser 2006-07-10 Markus Schwab * src/ParseOGG.cpp, src/ParseOGG.h: Fixed parsing files with a segment-table with other than 17 entries * src/IExtract.cpp: First includes from libYGP are Check.h and Trace.h 2006-06-07 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.13 documented * src/IExtract.cpp: Improved errorhandling of reading INI-file * configure.in: Version to 0.7.13; Needing libYGP 0.8.00 * po/de.po, po/en.po, po/es.po: Actualised 2006-06-05 Markus Schwab * src/IExtract.cpp, src/ParseAbiword.h, src/ParseGIF.h, src/ParseHTML.h, src/ParseJPG.h, src/ParseMP3.cpp, src/ParseMP3.h, src/ParseMSOffice.cpp, src/ParseMSOffice.h, src/ParseOGG.cpp, src/ParseOGG.h, src/ParseOOffice.h, src/ParsePDF.cpp, src/ParsePDF.h, src/ParsePNG.h, src/ParseRTF.h, src/ParseSOffice.h: Adapted to new exception-handling style of libYGP 0.8.00 2006-04-20 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.12 documented * configure.in: Version to 0.7.12 * Makefile.am: Added config.h to distribution; Removed Make.bat 2006-04-12 Markus Schwab * src/ParsePDF.cpp, src/ParsePDF.h: Storing string-representation of ID of info-object 2006-04-06 Markus Schwab * Make.bat: Removed 2006-04-03 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.11 commented * configure.in: Version to 0.7.11 * src/ParseJPG.cpp: Fixed compile-warning when compiling with GCC 4.1 * po/de.po, po/en.po, po/es.po: Actualised 2006-03-30 Markus Schwab * src/IExtract.cpp: Option -X skips not only the last unknown extension, but all 2006-03-09 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.10 commented * po/de.po, po/en.po, po/es.po: Actualised copyright-message * README: Mentioning configurability of supported file-types * INSTALL: Mentioning --disable- * src/IExtract.cpp, src/Makefile.am: Conditionally compiling supported file-types * configure.in: Version to 0.7.10; Can disable file-types to support 2005-12-22 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.01 commented * doc/IExtract.html.en: Fixed typo * configure.in: Version to 0.7.01 * src/IExtract.cpp, po/de.po, po/en.po, po/es.po: Improved description of program * src/IExtract.cpp: Avoid statement which depends on order of execution of its parts 2005-12-13 Markus Schwab * ChangeLog, NEWS: Changes of V0.7.00 commented * README: Actualised JPEG/MS-Office comments; Added OpenOffice 2 documents; Fixes * po/de.po, po/en.po, po/es.po: Adapted to new JPEG/MS-Office parser * po/POTFILES.in: Renamed ParseWord to ParseMSOffice * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Actualised description for JPEG and MS Office documents * configure.in: Version to 0.7.00; checking sizes of short/int/long 2005-12-12 Markus Schwab * src/ParseMSOffice.cpp: Fixed bug when reading file with InfoSummary not in 1st property block * src/ParseMSOffice.cpp, src/ParseMSOffice.h: Added parsing of documents with InfoSummary in small blocks * src/ParseJPG.cpp: Fixed warnings when compiling with -Wall * src/Makefile.am: Renamed ParseWord to ParseMSOffice * src/IExtract.cpp: Using new ParseMSOffice (renamed from ParseWord) * src/ParseMSOffice.cpp, src/ParseMSOffice.h: First draft of version without using YGP::Parse* 2005-12-11 Markus Schwab * src/ParseMSOffice.cpp, src/ParseMSOffice.h: Starting to parse documents with comments in small blocks (not working) 2005-12-08 Markus Schwab * src/ParseJPG.cpp, src/ParseOGG.cpp, src/ParsePNG.cpp, src/ParseSOffice.cpp: Using global utility-functions * src/Utility.h: New file. * src/Makefile.am: Added Utility.h * src/ParseHTML.cpp: Changed year of copyright 2005-12-07 Markus Schwab * src/ParseMSOffice.cpp, src/ParseMSOffice.h: Rewritten (basing on information from the Apache Jakarta POI project 2005-12-02 Markus Schwab * src/ParseJPG.cpp: Fixed constants on bigendian-systems; Reset length after reading Exif-number * src/ParseJPG.cpp, src/ParseJPG.h: Re-written (after reading description of JPEG-format) 2005-11-28 Markus Schwab * ChangeLog, NEWS: Changes of V0.6.40 commented 2005-11-27 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added handling of GIF images * configure.in: Version to 0.6.40 * README: Added handling of GIF images * src/IExtract.cpp: Changed copyright-info * po/de.po, po/en.po, po/es.po: Added messages of GIF-parser * po/POTFILES.in: Added ParseGIF * src/IExtract.cpp: Added handling of GIF images * src/Makefile.am: Added ParseGIF * src/ParseGIF.cpp, src/ParseGIF.h: Generalised reading of extensions; Fixed reading blocks longer than 128 bytes * src/ParseGIF.cpp, src/ParseGIF.h: New file. 2005-10-30 Markus Schwab * ChangeLog, NEWS: Changes of V0.6.30 commented * po/de.po, po/en.po, po/es.po: Added quoted as output-style 2005-10-17 Markus Schwab * configure.in: Version to 0.6.30; needing libYGP 0.7.30 * README: Explaining new output format (comma-separated, quoted text) * src/Writer.cpp, src/Writer.h: Escaping quotes in QuotedTextWriter-output * src/IExtract.cpp: Changed format of output * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added quoted output * src/IExtract.cpp: Added option to output quoted, comma-separated text * src/Writer.cpp, src/Writer.h: Added QuotedTextWriter * src/Writer.cpp, src/Writer.h: Adapted to new style of YGP::TableWriter 2005-08-03 Markus Schwab * ChangeLog, NEWS: Changes of V0.6.20 commented * po/de.po, po/en.po, po/es.po: Added ParsePNG messages * po/POTFILES.in: Added ParsePNG * configure.in: Version to 0.6.20 * README, doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added handling of PNG-images * src/IExtract.cpp: Added handling of PNG images * src/Makefile.am: Added ParsePNG * src/ParsePNG.cpp, src/ParsePNG.h: New file. 2005-06-04 Markus Schwab * ChangeLog, NEWS: Changes of V0.6.10 commented * README: Mentioning Abiword documents * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added handling of Abiword documents * po/de.po, po/en.po, po/es.po: Added Abiword messages * po/POTFILES.in: Added ParseAbiWord * configure.in: Version to 0.6.10 * src/IExtract.cpp: Handling Abiword documents * src/ParseOOffice.cpp, src/ParseOOffice.h: Formatting changed * src/Makefile.am: Added ParseAbiword * src/ParseAbiword.cpp, src/ParseAbiword.h: New file. 2005-04-18 Markus Schwab * src/IExtract.cpp: Remove threads before program exits, not if nothing is to extract * configure.in: Version to 0.6.01 * INSTALL: Added configure-option --enable-threads * README: Mentioning compiling with threads; Added extracting of OGG-files 2005-03-21 Markus Schwab * ChangeLog, NEWS: Changes of V0.6.00 commented * po/de.po, po/en.po, po/es.po: Added messages of OGG-parser * configure.in: Version to 0.6.00; Needing libYGP 0.7.10 * po/POTFILES.in: Added ParseOGG * doc/IExtract.html.es, doc/IExtract.html.de, doc/IExtract.html.en: Added handling of OGG audio files * src/Options.h, src/ParseMP3.h, src/ParseOOffice.h, src/ParsePDF.h: Removed virtual from destructor * src/IExtract.cpp: Added parsing of OGG files * src/Makefile.am: Added ParseOGG * src/ParseOGG.cpp, src/ParseOGG.h: New file. 2005-03-09 Markus Schwab * src/ParseMP3.cpp: Formatting changed 2005-03-08 Markus Schwab * README: Changed copyright; minor corrections * src/ParseHTML.cpp: Allowing titles to be empty 2005-03-06 Markus Schwab * po/de.po, po/en.po, po/es.po: Added messages for new options -s and -X * src/IExtract.cpp: Added option to ignore last extension (if unknown) * doc/IExtract.html.es, doc/IExtract.html.de, doc/IExtract.html.en: New options --sort and --ingore-ext added * src/IExtract.cpp, src/Options.h: Added options to sort the found files * src/Makefile.am: Adapted to new mgeni 2005-01-31 Markus Schwab * po/de.po: Changing format * po/Makevars: Added MSGID_BUGS_ADDRESS 2005-01-20 Markus Schwab * ChangeLog, NEWS: Changes of V0.5.21 commented * po/de.po: Fixed c-format declaration * configure.in: Needing libYGP 0.7.00 * po/en.po, po/es.po, po/de.po: Aktualised * doc/IExtract.html.es: Fixed minor spelling mistake 2004-11-28 Markus Schwab * src/Writer.cpp, src/Writer.h: Removed const from ctrl-paramter of getSubstitute 2004-11-27 Markus Schwab * src/Writer.cpp, src/Writer.h: Deriving from YGP::TableWriter 2004-10-07 Markus Schwab * configure.in: Using PACKAGE_VERSION instead of VERSION 2004-04-17 Markus Schwab * ChangeLog, NEWS: Changes of V0.5.20 commented * src/IExtract.cpp: Worked around stupid MS-VC++ bug * Windows/VCExtract.mak: Regenerated from new project file * Windows/IExtract.dsp: Removed old references to libYGP-files * Windows/IExtract-cfg.h: Version to 0.5.20 * Make.bat: Removed building with GCC * INSTALL: Removed compiling with GCC; mentioning clean target in make 2004-04-11 Markus Schwab * doc/IExtract.html.es: Fixed translation of content * po/de.po, po/en.po, po/es.po: Messages handling options a/A/p/P added * src/IExtract.cpp: Renamed option -a to -u; Appending/prepending of text added; Cleanup * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Explained new options; renamed --all to --add-unknown * configure.in: Version to 0.5.20 2004-01-31 Markus Schwab * ChangeLog, NEWS: Changes of V0.5.12 commented * po/de.po, po/en.po, po/es.po: Added translation of new Parser-objects in ParseHTML and ParsePDF * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es, src/IExtract.cpp, src/ParseHTML.cpp, src/ParseJPG.cpp, src/ParseMP3.cpp, src/ParseOOffice.cpp, src/ParsePDF.cpp, src/ParseRTF.cpp, src/ParseSOffice.cpp, src/ParseWord.cpp, src/Writer.cpp: Changed copyright; including 2004 * src/Writer.cpp: Fixed substituting %(LETTERS) 2004-01-29 Markus Schwab * src/ParseHTML.cpp, src/ParseHTML.h: Fixed parsing other META tags * configure.in: Version to 0.5.12 * README: Note on encrypted PDF files * Windows/IExtract-cfg.h: Version to 0.5.12 2004-01-28 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Mentioning that encrypted PDF files are not decrypted * src/ParsePDF.h, src/ParsePDF.cpp: Fixed parsing PDF files having a cross reference table with subsections * src/ParseHTML.cpp, src/ParseHTML.h: Handling more than one script section * src/ParseHTML.cpp, src/ParseHTML.h: Skipping over scripts 2004-01-23 Markus Schwab * ChangeLog, NEWS: Changes of V0.5.11 commented 2004-01-22 Markus Schwab * INSTALL: Actualised build-process for Windows * po/de.po, po/en.po, po/es.po: Actualised * Windows/VCExtract.mak: Linking against (static) VCGeneral library * src/IExtract.cpp, src/ParseOOffice.cpp: Disabled MSVC warnings about truncated debug-info * src/Writer.cpp: Workaround for MSVC stupidity * src/ParseRTF.cpp: Disabled checks 2004-01-21 Markus Schwab * configure.in: Version to 0.5.11 * INSTALL, README: Actualised 2004-01-04 Markus Schwab * ChangeLog, NEWS: Changes of V0.5.10 commented * src/Writer.cpp, src/Writer.h: Conversion of special characters can be switched on an off * src/IExtract.cpp: Explaining new substitution scheme * configure.in, Windows/IExtract-cfg.h: Version to 0.5.10 * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es, po/de.po, po/en.po, po/es.po: New substitution-scheme explained 2003-12-28 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Commented the new substitutes %e and %E * po/de.po, po/en.po, po/es.po: Addes messages about %e and %E * src/IExtract.cpp: Fixed bug setting files to in/exclude; Informing about %e/%E * src/Writer.cpp: Substituting %E and %e with filename w/out extension and only extension 2003-12-28 Markus Schwab * ChangeLog, NEWS: Changes of V0.5.00 commented * configure.in: Version to 0.5.00 * src/IExtract.cpp: Changed help-text (so that supported formats don't need to be translated) * Windows/IExtract-cfg.h: Version to 0.5.00 * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Added handling of RTF documents * po/de.po, po/en.po, po/es.po: Messages from ParseRTF added * po/POTFILES.in: Added ParseRTF.cpp 2003-12-27 Markus Schwab * src/IExtract.cpp: Added handling of OpenOpffice and RTF documents to help * src/ParseRTF.cpp, src/ParseRTF.h: Blocks can have a following value * src/IExtract.cpp: Handling RTF documents; optimised building of map defining files to handle * src/Makefile.am: ParseRTF added * src/ParseRTF.cpp, src/ParseRTF.h: New file. 2003-12-12 Markus Schwab * ChangeLog, NEWS: Commented changes in V0.4.02 * configure.in: Version to 0.4.02 * po/en.po, po/es.po, po/de.po: Actualised (only line numbers) * src/IExtract.cpp, src/ParseHTML.cpp, src/ParseJPG.cpp, src/ParseMP3.cpp, src/ParsePDF.cpp, src/ParseSOffice.cpp, src/ParseWord.cpp, src/Writer.cpp: Changed comment style to doxygen format * src/IExtract.cpp: Fixed adding a list of files to in/exclude (adding in/excl-flag to each node) * src/ParseHTML.h: Fixed lenght-declaration for _selMetaTags * src/IExtract.cpp: Added forgotten namespaces; Fixed a check 2003-11-17 Markus Schwab * ChangeLog: Actualised (for V0.4.01) * src/IExtract.cpp, src/Options.h, src/ParseHTML.h, src/ParseJPG.cpp, src/ParseJPG.h, src/ParseMP3.cpp, src/ParseMP3.h, src/ParseOOffice.cpp, src/ParseOOffice.h, src/ParsePDF.cpp, src/ParsePDF.h, src/ParseSOffice.cpp, src/ParseSOffice.h, src/ParseWord.cpp, src/ParseWord.h, src/Properties.h, src/Writer.cpp, src/Writer.h: Including Trace.h (instead of Trace_.h) 2003-11-16 Markus Schwab * ChangeLog, NEWS: Commented changes in V0.4.01 * configure.in: Also generating distribution in bzip2-format * src/Makefile.am: Setting libintl library flags * src/IExtract.cpp, src/Options.h, src/ParseHTML.cpp, src/ParseHTML.h, src/ParseJPG.cpp, src/ParseJPG.h, src/ParseMP3.cpp, src/ParseMP3.h, src/ParseOOffice.cpp, src/ParseOOffice.h, src/ParsePDF.cpp, src/ParsePDF.h, src/ParseSOffice.cpp, src/ParseSOffice.h, src/ParseWord.cpp, src/ParseWord.h, src/Writer.cpp, src/Writer.h: Using new libYGP style (namespaces) 2003-10-11 Markus Schwab * configure.in: Version to 0.4.01 2003-08-12 Markus Schwab * ChangeLog, NEWS: Commented changes in V0.4.00 * README: Section planned added * po/es.po: Fixed 'compiled at'-string * configure.in: Testing for YGP * src/IExtract.cpp: Removed explicit convertion of char* to string (in handler-init) 2003-06-19 Markus Schwab * Windows/IExtract-cfg.h: Versionnr. to 0.4.00 * README: Actualised to reflect handling of OpenOffice * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Handling of OpenOffice docs added 2003-06-18 Markus Schwab * ChangeLog: Added po-updates * po/POTFILES.in: ParseOpenOffice added * po/de.po, po/en.po, po/es.po: New messages (for OpenOffice) translated * configure.in: Setting version to 0.4.00; Searching for libYMP (and not libGeneral) * Makefile.am: Including IExtract-cfg.h for Windows version to distribution * src/Writer.cpp: Including C++ standard headers (and not C standard headers) * src/ParsePDF.cpp: Corrected typo in subsystem * src/Makefile.am: Renamed OpenOfice to OpenOffice (ehem) * src/IExtract.cpp: Added sxi and sxm extension, removed sxa; Declaring processOpenOffice * src/ParseOOffice.cpp, src/ParseOOffice.h: New file. 2003-06-04 Markus Schwab * src/IExtract.cpp: Marking string as no-c-format (for gettext) 2003-06-03 Markus Schwab * Makefile.am: Using pkg-config to search for libAttr 2003-06-03 Markus Schwab * NEWS, ChangeLog: Commented changes in V0.3.02 * configure.in, src/Makefile.am: Using pkg-config to search for libAttr * Makefile.am: Switched to automake 1.7 * po/en.po: Actualised * po/de.po, po/es.po: Removed warning for '%' in a message * po/Makevars: New file. * acconfig.h: Deleted 2003-06-03 gettextize * Makefile.am (SUBDIRS): Add m4. (ACLOCAL_AMFLAGS): New variable. (EXTRA_DIST): New variable. * configure.in (AC_OUTPUT): Add m4/Makefile. 2003-03-26 Markus Schwab * NEWS, ChangeLog, README: Changes of Version 0.3.01 documented * src/IExtract.cpp: Copyright reformated in one line; thanks to gettext * po/de.po, po/en.po, po/es.po: New messages added (for XML); Line numbers actualised * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Features of 0.3.01 docommented * Windows/IExtract-cfg.h, configure.in: Version to 0.3.01 * src/Writer.cpp, src/Writer.h: Removed OutIterator; Changing special characters * src/ParseMP3.cpp: Stripping also \0, but returning always at least one character * src/ParseHTML.cpp, src/ParsePDF.cpp, src/ParseSOffice.cpp, src/ParseWord.cpp: Removed warnings when compiling with -Wall * src/IExtract.cpp, src/Writer.cpp, src/Writer.h: Output in XML-style (defaults to XHTML) 2003-02-24 Markus Schwab * Windows/IExtract.dsp, Windows/VCExtract.mak: Include-pahts corrected * Windows/IExtract-cfg.h: Initial reviion 2003-02-22 Markus Schwab * src/IExtract.cpp, src/ParseHTML.cpp, src/ParseJPG.cpp, src/ParseMP3.cpp, src/ParsePDF.cpp, src/ParsePDF.h, src/ParseSOffice.cpp, src/ParseWord.cpp, src/ParseWord.h: Using namespace std explicitely 2003-02-13 Markus Schwab * NEWS, ChangeLog: Changes of Version 0.3.00 documented * README: Docu-dir added * src/IExtract.cpp: Name of options changed * doc/IExtract.html.de, doc/IExtract.html.en, doc/IExtract.html.es: Updated addressof of FSF 2003-02-10 Markus Schwab * po/de.po, po/en.po, po/es.po: Actualised 2003-02-01 Markus Schwab * doc/IExtract.html.es: Minor fixes * doc/IExtract.html.en, doc/IExtract.html.es, doc/IExtract.html.de: Links to versions in other languages added; Improved and fixed 2003-01-28 Markus Schwab * po/en.po, po/es.po: New file. 2003-01-27 Markus Schwab * Makefile.am, configure.in: I18n enabled * acconfig.h: New file. * src/ParseHTML.cpp, src/ParseJPG.cpp, src/ParsePDF.cpp, src/ParseSOffice.cpp, src/ParseWord.cpp: I18n * src/Writer.cpp: UNIX-style only set on non-UNIX-systems * src/IExtract.cpp: I18n; Handling *php & *sht-files; Minor fixes * po/POTFILES.in, po/de.po: New file. 2003-01-26 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en: Minor changes and corrections * doc/Makefile.am: Translations added to distribution * doc/IExtract.html.es: New file. 2003-01-25 Markus Schwab * doc/IExtract.html.de, doc/IExtract.html.en, doc/Makefile.am: New file. * Windows/VCExtract.mak, Windows/IExtract.dsp: Version to 0.3.00; ParseMP3 added 2003-01-16 Markus Schwab * src/Writer.cpp, src/Writer.h: Posibility to print size added; isAtName deleted; Fixed HTML-printMessage * src/ParseMP3.cpp, src/ParseMP3.h: New file. * src/Makefile.am: ParseMP3 added * src/IExtract.cpp: Handling of MP3-files aded; Help text changed to max. 80 chars/line 2003-01-11 Markus Schwab * src/ParseWord.cpp, src/ParseWord.h: Checking for 2 separators between property header and data 2002-12-17 Markus Schwab * NEWS, ChangeLog: Changes in Version 0.2.03 documented * configure.in: Version to 0.2.03 * src/ParseHTML.cpp: End-of-head tag can also be written /HEADER * src/Writer.cpp: New path subtitutes (to UNIX-style); printMessages changed * src/IExtract.cpp: Age of new files read from INI-file; Helptext++; New substitutes commented * src/Options.h: Age of new files also read from INI-file 2002-11-28 Markus Schwab * src/ParseHTML.cpp: Added callback to /HEAD to speed up parsing * src/ParseHTML.h: Added callback to /HEAD to spead up parsing 2002-11-22 Markus Schwab * src/IExtract.cpp: Printing version info corrected; Option to print text separate directories added * src/Writer.cpp, src/Writer.h: Added possibility to print a separating text between directories * src/Options.h: Separate-text added 2002-11-21 Markus Schwab * ChangeLog, NEWS: Changes of V0.2.02 commented * src/IExtract.cpp, src/Makefile.am, src/ParseHTML.cpp, src/ParseJPG.cpp, src/ParsePDF.cpp, src/ParsePDF.h, src/ParseSOffice.cpp, src/Writer.cpp: Asserts replaced with Checks * src/ParseWord.cpp, src/ParseWord.h: Handling documents having OLE-objects (?) in between property header and values * README: e-Mail corrected to report bugs corrected 2002-11-18 Markus Schwab * src/ParseJPG.cpp, src/ParseJPG.h, src/ParsePDF.cpp, src/ParsePDF.h, src/ParseSOffice.cpp, src/ParseSOffice.h, src/ParseWord.cpp, src/ParseWord.h, src/ParseHTML.cpp: Adapted to new Parser interface 2002-11-15 Markus Schwab * src/ParsePDF.cpp: Corrected hex-convertion * src/ParsePDF.h: convertToInt deleted * configure.in: Version to 0.2.02 * src/ParsePDF.cpp, src/ParsePDF.h: Handling of linearised docus added; Strings can be in hex-format * README: Handling of PDF files commented * NEWS: Changes in V0.2.02 commented * src/ParsePDF.cpp, src/ParsePDF.h: Rewritten to better use it for linearised files; works as before 2002-11-14 Markus Schwab * src/ParsePDF.cpp, src/ParsePDF.h: Works for not-linearised files and text values * src/ParsePDF.cpp, src/ParsePDF.h: Parsing /Info object added * src/Makefile.am: ParsePDF added * src/IExtract.cpp: Binary search in handlers-table; handling PDF files; reporting parse error * src/ParsePDF.cpp, src/ParsePDF.h: New file. 2002-11-10 Markus Schwab * src/IExtract.cpp: Parsing INI-file via Options-Entity * src/Options.h: New file. 2002-11-07 Markus Schwab * ChangeLog, NEWS: Changes of V0.2.01 commented * src/Writer.cpp, src/Writer.h: Handling of new format options %D & %(...) added; Special handling of %n removed * src/ParseHTML.cpp, src/ParseJPG.cpp, src/ParseWord.cpp: Wrong comment removed * src/IExtract.cpp: New feat. to help text; Reading new-file string from INI-file; StarDraw handled * configure.in: Version to 0.2.01 * Makefile.am: Typo corrected * README: Actualised to reflect Changes up to 0.2.01 * Windows/IExtract.dsp, Windows/VCExtract.mak: ParseSOffice added 2002-11-05 Markus Schwab * src/IExtract.cpp, src/Writer.cpp, src/Writer.h: Output in LaTeX tabular style added * src/ParseSOffice.cpp, src/ParseSOffice.h: Searching for ID instead of skipping first 0x8c2 bytes; necessary for sdd files 2002-11-04 Markus Schwab * src/Makefile.am: ParseSOffice added * src/ParseJPG.cpp, src/ParseWord.cpp: Should theoretically also work on bigendian machines * src/IExtract.cpp: Handling of StarOffice documents added * src/ParseSOffice.cpp, src/ParseSOffice.h: Initial revision; working for StarWord and StarCalc 2002-11-04 Markus Schwab * ChangeLog: V0.2.00 documented * Windows/IExtract.dsp, Windows/VCExtract.mak: New classes added * src/IExtract.cpp: Using safer Thread::waitForThread-method * configure.in: Type in helpstrings corected 2002-11-03 Markus Schwab * src/Writer.cpp: TextWriter starts a new line after printing title * src/ParseJPG.cpp, src/ParseJPG.h: Extracting also PhotoShop-style comments * src/IExtract.cpp: Fixed trap when INI-file is missing * README: Changes in V0.2.00 documented * NEWS: V0.2.00 documented 2002-11-01 Markus Schwab * src/IExtract.cpp: Reading of INI files added 2002-10-28 Markus Schwab * src/IExtract.cpp: Adapted to new style to format entries of Writer * src/Writer.cpp, src/Writer.h: Style of format similar to title with '|' as separator * src/ParseHTML.cpp, src/ParseHTML.h: Ignoring other Meta tags (as http-equiv, ...) * src/ParseWord.cpp: - Corrected identifier for title; seems to be 2 and not 1 - Values are assigned with operator= and not assign as some documents have wrong length-entries * src/ParseWord.h: Pragmas to suppress VC-warnings added * src/ParseHTML.cpp, src/ParseHTML.h: Parsing of author and description added 2002-10-25 Markus Schwab * src/IExtract.cpp: Unicode strings are converted to normal strings; Helptext fixed * src/Writer.cpp, src/Writer.h: Title and format of output can be specified * src/IExtract.cpp: Format and title of output are changeable (via options) 2002-10-24 Markus Schwab * src/Properties.h: Author added * src/ParseWord.cpp, src/ParseWord.h: Extracting title, author and comment * src/ParseJPG.h: Unused idFormat2 removed * src/ParseJPG.cpp: Accept files without comments; disabled VC-warning about cutting identifiers * src/IExtract.cpp: Only known filetypes are passed to thread * src/ParseHTML.h, src/ParseHTML.cpp: Fixed sequence of title-tag 2002-10-22 Markus Schwab * configure.in: --enable-{debug|checks|threads} added * src/ParseWord.h: Definition of ENABLE_THREADS removed * src/ParseJPG.h: Size of format2-sequence reduced to 2 * src/IExtract.cpp: Waiting for threads before parsing files in subdirs * src/ParseJPG.cpp: Fixed handling of extended properties * src/Writer.cpp: Fixed printMessage in HTML-writer * src/ParseWord.h: Using new Parse*Ignore classes * src/ParseWord.cpp: Gracefully handling missing properties * src/ParseJPG.cpp, src/ParseJPG.h: Changed to parse both JPEG-formats (works for format 1 (as before)) 2002-10-21 Markus Schwab * src/ParseHTML.cpp: Including property-header * src/ParseHTML.h, src/ParseJPG.h, src/ParseWord.h: Using multiple buffers when compiled for threads * src/IExtract.cpp: Verbose-option removed; Threads started 2002-10-20 Markus Schwab * Windows/IExtract.dsp, Windows/VCExtract.mak: New sources added * src/IExtract.cpp: Fixed output of errors only when specified * src/Writer.cpp, src/Writer.h: Methods to print file with infotext added * src/IExtract.cpp: In/excluding of files added; prepared for threads 2002-10-19 Markus Schwab * src/ParseJPG.cpp: Assigning property-values; Fixes * src/ParseJPG.cpp, src/ParseJPG.h: Actual parsing of properties (style 2) added * src/ParseJPG.cpp, src/ParseJPG.h: Enhanced to parse simple and extented properties 2002-10-17 Markus Schwab * src/IExtract.cpp: Fixed option-handling (--new) and help screen * src/IExtract.cpp, src/Writer.cpp, src/Writer.h: Handling of new files added * ChangeLog: Actualised for V0.1.00 * configure.in: New file. * README: Minor changes; Handling of JPEGs added * Makefile.am: Added project files for VC to distribution * Make.bat: New file. * NEWS: Features of V0.1.00 explained * src/Writer.cpp, src/Writer.h: Prepared to print more information * src/ParseHTML.cpp, src/ParseHTML.h, src/ParseWord.cpp, src/ParseWord.h: Prepared to parse more information * src/Makefile.am: ParseJPG and Properties.h added * src/Properties.h: New file. * src/IExtract.cpp: Handling JPEG images; Prepared to parse more information * src/ParseJPG.cpp, src/ParseJPG.h: New file. * src/IExtract.cpp: Handling of subdirectories corrected (search all with passed filespecification) * src/Writer.cpp, src/Writer.h: const char* for descriptions changed to std::string 2002-10-17 Markus Schwab * configure.in: New file. * README: Minor changes; Handling of JPEGs added * Makefile.am: Added project files for VC to distribution * Make.bat: New file. * NEWS: Features of V0.1.00 explained * src/Writer.cpp, src/Writer.h: Prepared to print more information * src/ParseHTML.cpp, src/ParseHTML.h, src/ParseWord.cpp, src/ParseWord.h: Prepared to parse more information * src/Makefile.am: ParseJPG and Properties.h added * src/Properties.h: New file. * src/IExtract.cpp: Handling JPEG images; Prepared to parse more information * src/ParseJPG.cpp, src/ParseJPG.h: New file. * src/IExtract.cpp: Handling of subdirectories corrected (search all with passed filespecification) * src/Writer.cpp, src/Writer.h: const char* for descriptions changed to std::string 2002-10-16 Markus Schwab * src/ParseWord.cpp: Fixed (wrong) skipping of WS; skipping to comment; assignment of found title * src/ParseWord.cpp, src/ParseWord.h: Inspecting properties-header for extracting information * src/ParseHTML.cpp, src/ParseHTML.h: Adapted to new parser interface (found passes length); title-tag ignores case * src/IExtract.cpp: Adapted to new parser-interface (returning strings) 2002-10-15 Markus Schwab * Windows/VCExtract.mak: Writer added * Windows/IExtract.dsp: New file. 2002-10-12 Markus Schwab * src/Writer.cpp: Changed ostream to iostram * src/Makefile.am: Writer added * src/IExtract.cpp: Sanity check before strdup found result * src/Writer.cpp, src/Writer.h: New file. * src/ParseHTML.cpp: Using new-style IO * src/IExtract.cpp: Using writer-class for output; Options to show unknown extension & path 2002-10-11 Markus Schwab * Windows/IExtract.dsw: New file. * Windows/VCExtract.mak: Adapted to new file-structure * src/IExtract.cpp: Output-style flexible; Options to control handling added * README: Install-part moved to install; Windows-help added; Bugreport added * Makefile.am, src/Makefile.am, INSTALL, AUTHORS, ChangeLog, NEWS, README, Windows/VCExtract.mak, src/IExtract.cpp, src/ParseHTML.cpp, src/ParseHTML.h, src/ParseWord.cpp, src/ParseWord.h: New file. 2002-10-08 Markus Schwab * Project created