ocrodjvu (0.7.9) unstable; urgency=low

  * Improve error handling.
  * Fix compatibility with Tesseract > 3.01.

 -- Jakub Wilk <jwilk@jwilk.net>  Sat, 10 Mar 2012 23:36:03 +0100

ocrodjvu (0.7.8) unstable; urgency=low

  * Improve test suite.

 -- Jakub Wilk <jwilk@jwilk.net>  Sun, 22 Jan 2012 00:04:16 +0100

ocrodjvu (0.7.7) unstable; urgency=low

  * Raise proper import error if html5lib is not installed. Thanks to Kyrill
    Detinov for the bug report.

 -- Jakub Wilk <jwilk@jwilk.net>  Sun, 11 Dec 2011 23:08:05 +0100

ocrodjvu (0.7.6) unstable; urgency=low

  * Improve error handling.
  * ocrodjvu:
    + Fix a regression in gocr, ocrad and tesseract engines, which made them
      unusable.

 -- Jakub Wilk <jwilk@jwilk.net>  Thu, 27 Oct 2011 18:06:38 +0200

ocrodjvu (0.7.5) unstable; urgency=low

  * Check Python version in setup.py.
  * Accept slightly malformed hOCR documents (with a text zone not completely
    within the page area).
    http://bugs.debian.org/575484#35
  * Fix compatibility with Tesseract > 3.00.
    Thanks to Janusz S. Bień for the bug report.
  * ocrodjvu, hocr2djvused:
    + Add the --html5 option.

 -- Jakub Wilk <jwilk@jwilk.net>  Sat, 27 Aug 2011 01:25:33 +0200

ocrodjvu (0.7.4) unstable; urgency=low

  * Use a better method to detect Debian-based systems.
  * hocr2djvused:
    + Ignore comments and <script> elements in hOCR.
  * ocrodjvu:
    + For Tesseract ≥ 3.00, extract bounding boxes of particular characters
      with higher accuracy.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 27 Jul 2011 17:34:38 +0200

ocrodjvu (0.7.2) unstable; urgency=low

  * Don't hang if one of the threads raises an exception.
  * Use the logging module for printing progress messages, errors etc.
  * Produce more useful import error messages on Debian-based systems.

 -- Jakub Wilk <jwilk@jwilk.net>  Mon, 04 Apr 2011 01:14:22 +0200

ocrodjvu (0.7.1) unstable; urgency=low

  * Windows: guess location of the DjVuLibre DLL (requires python-djvulibre
    ≥ 0.3.3).
  * ocrodjvu:
    + Work around a bug in Cuneiform, which mistakenly use ‘slo’ (rather than
      ‘slv’) as language code for Slovenian.
      https://bugs.launchpad.net/cuneiform-linux/+bug/707951
    + Accept ‘ces’, ‘nld’, ‘slv’, ‘ron’ as language codes for Czech, Dutch,
      Slovenian and Romanian languages, even when Cuneiform internally use
      different ones.
  * djvu2hocr:
    + Don't flip hOCR upside-down.
      http://bugs.debian.org/611460

 -- Jakub Wilk <jwilk@jwilk.net>  Sat, 29 Jan 2011 18:14:40 +0100

ocrodjvu (0.7.0) unstable; urgency=low

  * Correctly handle empty pages recognized by Cuneiform and Ocrad. Thanks to
    Alexey Shipunov for the bug report.
  * Fix crash on Cuneiform-generated hOCR with bounding boxes for whitespace
    characters. Thanks to Alexey Shipunov for the bug report.
  * Fix compatibility with Tesseract 3.00.
  * Fix colors in 24-bit BMP images.
  * ocrodjvu:
    + Make ‘-e’ an alias for ‘--engine’.
    + Make ‘-l’ an alias for ‘--language’.
    + Add the -X option (for advanced users).
    + Work-around for Cuneiform returning files with control characters is now
      disabled by default. Use ‘-X fix-html=1’ to re-enable it.
    + Add the --on-error option (for advanced users).
  * djvu2hocr:
    + Fix a typo, which prevented hocr2djvused from correctly parsing files
      produced by it.
      http://bugs.debian.org/600539
  * Extend the test suite.

 -- Jakub Wilk <jwilk@jwilk.net>  Sun, 07 Nov 2010 21:37:00 +0100

ocrodjvu (0.6.1) unstable; urgency=high

  * Improve detection of Tesseract.
  * Correctly handle unrecognized and non-ASCII characters in Ocrad ORF output.
    Thanks to Heinrich Schwietering for the bug report.
  * Correct handle text that is closer than 100 pixels from the left edge in
    Ocrad ORF output. Thanks to Heinrich Schwietering for the test case.
  * Fix crash on hOCR with image elements. Thanks to Alexey Shipunov for the
    bug report.
    http://bugs.debian.org/598139
  * Fix insecure use of temporary files when using Cuneiform.
    http://bugs.debian.org/598134
    CVE-2010-4338

 -- Jakub Wilk <jwilk@jwilk.net>  Sun, 26 Sep 2010 15:01:51 +0200

ocrodjvu (0.6.0) unstable; urgency=low

  * Add support for the Tesseract OCR engine.
  * Fix Cuneiform support (a regression introduced in 0.5). Thanks to Kyrill
    Detinov for the bug report.

 -- Jakub Wilk <jwilk@jwilk.net>  Thu, 16 Sep 2010 19:24:20 +0200

ocrodjvu (0.5.1) unstable; urgency=low

  * Fix crash when listing engines/languages if Ocropus is not found.
    Thanks to Kyrill Detinov for the bug report.
  * lxml is no longer required for OCR engines that are not using hOCR as
    output format.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 15 Sep 2010 18:38:00 +0200

ocrodjvu (0.5.0) unstable; urgency=low

  * Add support for the Ocrad OCR engine.
  * Add support for the GOCR engine.
  * Cuneiform is no longer required to be linked with ImageMagick.
  * Prevent Cuneiform from asking interactive questions. Thanks to Heinrich
    Schwietering for the bug report.
  * Make sure that signals are handled in a sane way. Thanks to Heinrich
    Schwietering for the bug report.
  * Drop support for guessing page size from image (scan) contents.
  * Let the setup.py script install manual pages. Thanks to Kyrill Detinov and
    Heinrich Schwietering for bug reports.

 -- Jakub Wilk <jwilk@jwilk.net>  Tue, 14 Sep 2010 23:00:35 +0200

ocrodjvu (0.4.7) unstable; urgency=low

  * Preserve as much environment as possible when calling external programs.
    Thanks to Heinrich Schwietering for the bug report.
    http://bugs.debian.org/594385

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 25 Aug 2010 20:27:17 +0200

ocrodjvu (0.4.6) unstable; urgency=low

  * Implement work-around for Cuneiform returning files with control
    characters. Thanks to Kyrill Detinov for the bug report.
  * Avoid deprecation warnings with PyICU ≥ 1.0.
    http://bugs.debian.org/589027
  * djvu2hocr:
    + Don't crash on very long documents.
      http://bugs.debian.org/591389

 -- Jakub Wilk <jwilk@jwilk.net>  Tue, 03 Aug 2010 20:33:49 +0200

ocrodjvu (0.4.5) unstable; urgency=low

  * Fix handling of ‘deu’ and ‘rus-eng’ languages. Thanks to Kyrill Detinov
    for the bug report.
  * Properly handle hOCR with inline formatting. Thanks to Kyrill Detinov for
    the bug report.
  * djvu2hocr:
    + Add ocr-system and ocr-capabilities meta information.

 -- Jakub Wilk <jwilk@jwilk.net>  Mon, 24 May 2010 21:22:39 +0200

ocrodjvu (0.4.4) unstable; urgency=low

  * Document that ocrodjvu honours TMPDIR environment variable.
    http://bugs.debian.org/575488
  * Don't remove temporary directory if ocrodjvu crashed.
    http://bugs.debian.org/575487

 -- Jakub Wilk <jwilk@jwilk.net>  Fri, 02 Apr 2010 12:00:11 +0200

ocrodjvu (0.4.3) unstable; urgency=low

  * Don't crash on --version.
    http://bugs.debian.org/573496
  * Give more meaningful error messages on a malformed hOCR produced by
    Cuneiform.
    http://bugs.debian.org/572522
  * Document how djvu2hocr deals with non-XML characters.

 -- Jakub Wilk <jwilk@jwilk.net>  Fri, 19 Mar 2010 01:22:54 +0100

ocrodjvu (0.4.2) unstable; urgency=low

  * New options for ocrodjvu:
    + --render=mask,
    + --render=foreground,
    + --render=all.
    http://bugs.debian.org/572081
  * Fix off-by-one error in text area coordinates.
  * Add support for Cuneiform 0.9.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 03 Mar 2010 21:27:15 +0100

ocrodjvu (0.4.1) unstable; urgency=low

  * Be stricter when reading hOCR produced by OCRopus 0.3.1.

 -- Jakub Wilk <jwilk@jwilk.net>  Fri, 22 Jan 2010 20:25:54 +0100

ocrodjvu (0.4.0) unstable; urgency=low
  
  * Add support for the Cuneiform OCR engine.
    New options for ocrodjvu:
    + --engine,
    + --list-engines.
  * Don't crash on non-ASCII file names.
    Thanks to Jean-Christophe Heger for the bug report.
  * hocr2djvused:
    + Add the --page-size option.
  * ocrodjvu:
    + Add the -j/--jobs option.

 -- Jakub Wilk <jwilk@jwilk.net>  Thu, 21 Jan 2010 23:41:37 +0100

ocrodjvu (0.3.2) unstable; urgency=low

  * Accept negative numbers in hOCR bounding boxes.
  * djvu2hocr:
    + Fix broken UAX #29 segmentation.
    + Provide correct page bounding boxes.

 -- Jakub Wilk <jwilk@jwilk.net>  Fri, 08 Jan 2010 17:46:51 +0100

ocrodjvu (0.3.1) unstable; urgency=low

  * djvu2hocr:
    + Fix broken UAX #29 segmentation.

 -- Jakub Wilk <jwilk@jwilk.net>  Sun, 03 Jan 2010 12:56:08 +0100

ocrodjvu (0.3.0) unstable; urgency=low

  * Python ≥ 2.5 is now required.
  * argparse module in now required.
  * Add support for OCRopus 0.3.1.
  * Give better error messages when Tesseract language pack cannot be found.
  * New options for ocrodjvu:
    + -t/--details;
    + --word-segmentation.
  * New options for hocr2djvused:
    + --rotation,
    + -t/--details,
    + --word-segmentation,
  * New tool: djvu2hocr.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 16 Dec 2009 18:42:21 +0100

ocrodjvu (0.2.1) unstable; urgency=low

  * Give a clearer error message if OCRopus were interrupted by a signal.
  * Add the --language option.
  * Add the --list-languages option.

 -- Jakub Wilk <jwilk@jwilk.net>  Sat, 17 Oct 2009 17:34:43 +0200

ocrodjvu (0.2.0) unstable; urgency=low

  * Provide a manual page.
  * Add the -D/--debug option.
  * Add options to specify how results are stored:
    + -o/--save-bundled,
    + -i/--save-indirect,
    + --save-script,
    + --in-place,
    + --dry-run.
  * Add the --clear-text option.
  * Add the --ocr-only option.

  * Please use the --in-place and --clear-text options to retain compatibility
    with ocrodjvu < 0.2.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 14 Oct 2009 20:53:48 +0200

ocrodjvu (0.1.3) unstable; urgency=low

  * Use ocroscript, rather than ocrocmd.

 -- Jakub Wilk <jwilk@jwilk.net>  Sun, 15 Mar 2009 19:01:11 +0100

ocrodjvu (0.1.2) unstable; urgency=low

  * Make hocr2djvused work with hOCR for multiple pages. 
  * Handle rotated pages correctly.
  * Ignore IW44-only pages.

 -- Jakub Wilk <jwilk@jwilk.net>  Mon, 23 Jun 2008 20:14:42 +0200

ocrodjvu (0.1.1) unstable; urgency=low

  * Depend on python-lxml.
  * Better compatibility with Python 2.4.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 14 May 2008 11:23:13 +0200

ocrodjvu (0.1) unstable; urgency=low

  * Initial release.

 -- Jakub Wilk <jwilk@jwilk.net>  Wed, 07 May 2008 18:29:40 +0200
