Change Summary:
30 April 2013 - jTessBoxEditor v0.9
- Enhance Generate TIFF/Box functionality to combine prepending symbols in addition to appending ones
- Fix a bug in which changes to the table in edit mode were not persisted
- Find function now supports partial matches
- Fix a problem in which the table did not scroll along when the row header had focus
16 April 2013 - jTessBoxEditor v0.8
- Add row number header
- Char cell now editable
- Convert Unicode escape sequences where possible
- Find box now displays Unicode characters and allows search using Unicode escape sequences
- Improve Generate TIFF/Box functionality:
  - automatically combine boxes that have the same coordinates or completely enclose one another
  - automatically combine boxes that are combining symbols, specified in an external file, with the main, base character
  - retain last-modified exp number in filename
6 January 2013 - VietOCR.NET v3.4
- Upgrade to Tesseract 3.02 .NET wrapper (r820) (https://github.com/charlesw/tesseract)
- Add hocr support for Bulk & Batch and command-line operations
- Update links to dictionary files
6 January 2013 - VietOCR v3.4.2
- Update Tesseract 3.02 to r820
- Add hocr support for Bulk & Batch and command-line operations
- Update links to dictionary files
- Update JNA to v3.5.1
28 November 2012 - VietOCR v3.4.1 & VietOCR.NET v3.3.1
- Add Bulk OCR process
3 November 2012 - VietOCR v3.4
- Upgrade Tesseract engine to v3.02 (r798)
- Enable text entry in the combobox for Tesseract 3.02's multi-language OCR support
- Fit Image now retains image aspect ratio
- Add optional support for using Tess4J library
- Update JACOB to 1.16.1 version
16 June 2012 - jTessBoxEditor v0.7
- Fix an issue with opening Help file on OS X
- For TIFF/Box generation:
  - increase line spacing
  - abbreviate bold/italic font style to b/i for filename
  - add a Prefix (Language Code) textbox
  - add support for text anti-aliasing
25 February 2012 - VietOCR v3.3 and VietOCR.NET v3.3
- Download Language Data will warn if the user does not have write access to tessdata folder
- Use icons from FatCow (http://www.fatcow.com/free-icons) and by Rimas Kudelis
- Fix a bug with scrollbars visible when Fit Image selected (Java only)
- Update Hunspell to v1.3.2 (Java only)
21 January 2012 - VietOCR v3.2.2
- Fix a context menu font issue when displaying Unicode characters in spellcheck suggestions
14 January 2012 - VietOCR v3.2.1
- Fix an issue with opening Help file on OS X
- Update JACOB to 1.16-M2 version
- Update JNA to 3.4.0 version
26 November 2011 - VietOCR.NET v3.2
- Update Tesseract 3.01 to r639 (final release version)
- Remove unneeded liblept168.dll
- Update lists of language codes
- Add PSM support to execution from command line
22 October 2011 - VietOCR v3.2
- Update Tesseract 3.01 to r638 (final release version)
- Remove unneeded liblept168.dll
- Update lists of language codes
- Update JACOB to 1.16-M1 version
- Add PSM support to execution from command line
18 October 2011 - jTessBoxEditor v0.6
- Add a utility function that creates TIFF/Box pair suitable for training with Tesseract
- Fix a bug which may clear out a modified box file when loading another image
- Enhance box search operations
- Fix font issues in various visual components
- Merged box will have a character value composed of all the characters of the merging boxes
12 September 2011 - VietOCR.NET v3.1.1
- Fix a bug in which the arrow keys would scroll the picturebox even when the textbox has focus
- Provide Page Segmentation Mode options for Tesseract engine
12 September 2011 - VietOCR v3.1.5
- Update Tesseract 3.01 to r625
- Provide Page Segmentation Mode options for Tesseract engine
12 September 2011 - jTessBoxEditor v0.5.1
- Correct an issue with (small) image vertically centering in the image pane; align it to top of pane
5 September 2011 - jTessBoxEditor v0.5
- Display informational tooltip for boxed characters in image pane
14 August 2011 - VietOCR.NET v2.0.5
- Fix a bug in which the arrow keys would scroll the picturebox even when the textbox has focus
1 August 2011 - VietOCR.NET v3.1
- Integrate tesseractdotnet .NET wrapper DLL x86 (r48+) based on Tesseract 3.01 (r597)
- Remove tesseract.exe file
- Trap OutOfMemory exceptions that intermittently occur during drawing of selection boxes
- Refactor
1 August 2011 - VietOCR v3.1.4
- Update Tesseract 3.01 to r597
1 August 2011 - VietOCR.NET v2.0.4
- Trap OutOfMemory exceptions that intermittently occur during drawing of selection boxes
- Refactor
26 June 2011 - VietOCR.NET v3.00
- Use command-line process to invoke Tesseract 3.01 (r585) binary executable
- Include improved Vietnamese language pack
4 June 2011 - VietOCR v3.1.3
- Refactoring
- Improve program usability, enabling image navigation and manipulation with keyboard
- Fix an EOL issue that broke Remove Line Breaks functionality on Windows
- Integrate Linux Sane scanning support
- Fix an issue with restart notification after language pack downloads
- Update Tesseract 3.01 to r585
- Replace Vietnamese language pack with an improved version
4 June 2011 - VietOCR v2.0.3 & VietOCR.NET v2.0.3
- Refactoring
- Improve program usability, enabling image navigation and manipulation with keyboard
- Fix an installation issue that prevented uninstalling previous versions (.NET only)
- Fix an EOL issue that broke Remove Line Breaks functionality on Windows (Java only)
- Integrate Linux Sane scanning support
28 May 2011 - jTessBoxEditor v0.4
- Add a utility function for merging images into a multi-page TIFF
25 April 2011 - jTessBoxEditor v0.3
- Provide a close-up view of current box
14 April 2011 - jTessBoxEditor v0.2
- Add a provision to set font for the Box Coordinates table
- Set table row height to match font
- Incorporate a pangram into the Font dialog
10 April 2011 - jTessBoxEditor v0.1
- Support editing box data of both Tesseract 2.0x and 3.0x formats
- Implement box select & merge/split/insert/delete operations
- Implement box size change function via spinners
- Support Unicode conversion for the text field
- Include box search function
13 March 2011 - VietOCR v2.0.2/v3.1.2 & VietOCR.NET v2.0.2
- Incorporate deskew functionality using GMSE Deskew algorithm
- Fix a MissingResourceException associated with Font dialog (Java only)
6 March 2011 - VietOCR v2.0.1/v3.1.1 & VietOCR.NET v2.0.1
- Fix a bug that hangs the program if x.DangAmbigs.txt contains entries starting with an equals sign
- Improve postprocessing performance by caching the word list used; reload only if it changes
- Fix a bug that crashes the program when inline spellcheck suggests on empty text (.NET only)
- Incorporate Apple Java Extensions (Java only)
6 February 2011 - VietOCR v3.1
- Port changes from version 2.0
- Update Tesseract OCR engine to 3.01 (r551)
6 February 2011 - VietOCR v2.0 & VietOCR.NET v2.0
- Upgrade JACOB library to version 1.15-M4 (Java only)
- Add support for spellcheck suggestion in context menu
- Improve program accessibility and usability
- Add support for downloading and installing language data packs and appropriate spell dictionaries
- Add UI localization for Lithuanian and Slovak
- Refactor by breaking up large classes into smaller ones
11 November 2010 - VietOCR.NET v1.9
- Integrate Hunspell library to provide spellchecking and spellcheck-as-you-type functionality. Include English and Vietnamese dictionaries
- Add support for a custom dictionary
- List files generated from PDF conversion in the correct order
3 October 2010 - VietOCR v3.0
- Upgrade Tesseract OCR engine to 3.0
- Replace old format (2.0x) language data with new format (3.0) language data
- Change datafile suffix from .inttemp to .traineddata
2 October 2010 - VietOCR v1.9
- Integrate a Java binding for Hunspell library to provide spellchecking and spellcheck-as-you-type functionality. Include English and Vietnamese dictionaries
- Add support for a custom dictionary
- List files generated from PDF conversion in the correct order
- Upgrade JACOB library to version 1.15-M3
- Preset Tesseract path on Linux to /usr/bin, the default install location of Tesseract
6 September 2010 - VietOCR v1.8 & VietOCR.NET v1.8
- Display image information
- Add Screenshot Mode, which rescales low-resolution images to 300 DPI to be more suitable for OCR operations
- Read output and error streams to prevent the subprocess from blocking or deadlocking due to the limited buffer size of the standard output streams (Java version)
- Fix a problem in which paste (image) event fires twice (Java version)
- Fix an issue with subimages generated by selection box on Linux (Java version)
14 April 2010 - VietOCR v1.7 & VietOCR.NET v1.7
- Add provision to load UTF-8 text file into textbox
- Add Recent Files submenu
- Add Save button on toolbar
- Fix scale factor, offset issues in image manipulation
- Improve postprocessing for Vietnamese
- Add support for more VNI fonts to Vietnamese language data
30 January 2010 - VietOCR v1.6 & VietOCR.NET v1.6
- Fix an image size issue and associated scale factor when toggling between Fit Image vs. Actual Size after (Java) resizing window or (.NET) scrolling in picturebox
- Add unit test
- Improve post-OCR correction for Vietnamese
- Bundle Vietnamese language data for VNI & TCVN3 (ABC) fonts
21 December 2009 - VietOCR v1.5 & VietOCR.NET v1.5
- Add support for execution from command line
- Add support for paste image from clipboard
- Add support for JPEG2000 and PNM image types (Java version)
24 October 2009 - VietOCR v1.4 & VietOCR.NET v1.4
- Publish OCR interim results to produce more responsive UI performance, improving user experience
- Support cancellation of running OCR tasks
- Merge PDF functionality
3 October 2009 - VietOCR v1.3 & VietOCR.NET v1.3
- Improve exception handling with appropriate error messages
- Improve handling of PDF documents that have many pages. Putting too many images, as a result of PDF extraction, in a multi-page TIFF will eventually generate out-of-memory exceptions
- Split PDF functionality
10 September 2009 - VietOCR v1.2 & VietOCR.NET v1.2
- Integrate PDF support using GPL Ghostscript
6 September 2009 - VietOCR v1.1 & VietOCR.NET v1.1
- Merge TIFF functionality
1 August 2009 - VietOCR.NET v1.0
- Update to Tessnet2 .NET wrapper for Tesseract 2.04 OCR engine
- Add more language codes to ISO639-3.xml file
- Fix a long-standing issue with accessing original image from a different thread; it would throw an InvalidOperationException with message "Object is currently in use elsewhere"
- Add support for custom text replacement in postprocessing
1 August 2009 - VietOCR v1.0.1
- Refactor for improvements
18 July 2009 - VietOCR v1.0
- Update to Tesseract 2.04 engine (bundled Windows executable)
- Add more language codes to ISO639-3.xml file
- Add a pangram.xml file for displaying appropriate Preview text in the Font Dialog for the OCR language currently selected
- Move various settings to the Options dialog
- Remove the option of Locating Tesseract on Windows. Current Tesseract is the executable bundled inside the program
- Add support for custom text replacement in postprocessing
21 June 2009 - VietOCR v0.9.13 & VietOCR.NET v0.9.4
- Update to Tesseract 2.04RC engine
- Add indeterminate progressbar for (more animated) task status
- Add All Image Files filter
- Remove Vietnamese-glyph font filter to now show all system fonts (Java version only)
- Change FontDialog's default Preview text to the standard English pangram to make it more universal (Java version only)
- Modify SimpleFilter to accept multiple file extensions (Java version only)
25 April 2009 - VietOCR v0.9.12
- Fix the way the TESSDATA_PREFIX environment variable is handled on Linux
- Clean up temporary files if errors occur during OCR operations
- Fix a regression EOL bug with output files in Windows
- Display appropriate error message during batch process
25 April 2009 - VietOCR.NET v0.9.3
- Refactor
- Display appropriate error message during batch process
15 March 2009 - VietOCR v0.9.11 & VietOCR.NET v0.9.2
- Add text formatting functionality
18 January 2009 - VietOCR v0.9.10 & VietOCR.NET v0.9.1
- Add watch folder functionality for Batch Processing support
10 January 2009 - VietOCR v0.9.9 & VietOCR.NET v0.9
- Revamp localization codes
- Add rudimentary support for English postprocessing
30 November 2008 - VietOCR v0.9.8 & VietOCR.NET v0.8
- Minor fixes and various improvements
10 November 2008 - VietOCR v0.9.7 & VietOCR.NET v0.7
- Implement image rotation functionality
2 November 2008 - VietOCR v0.9.6
- Fix an error with path in Linux
- Add instructions for configuring Tesseract on Linux
28 October 2008 - VietOCR v0.9.5 & VietOCR.NET v0.6
- Integrate scanning support via WIA Automation Library v2.0
21 October 2008 - VietOCR v0.9.4 & VietOCR.NET v0.5
- Localize user interface
1 August 2008 - VietOCR.NET v0.4.2
- Add file drag-n-drop support
- Add selection rectangle
- Make Picturebox scrollable with mouse wheel
- Replace RTF-based help form with HTML-based
10 June 2008 - VietOCR.NET v0.2
- Integrate Tessnet2 .NET wrapper for Tesseract 2.03 OCR engine
- Use Vietnamese language data for Tesseract 2.03 (data for 2.01 crashes frequently with Tesseract 2.03)
- Add Vietnamese input methods
- Add post-processing for Vietnamese language
26 April 2008 - VietOCR v0.9.3 (aka, jtOCR)
- Proof-of-concept design
- Support TIFF image formats
- Add support for JPEG, GIF, BMP, PNG formats
- Add post-processing for Vietnamese to improve accuracy
- Add Vietnamese input methods
- Add recognition of selected area on image
- Add file drag-drop
- Add a context menu for the textarea
- Add support for selection of Look and Feel
- Display appropriate message when Tesseract engine crashes
- Fix the issue involving filepaths containing spaces
- Bundle JAI Image I/O 1.1 library
- Use Java 6.0
- Use Tesseract 2.03 OCR engine
- Use Vietnamese language data for Tesseract 2.03 (data for 2.01 crashes frequently with Tesseract 2.03)