For many years, image processing and address recognition technologies have been used to speed up mail sorting and reduce manual data entry costs. With todays increased competition in the postal business, changing customer behaviors and uncertain mail volumes, it becomes even more critical to quickly respond to market change. The recent growth in e-commerce has also caused a rise in international and domestic postal parcel traffic and raised interest in automatic parcel sorting. Since parcel processing is labor intensive, reducing costs associated with parcel sorting and expedient delivery has also received increased urgency.
Postal and courier parcel sorting centers currently introducing or upgrading parcel sorting systems are at an advantage over mail centers that were implementing automation of letters and flats in past years. Today, more mature technology and business models provide higher savings to make solutions viable. Furthermore, accumulated experience can help in arriving at the most efficient and cost-effective solutions for automated parcel sorting.
In particular, innovative universal optical character recognition (OCR) solutions, already implemented in letters and flats sorting automation, may be adapted and successfully applied to the parcel industry. These modularized systems are not tied to specific hardware platforms and can be easily implemented in order to provide an accurate, fast, flexible and cost-effective solution in an existing environment. Until recently, suppliers were offered only integrated solutions that included both OCR and transport components. This tight integration of sorting equipment and OCR modules hampered the periodic replacement or update of just one of the components. An OCR/Video Coding Systems (OCR/VCS) open interface standard was created over the past two years to enable postal operators to work with different suppliers for needed replacements or expansions to sub-systems without incurring significant engineering costs. Universal OCR systems that may be considered separately from the upgrade of sorting equipment make it possible to unify automation efforts and provide address recognition improvements for several mail stream types simultaneously.
There are other advantages of universal technology that are worth considering. First, the capability of an OCR engine to provide high performance levels on both machine-printed and handwritten addresses makes automation investments more viable despite the ratio of handwritten and machine-print addresses in the full mail stream. Second, universal technology automatically identifies and locates address blocks and labels on any parcel, despite its format, and requires comparatively lower expenses to adapt and customize recognition components for specific countries (accounting for address formats and structure, coding rules, unique handwriting styles, etc.) This approach requires lower development expenses and makes even small business opportunities feasible and attractive.
Universal OCR solutions currently available on the market were designed to answer the entire range of market needs and proved their efficiency in a number of postal and parcel automation projects worldwide. The software incorporates the latest achievements in artificial intelligence in areas such as neural network technology, fuzzy logic, hidden Markov models, multiple independent engines and contextual information. Using these innovative methods, universal OCR successfully resolves address location and recognition challenges that parcel images pose. In particular, they allow the software to successfully solve the task of Region of Interest (ROI) location. Though often underestimated or taken for granted, ROI location is a necessary step in applications for postal and courier parcel sorting and is a key function to the success of the solution overall. In cases where parcel images contain several blocks of information, like senders addresses, company logos and barcodes, as well as the destination address, ROI location is one of the main challenges.
Finding the small address block amidst different pieces of data should be approached in a very precise way. Yet the process must have flexibility to accommodate different applications. First, it is necessary to separate the image of a mailpiece from the background. Here, an ROI is the image of the mailpiece, which is found within a larger image, which also captures a piece of a conveyer belt used to transport the mailpiece. The software separates the ROI, cleaning up the image for processing. Then, the second-level ROI represents the required information found in either a simple address block or a label containing an address block. Finally, for labels, which contain additional information, such as the return address, phone number, account number or barcode, the destination address block is located. While
processing labels, a new ROI location task is assigned to extract the destination address data only. Sophisticated ROI algorithms filter out irrelevant objects at this stage.
Universal OCR removes background noise from the parcel images and identifies address blocks and labels, as well as detects images missing a destination address. Unique label-type independent techniques can be used to locate address blocks on any parcel and provide a universal solution for a mixed stream of packages.
When the address is located, the software reads machine-printed and handwritten addresses on parcel images that are often of poor quality, rotated at random angles, skewed and often include multiple types of noise.
Universal address recognition solutions are now available to parcel processors to locate and recognize country-specific addresses on any parcel. These solutions enable global automation and raise quality and data accuracy standards regardless of geographic location.
Dr. Tatiana Vazioulina has worked in product management and marketing at Parascript, LLC., a leader in address recognition and interpretation, for a decade. She can be reached at marketing@parascript.com or 303-381-3100.