Custom templates

Custom templates represent the pinnacle of personalization and adaptability in AI. These templates allow you to train the AI to understand the specific structure of your documents with just five examples, and allow for the integration of human input to create highly personalized solutions and extract only the information you want. Ideal for documents with less common structures, custom templates allow you to achieve optimal results in unique situations. They are the perfect choice for those seeking maximum customization and precision in document processing.

When should I use custom templates?

Custom templates should be used when standard templates fail to produce satisfactory results.

Standard templates use predefined labels to extract text from documents, but sometimes they fail to make a correct association between the text and the labels, especially when dealing with documents with complex or unclear structures.

In such situations, it is possible to create a custom model to train artificial intelligence (AI) to read these documents correctly. You can think of the custom model as a container in which you can insert several examples of a document with the same structure but different data. For this document, you can specify where to find the desired information and what name to give to each label. This step is essential to ensure correct reading by your business management system (ERP) or your customer relationship management (CRM) system.

Once the AI has been trained to read documents similar to those loaded into the “container” of the custom model, you need to select the “Start Scan” menu item and then choose “custom templates”. This way, Retica will be able to correctly identify and associate the text of the documents with the labels defined during the training of the custom model.

Thanks to this process, Retica will be able to correctly recognize the texts in the documents you want to process, since it will identify similar and already labeled documents in the container of the selected custom model.

Custom model input requirements

First, ensure that your training dataset meets the input requirements:

For best results, we recommend providing clear images or high-quality analytics for each document.
The following file formats can be processed: PDF, JPEG/JPG, PNG, BMP, TIFF, HEIF.
For PDF and TIFF files, the maximum processing capacity is 2000 pages. Also, ensure that the file size for document analytics does not exceed 500 MB.
The image dimensions must be in the range of 50 x 50 pixels to 10,000 px x 10,000 pixels.
If your PDF files are password protected, it is essential to remove the password before submitting them.
For text extraction, consider that the minimum height should be 12 pixels in a 1024 x 768 pixel image, which is approximately 8 points at 150 dots per inch (DPI).
For training custom models, the maximum page limit for training data is 500.

Optimal Training Data

Training input data is the foundation of any machine learning model. It determines the quality, accuracy, and performance of the model. Therefore, it is critical to create the best possible training input data. Here are some tips for effectively training models:

Use text-based PDFs instead of image-based PDFs whenever possible. One way to identify an image-based PDF is to try to select specific text in the document. If you can only select the entire image of the text, the document is image-based, not text-based.
Use forms with all available fields filled in.
Use forms with different values in each field.
If the images are of low quality, use a larger dataset, that is, more than five training documents.
Determine whether you need to use a single model or multiple models compounded into a single model.
If the form has variations with formats and page breaks, consider segmenting the dataset to train multiple models.
Custom forms are based on a consistent visual model.
Make sure you have a balanced data set considering formats, document types and structure.

Creating a custom template

To create a new custom template, you need to select the “Custom Templates” section from the left menu, and when the interface opens, press the “Create Template” button.