Voiced by Amazon Polly |
1. Introduction to Azure Form Recognizer
Azure Form Recognizer is a cloud-based Azure Applied AI Service that extracts key-value pairs, text, and tables from your documents using AI-powered data solutions. Form Recognizer examines your documents and forms, extracting text and data, mapping field relationships as key-value pairs, and returning a structured JSON output.
Here is a blog on Automate Report Extraction with Microsoft Azure Cognitive Service: Form Recognizer – Part 1 that can help you refresh your understanding.
Let us get started with the Form Recognizer Sample Labeling tool:
The Form Recognizer Sample Labeling tool is an open-source tool that enables you to test the latest features of Azure Form Recognizer and Optical Character Recognition (OCR) services:
- Analyze documents with the Layout API: Extract text, tables, selection marks, and structure from documents.
- Analyze documents using a prebuilt model: Start with a prebuilt model to extract data from invoices, receipts, identity documents, or business cards.
- Train and analyze a custom Form: Use a custom model to extract data from documents specific to distinct business data and use cases.
Customized Cloud Solutions to Drive your Business Success
- Cloud Migration
- Devops
- AIML & IoT
2. Prerequisites
You will need the following to get started:
- An Azure subscription
- A Cognitive Services or Form Recognizer resource: Once you have your Azure subscription, create a single-service or multi-service Form Recognizer resource in the Azure portal to get your Key and endpoint.
3. Step by Step Guide
Step 1: Sign in to the Azure Portal, search for form recognizer in the search bar and select the service
Step 2: Click on Create form recognizer.
Step 3: Provide the appropriate details as below :
- Select the appropriate subscription
- Create a new Resource group else select the required Resource group if you have already created it
- Select the region where you want to deploy the service
- Provide the appropriate name for the service
Select the pricing tier F0 which is the free tier and click on Review + create
Step 4: Once the service is deployed keep a note of Keys and Endpoint which will be required in the next steps
Step 5: Now, create a storage account with the appropriate name, region, and keep other details as default
Step 6: Select the containers from the left panel and create a new container to store the images where the public access level should be Blob
Step 7: Click on Upload. Upload the images on which you want to build the custom model of the form recognizer
Step 8: Select the storage account created in Step 5, select the CORS from the left panel and configure the settings as shown below and click on Save
Step 9: Create the SAS URL as shown below
Step 10: Copy the Blob SAS URL add the container name in the URL as shown below:
Step 11: Go to https://fott.azurewebsites.net/ and click on the Connection symbol in the left panel
- Click on the “+” at the top to add a new connection
- Provide the name for the connection and SAS URL generated in Step 9
- Click on Save Connection
Step 12: Once the connection is created go to Home, select the custom model and click on New Project
Step 13: Provide the appropriate details as given below :
- Provide the name for the project
- Select the connection created in Step 11
- Provide the endpoint of the form recognizer noted in Step 4
- Provide the Key of the form recognizer noted in Step 4 and click on Save Project
Step 14: Once the project is created the images stored in the container will be loaded on the left panel
- Add the tags as shown below by clicking on “+”. Let’s assume you have to extract the Name of the person, No. of days of leaves, and the status of the leave applied from the custom images provided
Step 15: Select the appropriate data from the image and select the relevant tags for the same. Tag all the images as shown below
Step 16: Once the tagging of images is done select the “Train” model from the left panel. Provide the name for the model and click on Train
Step 17: It will take some time to train the model. Once the model is trained it will provide the results as shown below. You can view the accuracy of the trained model
Step 18: Let’s analyze the model by providing one of the images and click on Run analysis
Step 19: It will analyze the given image and provide the desired output as shown below
4. Conclusion:
Azure Form Recognizer’s deep-learning-based universal models support many languages that can extract multi-lingual text from images and documents, including text lines with mixed languages. It uses Natural Language Processing (NLP) to detect and extract information from forms and documents supported by AI to provide more structure and information to the text extraction. Hence, it helps us to quickly get accurate results that are tailored to the specific content without excessive manual intervention or extensive data science expertise.
Get your new hires billable within 1-60 days. Experience our Capability Development Framework today.
- Cloud Training
- Customized Training
- Experiential Learning
About CloudThat
CloudThat is a leading provider of Cloud Training and Consulting services with a global presence in India, the USA, Asia, Europe, and Africa. Specializing in AWS, Microsoft Azure, GCP, VMware, Databricks, and more, the company serves mid-market and enterprise clients, offering comprehensive expertise in Cloud Migration, Data Platforms, DevOps, IoT, AI/ML, and more.
CloudThat is the first Indian Company to win the prestigious Microsoft Partner 2024 Award and is recognized as a top-tier partner with AWS and Microsoft, including the prestigious ‘Think Big’ partner award from AWS and the Microsoft Superstars FY 2023 award in Asia & India. Having trained 850k+ professionals in 600+ cloud certifications and completed 500+ consulting projects globally, CloudThat is an official AWS Advanced Consulting Partner, Microsoft Gold Partner, AWS Training Partner, AWS Migration Partner, AWS Data and Analytics Partner, AWS DevOps Competency Partner, AWS GenAI Competency Partner, Amazon QuickSight Service Delivery Partner, Amazon EKS Service Delivery Partner, AWS Microsoft Workload Partners, Amazon EC2 Service Delivery Partner, Amazon ECS Service Delivery Partner, AWS Glue Service Delivery Partner, Amazon Redshift Service Delivery Partner, AWS Control Tower Service Delivery Partner, AWS WAF Service Delivery Partner, Amazon CloudFront Service Delivery Partner, Amazon OpenSearch Service Delivery Partner, AWS DMS Service Delivery Partner, AWS Systems Manager Service Delivery Partner, Amazon RDS Service Delivery Partner, AWS CloudFormation Service Delivery Partner, AWS Config, Amazon EMR and many more.
FAQs
1. Does Form Recognizer integrate with other Microsoft services?
ANS: – Yes, Form Recognizer integrates with the following services:
- AI Builder workflows
- Azure Cognitive Search
- Azure Functions
- Azure Logic Apps
2. How is Form Recognizer related to OCR?
ANS: – Azure Form Recognizer is a cloud-based Azure Applied AI Service that is built using optical character recognition (OCR), Text Analytics, and Custom Text from Azure Cognitive Services.
3. Can I retrain a custom model?
ANS: – There is no explicit retrain operation in Form Recognizer. Each train action results in the creation of a new model
WRITTEN BY Daneshwari Mathapati
Daneshwari M is an Associate Architect at CloudThat, specializing in AWS, Python, SQL, and data analytics. She has expertise in building data pipelines, creating interactive dashboards, and optimizing cloud-based analytics solutions. Passionate about data-driven decision-making, she helps businesses turn complex data into actionable insights.
Rai Singh Rathor
Dec 20, 2022
How can I train a custom model in Azure form recognizer FOTT tool which is installed as desktop application.
Can you guide
Elton
May 17, 2022
Thanks for your article it helped me a great deal with the initial configuration of a custom model.
I did have to use the updated Studio found here: https://formrecognizer.appliedai.azure.com/