Voiced by Amazon Polly |
Introduction
Amazon Bedrock Data Automation is a powerful new capability designed to simplify and accelerate the extraction of insights from unstructured, multimodal data—such as documents, images, audio, and video. Fully managed and easy to integrate, it enables organizations to build intelligent document processing (IDP), media analysis, and Retrieval-Augmented Generation (RAG) workflows with speed and cost-efficiency. Whether you’re summarizing key video moments, analysing complex documents, or detecting inappropriate content, Bedrock Data Automation delivers customizable outputs tailored to your unique business needs. It can be used independently or as a parser within knowledge bases to enhance RAG workflows.
Explore and Interpret Information in an Interactive Visual Environment
- No upfront cost
- Row level security
- Highly secure data encryption
Amazon Bedrock Data Automation
Amazon Bedrock Data Automation is a new capability within Amazon Bedrock that makes it easier to extract meaningful insights from unstructured, multimodal content like documents, images, videos, and audio. It provides developers with a unified, API-driven experience for processing this content through a single interface—no need to juggle multiple AI services or models.
This feature supports four modalities—documents, images, video, and audio—and uses a consistent asynchronous inference API. All output results are delivered to an Amazon S3 bucket, streamlining integration into your applications. To promote trust and accuracy, it includes built-in safeguards like visual grounding and confidence scores.
For each type of media, you can choose between two output formats:
- Standard output: Provides predefined insights based on the content type, such as semantic document representations, video scene summaries, or audio transcripts. You can easily configure which insights to extract.
- Custom output: Allows for more tailored results using “blueprints” that define exactly what to extract and how to format it. This is useful for aligning insights with specific business requirements or formatting them for downstream systems like databases.
During the preview, standard output is supported across all modalities, while custom output is currently available for documents and images only. Both output types can be configured and saved in a project, allowing you to generate both simultaneously when processing files.
Exploring Amazon Bedrock Data Automation
Let’s explore data automation capability for sample use cases for different modality-document, image, video and audio.
Go to Amazon Bedrock console and select Data Automation section of the navigation pane, where the overview and use cases are discussed. Select See Demo, and under Get Started, select the option for sample demo or upload a file from computer or import form S3 bucket.
Let’s get insight for different modality one by one.
- Get Insights from Document/Image Modality:
Amazon Bedrock Data Automation for document/Image modality supports only PDF, JPEG, PNG, and TIFF file types and generates standard and custom output. Consider I am working with finance application, which needs to process customers bank statement. I uploaded the bank statement and see the standard output.
You can specify the granularity of result generated like page level, element level, text format and output format.
Document modality also generates custom output. Application needs further insights which are not extracted by standard output, you can create a custom blueprint and customize the results generated as per application requirement.
Similarly, you can get insights from image like image summarization, image text recognition, content moderation. Consider an application needs to get insights from travel advertisement, based on which a travel recommendation can be generated.
One can customize the output to get more insights like destination, mood and atmosphere to better generate travel recommendations.
- Get Insights from Video/Audio Modality:
Amazon Bedrock Data Automation for Video/Audio modality supports only MP4, MOV with H.264, H.265, VP8, VP9 video codec, FLAC, M4A, MP3, Ogg and WAV file types and generates only standard output. Consider I am working with an out-fit recommendation application, which needs to process video of recent fashion show.
Similarly, insights from audio file of customer support call can be used by an application for sentiment detection.
Conclusion
Amazon Bedrock Data Automation offers a powerful and streamlined solution for extracting insights from unstructured, multimodal content such as documents, images, videos, and audio. By providing both standard and customizable outputs through a unified API-driven interface, it enables organizations to quickly build intelligent, insight-driven applications across use cases like financial document processing, travel recommendation generation, fashion trend analysis, and customer sentiment detection. With support for flexible output formats, easy integration, and built-in safeguards, Bedrock Data Automation simplifies complex data workflows and empowers businesses to unlock the full value of their content with greater speed, accuracy, and efficiency.
Transform Your Career with AWS Certifications
- Advanced Skills
- AWS Official Curriculum
- 10+ Hand-on Labs
About CloudThat
CloudThat is a leading provider of Cloud Training and Consulting services with a global presence in India, the USA, Asia, Europe, and Africa. Specializing in AWS, Microsoft Azure, GCP, VMware, Databricks, and more, the company serves mid-market and enterprise clients, offering comprehensive expertise in Cloud Migration, Data Platforms, DevOps, IoT, AI/ML, and more.
CloudThat is the first Indian Company to win the prestigious Microsoft Partner 2024 Award and is recognized as a top-tier partner with AWS and Microsoft, including the prestigious ‘Think Big’ partner award from AWS and the Microsoft Superstars FY 2023 award in Asia & India. Having trained 650k+ professionals in 500+ cloud certifications and completed 300+ consulting projects globally, CloudThat is an official AWS Advanced Consulting Partner, Microsoft Gold Partner, AWS Training Partner, AWS Migration Partner, AWS Data and Analytics Partner, AWS DevOps Competency Partner, AWS GenAI Competency Partner, Amazon QuickSight Service Delivery Partner, Amazon EKS Service Delivery Partner, AWS Microsoft Workload Partners, Amazon EC2 Service Delivery Partner, Amazon ECS Service Delivery Partner, AWS Glue Service Delivery Partner, Amazon Redshift Service Delivery Partner, AWS Control Tower Service Delivery Partner, AWS WAF Service Delivery Partner, Amazon CloudFront, Amazon OpenSearch, AWS DMS, AWS Systems Manager, Amazon RDS, and many more.

WRITTEN BY Rashmi D
Rashmi Dhumal is working as a Subject Matter Expert in AWS Team at CloudThat, India. Being a passionate trainer, “technofreak and a quick learner”, is what aptly describes her. She has an immense experience of 20+ years as a technical trainer, an academician, mentor, and active involvement in curriculum development. She trained many professionals and student graduates pan India.
Comments