The "Image Description" generative AI application showcases the capability of generating accurate and detailed multilingual textual descriptions for multiple images by leveraging the power of AWS services and the AWS Generative AI Cloud Development Kit (CDK) Constructs. This application harnesses the potential of state-of-the-art generative AI models to provide users with valuable insights and a comprehensive understanding of visual data.
The application features a user-friendly Streamlit interface, enabling users to authenticate securely via Amazon Cognito, a robust and highly scalable identity management service. Upon successful authentication, users can seamlessly upload images, and the application will generate descriptive text for the uploaded visual data. The description can be generated in different languages by selecting the desired language option in the sidebar. Multiple images can be uploaded simultaneously, and the application will generate descriptions for all of them.
Under the hood, the AWS Generative AI CDK Constructs leverage the Anthropic Claude 3 generative AI model, integrated with Amazon Bedrock, a fully managed service for building and deploying machine learning models. This integration enables the application to generate accurate and contextually relevant descriptions for the uploaded images, providing users with a comprehensive understanding of the visual data.
The AWS Generative AI CDK Constructs simplify the deployment and management of this complex architecture, enabling developers to focus on building innovative applications while leveraging the power of AWS services and generative AI models.
This sample application codebase is organized into folders : the backend code lives in bin/image-description.ts
and uses the AWS CDK resources defined in the lib
folder.
The key folders are:
samples/image-description
│
├── bin
│ └── image-description.ts # CDK app
├── lib # CDK Stacks
│ ├── image-description.ts-stack.ts # Stack deploying the S3 bucket, Bedrock Agent, Action Group, and Knowledge Base
├── client_app # Streamlit
│ └── pages
└── image-description.py
└── Home.py # Streamlit landing
To deploy this Image description application, follow these steps to set up the required tools and configure your AWS environment:
-
An AWS account.
-
AWS CLI: configure your credentials
- aws configure --profile [your-profile]
- AWS Access Key ID [None]: xxxxxx
- AWS Secret Access Key [None]:yyyyyyyyyy
- Default region name [None]: us-east-1
- Default output format [None]: json
-
Node.js: v18.12.1
-
AWS CDK: 2.68.0
-
jq: jq-1.6
-
Docker - This construct builds a Lambda function from a Docker image, thus you need Docker desktop running on your machine.
This project is built using the AWS Cloud Development Kit (CDK). See Getting Started With the AWS CDK for additional details and prerequisites.
-
Clone this repository.
git clone <this>
-
Enter the code sample backend directory.
cd samples/image-description
-
Install packages
npm install
-
Boostrap AWS CDK resources on the AWS account.
cdk bootstrap aws://ACCOUNT_ID/REGION
-
Enable Access to Amazon Bedrock Models
You must explicitly enable access to models before they can be used with the Amazon Bedrock service. Please follow these steps in the Amazon Bedrock User Guide to enable access to the models (at minimum,
Anthropic::Claude
):.
-
Deploy the sample in your account.
$ cdk deploy --all
-
Configure client_app
cd client_app python -m venv venv source venv/bin/activate pip install -r requirements.txt
-
Still within the /client_app directory, create an
.env
file with the following content. Replace the property values with the values retrieved from the stack outputs/console.
COGNITO_DOMAIN="<ImageDescStack.CognitoDomain>"
REGION="<ImageDescStack.Region>"
USER_POOL_ID="<ImageDescStack.UserPoolId>"
CLIENT_ID="<ImageDescStack.ClientId>"
CLIENT_SECRET="COGNITO_CLIENT_SECRET"
IDENTITY_POOL_ID="<ImageDescStack.IdentityPoolId>"
APP_URI="http://localhost:8501/"
AUTHENTICATED_ROLE_ARN="<ImageDescStack.AuthenticatedRoleArn>"
GRAPHQL_ENDPOINT = "<ImageDescStack.GraphQLEndpoint>"
S3_INPUT_BUCKET = "<ImageDescStack.InputsAssetsBucket>"
S3_PROCESSED_BUCKET = "<ImageDescStack.processedAssetsBucket>"
Note: The COGNITO_CLIENT_SECRET
is a secret value that can be retrieved from the AWS Console. Go to the Amazon Cognito page in the AWS console, then select the created user pool. Under App integration, select App client settings. Then, select Show Details and copy the value of the App client secret.
- Run client_app
streamlit run Home.py
- Open the browser and go to http://localhost:8501/
- Click login and sign up a user for the first time access.
- Click on Image Description on left panel.
- Upload a file and select preferred configuration on left panel.
- The image along with the generated summary should be displayed on the central panel.
Please note - Ensure your virtual environment is free from SSL certificate issues. If any SSL certificate issues are present, reinstall the CA certificates and OpenSSL package using the following command:
brew reinstall ca-certificates openssl
Do not forget to delete the stack to avoid unexpected charges.
First make sure to remove all data from the Amazon Simple Storage Service (Amazon S3) Buckets. Then:
$ cdk destroy --all
Then in the AWS Console delete the S3 buckets.
The SummarizationAppsyncStepfnProps
interface can be utilized to pass custom properties to the SummarizationAppsyncStepfn
construct. In the event that no optional properties are provided, the construct will create the resources with their default configurations.
// Default summarization construct
//-----------------------------------------------------------------------------
import * as emergingTech from '@cdklabs/generative-ai-cdk-constructs';
const summarization = new emergingTech.SummarizationAppsyncStepfn
(this, 'ImageSummarization', {
cognitoUserPool: this.cognitoPool,
observability: true,
});
The sample code; software libraries; command line tools; proofs of concept; templates; or other related technology (including any of the foregoing that are provided by our personnel) is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this AWS Content in your production accounts, or on production or other critical data. You are responsible for testing, securing, and optimizing the AWS Content, such as sample code, as appropriate for production grade use based on your specific quality control practices and standards. Deploying AWS Content may incur AWS charges for creating or using AWS chargeable resources, such as running Amazon EC2 instances or using Amazon S3 storage.
This solution collects anonymous operational metrics to help AWS improve the quality and features of the solution. Data collection is subject to the AWS Privacy Policy (https://aws.amazon.com/privacy/). To opt out of this feature, simply remove the tag(s) starting with “uksb-” or “SO” from the description(s) in any CloudFormation templates or CDK TemplateOptions.