© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Build responsible AI applications
with Guardrails for Amazon Bedrock
Anubha...
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Build responsible AI applications
with Guardrails for Amazon Bedrock
Anubhav Mishra
GRC325
(he/him)
Principal Product Manager
AWS
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Building generative apps brings new challenges
Privacy protection
Protect user
information or
sensitive data
Bias/stereotype
propagation
Biased results or
unfair user
outcomes
Undesirable and
irrelevant topics
Controversial
queries and
responses
Toxicity and safety
(including
brand risk)
Harmful or
offensive responses
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Titan Claude Llama
Command + Embed Stable Diffusion
Jurassic-2 Mistral + Mixtral
Many foundation models have built-in protections
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Building generative AI apps requires additional controls
Customizations based on use cases
and organizational policy
Consistent safeguards across FMs
and applications
Safety and privacy controls for
responsible AI
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Guardrails for
Amazon Bedrock
Filter harmful content and safeguard
against prompt injection and jailbreaks
Define and disallow denied topics with
short natural language descriptions
Redact or block sensitive information,
such as PIIs, and custom regex
(regular expressions)
Implement safeguards customized to
your application requirements and
responsible AI policies
Apply guardrails to multiple foundation
models, knowledge bases, and agents for
Amazon Bedrock
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
How it works: Guardrails for Amazon Bedrock
User input FM output
Guardrail
Final response
FM inference
Responsible AI policies
Denied topics Content filters Word filter
PII redaction
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Denied topics
AVOID UNDESIRABLE TOPICS IN YOUR APPLICATIONS
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Content filters
CONFIGURE THRESHOLDS TO FILTER CONTENT
TO VARYING DEGREES
Filter harmful content across categories:
➢ Hate
➢ Insults
➢ Sexual
➢ Violence
➢ Misconduct (criminal activity)
➢ Prompt Attack (jailbreak and prompt injection)
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
➢ Redact personally identifiable
information (PII) in FM responses to
protect user privacy
➢ Detect and filter PIIs in user inputs
➢ Select from a variety of PIIs based on
application requirements
➢ Define your own sensitive information
using regular expressions (regex)