Amazon Comprehend Trust and Safety

Detect undesired content in text

Why Amazon Comprehend Trust and Safety?

Amazon Comprehend’s Trust and Safety features help organizations moderate text content. The Amazon Comprehend toxicity detection API is an ML-powered capability that identifies toxic content by classifying user-generated or machine-generated text across seven categories: sexual harassment, hate speech, violence/threat, abuse, profanity, insult, and graphic. The Amazon Comprehend prompt safety classifier moderates the input prompts sent to generative AI applications to help prevent inappropriate use. Finally, the Comprehend PII detection API can help prevent PII leaks by redacting detected personal information from generative AI output.

Benefits of Amazon Comprehend Trust and Safety

Quickly and accurately moderate large volumes of text and keep your online platforms free from inappropriate content.
Customize the moderation thresholds in API responses to suit your application needs.
Deploy Comprehend APIs through LangChain to moderate the inputs and outputs of LLMs.
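
As an illustration of custom moderation thresholds, the minimal Python sketch below applies a per-category cutoff to a list of category scores. The threshold values, the helper name flagged_categories, and the uppercase label names are assumptions made for this example; adjust them to match the labels and scores your application actually receives.

```python
# Hypothetical per-category thresholds; the values are illustrative only.
DEFAULT_THRESHOLD = 0.5
CATEGORY_THRESHOLDS = {"HATE_SPEECH": 0.3, "PROFANITY": 0.8}


def flagged_categories(labels, thresholds=CATEGORY_THRESHOLDS, default=DEFAULT_THRESHOLD):
    """Return the names of labels whose score meets or exceeds its threshold."""
    return [
        label["Name"]
        for label in labels
        if label["Score"] >= thresholds.get(label["Name"], default)
    ]


# Sample scores shaped like a list of {"Name", "Score"} entries (made-up data).
sample_labels = [
    {"Name": "HATE_SPEECH", "Score": 0.35},
    {"Name": "PROFANITY", "Score": 0.60},
    {"Name": "INSULT", "Score": 0.55},
]
print(flagged_categories(sample_labels))  # ['HATE_SPEECH', 'INSULT']
```

A stricter threshold for one category and a looser one for another lets a platform tolerate mild profanity, for example, while still flagging lower-confidence hate speech.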

Use cases

Amazon Comprehend toxicity detection classifies text content and provides a confidence score (0 to 1) for the following seven categories: sexual harassment, hate speech, violence/threat, abuse, profanity, insult, and graphic.
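
A minimal sketch of calling toxicity detection from Python follows, assuming the boto3 SDK with AWS credentials and a region already configured. The helper names detect_toxicity and scores_by_label are hypothetical; the response fields used (ResultList, Labels, Name, Score, Toxicity) are those of the DetectToxicContent operation as understood at the time of writing, so check them against the current API reference.

```python
def scores_by_label(result):
    """Flatten one result entry into a {label_name: score} dict."""
    scores = {label["Name"]: label["Score"] for label in result["Labels"]}
    # Overall toxicity score for the segment, stored under an assumed key name.
    scores["TOXICITY"] = result["Toxicity"]
    return scores


def detect_toxicity(texts, language_code="en", client=None):
    """Return one {label: score} dict per input text segment."""
    if client is None:
        import boto3  # imported lazily so the parsing logic can be used offline

        client = boto3.client("comprehend")
    response = client.detect_toxic_content(
        TextSegments=[{"Text": text} for text in texts],
        LanguageCode=language_code,
    )
    return [scores_by_label(result) for result in response["ResultList"]]
```

Calling detect_toxicity(["first comment", "second comment"]) would return two dictionaries, one per segment, each mapping a category name to its confidence score.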

The prompt safety classifier provides a confidence score (0 to 1) indicating whether an input prompt is safe or unsafe.
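
The sketch below shows one way a prompt could be screened before it reaches an LLM, assuming the classifier is invoked through the boto3 classify_document call with an endpoint ARN supplied by the caller. The class name UNSAFE_PROMPT, the 0.5 threshold, and the helper names are assumptions for this example; confirm the class names and the endpoint ARN for your region in the Comprehend documentation.

```python
UNSAFE_CLASS = "UNSAFE_PROMPT"  # assumed class name in the classifier response


def unsafe_prompt_score(response):
    """Extract the confidence score of the unsafe class from a response dict."""
    for cls in response["Classes"]:
        if cls["Name"] == UNSAFE_CLASS:
            return cls["Score"]
    raise ValueError(f"{UNSAFE_CLASS} not found in classifier response")


def is_prompt_safe(prompt, endpoint_arn, threshold=0.5, client=None):
    """Return True when the unsafe score is below the chosen threshold."""
    if client is None:
        import boto3  # imported lazily; requires configured AWS credentials

        client = boto3.client("comprehend")
    response = client.classify_document(Text=prompt, EndpointArn=endpoint_arn)
    return unsafe_prompt_score(response) < threshold
```

An application would typically call is_prompt_safe before forwarding user input to the model and return a refusal message when it yields False.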

Comprehend PII detection can mask up to 22 universal PII entities, such as address, age, and credit card number, and up to 14 country-specific entities, such as US Social Security number, CA health number, and passport number.
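
As a sketch of how masking could work on model output, the Python example below replaces each detected span with its entity type in brackets. It assumes the boto3 detect_pii_entities call, whose entities carry Type, Score, BeginOffset, and EndOffset fields; the helper names, the bracketed mask format, and the 0.5 minimum score are choices made for this example only.

```python
def redact(text, entities, min_score=0.5):
    """Replace each sufficiently confident entity span with [TYPE]."""
    # Work from the end of the string so earlier offsets remain valid.
    for entity in sorted(entities, key=lambda e: e["BeginOffset"], reverse=True):
        if entity["Score"] >= min_score:
            start, end = entity["BeginOffset"], entity["EndOffset"]
            text = text[:start] + f"[{entity['Type']}]" + text[end:]
    return text


def redact_pii(text, language_code="en", client=None):
    """Detect PII with Comprehend and return the masked text."""
    if client is None:
        import boto3  # imported lazily; requires configured AWS credentials

        client = boto3.client("comprehend")
    response = client.detect_pii_entities(Text=text, LanguageCode=language_code)
    return redact(text, response["Entities"])


# Made-up entities shaped like a detection response, for an offline check.
sample_entities = [
    {"Type": "NAME", "Score": 0.99, "BeginOffset": 5, "EndOffset": 9},
    {"Type": "PHONE", "Score": 0.98, "BeginOffset": 13, "EndOffset": 21},
]
print(redact("Call Jane at 555-0100.", sample_entities))  # Call [NAME] at [PHONE].
```

Keeping the entity type in the mask preserves some context for downstream readers; a fixed mask character could be substituted if the type itself should not be revealed.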