TipPanel provides powerful moderation tools to help you manage user messages, filter inappropriate content, and maintain a positive environment for your audience.
TipPanel offers three levels of message moderation to help you manage user-generated content:
A dashboard where you can manually review, approve, or reject messages before they appear publicly.
An automated filter that blocks messages containing specific words or phrases you define.
Advanced AI-powered content moderation that can detect inappropriate content across multiple categories.
These moderation tools can be used individually or in combination to create a moderation system that fits your needs. For example, you can:
The moderation queue is a dashboard where you can review, approve, or reject messages before they appear publicly on your widget.
To access the moderation queue:
The moderation queue displays all messages that are pending review, with the most recent messages at the top.
For each message in the queue, you can see:
Information | Description |
---|---|
Message Content | The text of the message |
Sender Information | Name and email (if provided) |
Campaign | The campaign the message was submitted to |
Timestamp | When the message was submitted |
Payment Amount | The amount paid (if applicable) |
Flag Reason | Why the message was flagged (if applicable) |
To moderate a message, you have three options:
Click the Approve button to accept the message. The message will be displayed publicly on your widget.
Click the Reject button to decline the message. The message will be removed from the queue and will not be displayed.
Click the Flag button to mark the message for further review later. The message will remain in the queue.
If you have many messages to moderate, you can use bulk actions to process multiple messages at once:
The banned words filter automatically blocks messages containing specific words or phrases that you define. This is useful for filtering out profanity, spam, or other unwanted content.
To manage your banned words list:
On the Banned Words page, you can:
You can also import or export your banned words list:
You can customize how the banned words filter works:
Setting | Description | Default |
---|---|---|
Filter Action | What happens when a banned word is detected:
|
Block |
Match Type | How banned words are matched:
|
Exact |
Case Sensitivity | Whether matching is case-sensitive:
|
Insensitive |
Use Default List | Whether to use TipPanel's default profanity list | Enabled |
To change these settings:
You can test your banned words filter to see if it works as expected:
The test will show you:
TipPanel offers AI-powered content moderation that can detect inappropriate content across multiple categories. This feature uses OpenAI's moderation API to analyze messages and identify potentially problematic content.
To set up AI moderation:
The AI moderation system can detect content in the following categories:
Category | Description | Default |
---|---|---|
Hate | Content that expresses, incites, or promotes hate based on race, gender, ethnicity, religion, nationality, sexual orientation, disability status, or caste | Enabled |
Harassment | Content that expresses, incites, or promotes harassing language towards any target | Enabled |
Self-Harm | Content that promotes, encourages, or depicts acts of self-harm, such as suicide, cutting, and eating disorders | Enabled |
Sexual | Content meant to arouse sexual excitement, such as the description of sexual activity, or that promotes sexual services (excluding sex education and wellness) | Enabled |
Violence | Content that promotes or glorifies violence or celebrates the suffering or humiliation of others | Enabled |
You can enable or disable each category based on your moderation needs. For example, if you're running a support widget for a mental health website, you might want to disable the Self-Harm category to allow users to discuss these topics.
For each moderation category, you can set the sensitivity level:
Only flags content with a high probability of violating the category. May miss some borderline content but has fewer false positives.
Balanced approach that flags content with a moderate probability of violating the category. Default setting for most categories.
Flags content with even a low probability of violating the category. May have more false positives but catches more borderline content.
You can also set what happens when the AI detects content in a category:
Automatically reject messages that violate the category
Send messages that violate the category to the moderation queue for manual review
You can configure global moderation settings that apply to all messages:
Setting | Description | Default |
---|---|---|
Pre-Moderation | Whether all messages require approval before being displayed:
|
Disabled |
Payment Bypass | Whether messages with payments bypass moderation:
|
Disabled |
Moderation Order | The order in which moderation tools are applied:
|
Banned Words → AI |
To change these settings: