Auto-Tagging Assets
You can manually tag most assets, such as Message Boards, Blogs, Documents, Images, and Web Content Articles. Certain assets, however, can be tagged automatically when they are created. Automatically applied tags are in English only.
Auto-tagging is available for text-based documents, text-based web content, and blog entries, as well as images.
Text auto-tagging is disabled by default. To use it, enable auto-tagging in DXP, enable a text auto-tagging provider, and choose the assets to auto-tag. DXP offers two text auto-tagging providers:
- Google Cloud Natural Language Text Auto-Tagging: Uses the Google Cloud Natural Language API to analyze and tag content automatically.
- OpenNLP Text Auto-Tagging: Uses the open source Apache OpenNLP. It applies tags based on models for location names, organizations, and person names.
Use OpenNLP with caution due to potential accuracy limitations.
Configuring Google Cloud Natural Language Text Auto-Tagging
- Click the Global Menu → Control Panel.
- Click System Settings.
- Click Assets under Content and Data.
- Under VIRTUAL INSTANCE SCOPE, click Google Cloud Natural Language Text Auto-Tagging.
- Fill out the form and click Save when finished.
The form asks for this information:
API Key: The API key for the Google Cloud Natural Language API. For more information, see Google’s documentation on API keys.
Classification Endpoint Enabled: Enables auto-tagging using the classification endpoint.
Confidence: Sets the classifier’s confidence level for assigning categories.
Entity Endpoint Enabled: Enables auto-tagging using the entity endpoint.
Salience: Defines the importance of an entity in the text.
Enable Google Cloud Natural Language Text Auto-Tagging For: Choose asset types such as Document, Blogs Entry, or Web Content Article. Use the Add icon to include multiple asset types or the Trash icon to remove them.
If you have multiple virtual instances, you can override these settings for each one under Control Panel → Instance Settings → Assets.
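To make the Confidence and Salience fields more concrete, here is a minimal Python sketch of how such values could filter the results of the two Google Cloud Natural Language endpoints. It is an illustration only, not DXP's implementation. The response shapes follow the public REST API (`entities` with `name` and `salience`, `categories` with `name` and `confidence`), but the sample values, the function names, and the choice to treat both settings as inclusive minimum thresholds are assumptions.

```python
from typing import Any


def build_request(text: str) -> dict[str, Any]:
    """Build a request body accepted by documents:analyzeEntities and documents:classifyText."""
    return {"document": {"type": "PLAIN_TEXT", "content": text}}


def tags_from_entities(response: dict[str, Any], min_salience: float) -> list[str]:
    """Keep entity names whose salience meets the threshold (assumed inclusive)."""
    return [
        entity["name"]
        for entity in response.get("entities", [])
        if entity.get("salience", 0.0) >= min_salience
    ]


def tags_from_categories(response: dict[str, Any], min_confidence: float) -> list[str]:
    """Keep category names whose confidence meets the threshold (assumed inclusive)."""
    return [
        category["name"]
        for category in response.get("categories", [])
        if category.get("confidence", 0.0) >= min_confidence
    ]


# Hypothetical responses, shaped like the API's JSON output.
SAMPLE_ENTITIES = {
    "entities": [
        {"name": "Liferay", "type": "ORGANIZATION", "salience": 0.71},
        {"name": "portal", "type": "OTHER", "salience": 0.2},
    ]
}
SAMPLE_CATEGORIES = {
    "categories": [
        {"name": "/Computers & Electronics/Software", "confidence": 0.83},
        {"name": "/Business & Industrial", "confidence": 0.4},
    ]
}

if __name__ == "__main__":
    print(tags_from_entities(SAMPLE_ENTITIES, 0.5))
    print(tags_from_categories(SAMPLE_CATEGORIES, 0.5))
```

Under these assumptions, raising either value yields fewer tags, and lowering it yields more.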
Configuring OpenNLP Text Auto-Tagging
- Click the Global Menu → Control Panel.
- Click System Settings.
- Click Assets under Content and Data.
- Under VIRTUAL INSTANCE SCOPE, click OpenNLP Text Auto-Tagging.
- Fill out the form and click Save when finished.
The form asks for this information:
Confidence Threshold: Set the minimum confidence level (0 to 1, where 1 is the highest). Higher values result in fewer tags.
Enable OpenNLP Text Auto-Tagging For: Choose asset types such as Document, Blogs Entry, or Web Content Article. Use the Add icon to include multiple asset types or the Trash icon to remove them.
If you have multiple virtual instances, you can override these settings for each one under Control Panel → Instance Settings → Assets.
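The effect of the Confidence Threshold can be sketched in a few lines of Python. This is a hedged illustration rather than DXP or OpenNLP code: the candidate names, their probabilities, the function name, and the inclusive comparison are all hypothetical.

```python
def filter_by_confidence(candidates: list[tuple[str, float]], threshold: float) -> list[str]:
    """Return the names whose probability meets the threshold (assumed inclusive)."""
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be between 0 and 1")
    return [name for name, probability in candidates if probability >= threshold]


# Hypothetical (name, probability) pairs, as a name-finder model might score them.
CANDIDATES = [("Liferay", 0.92), ("Los Angeles", 0.55), ("Monday", 0.03)]

if __name__ == "__main__":
    for threshold in (0.01, 0.5, 0.95):
        print(threshold, filter_by_confidence(CANDIDATES, threshold))
```

With these sample values, a threshold of 0.01 keeps all three candidates, 0.5 keeps two, and 0.95 keeps none, which is why higher values result in fewer tags.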
Example: Blogs Auto-Tagging
Follow these steps to enable auto-tagging for blog entries using OpenNLP:
- Follow the steps in Configuring OpenNLP Text Auto-Tagging.
- In the OpenNLP Text Auto-Tagging page under VIRTUAL INSTANCE SCOPE, set the Confidence Threshold value to 0.01.
- In Enable OpenNLP Text Auto-Tagging For, select Blogs Entry.
- Click Update.
- Go to your Home Page and open the Product Menu → Content and Data → Blogs.
- Click New and create a blog post. The more text you include in the content, the more tags may be generated.
- Click Publish when finished.
- Click Actions, then click Edit.
- Select the Categorization field. Tags appear under Other Metadata.
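If you prefer to check the generated tags from a script instead of the UI, the following Python sketch shows one possible approach. It assumes the instance exposes the headless delivery endpoint `/o/headless-delivery/v1.0/sites/{siteId}/blog-postings`, that each returned item carries `headline` and `keywords` fields, and that basic authentication is enabled. Treat all of these as assumptions to verify against your own instance. The function names, base URL, site ID, and credentials are placeholders.

```python
import base64
import json
import urllib.request
from typing import Any


def blog_postings_url(base_url: str, site_id: str) -> str:
    """Build the (assumed) headless delivery URL that lists a site's blog postings."""
    return f"{base_url.rstrip('/')}/o/headless-delivery/v1.0/sites/{site_id}/blog-postings"


def keywords_by_headline(payload: dict[str, Any]) -> dict[str, list[str]]:
    """Map each posting's headline to its tags (assumed to be in the `keywords` field)."""
    return {
        item.get("headline", ""): item.get("keywords", [])
        for item in payload.get("items", [])
    }


def fetch_keywords(base_url: str, site_id: str, user: str, password: str) -> dict[str, list[str]]:
    """Fetch the blog postings with basic authentication and return their tags."""
    request = urllib.request.Request(blog_postings_url(base_url, site_id))
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request.add_header("Authorization", f"Basic {token}")
    with urllib.request.urlopen(request) as response:
        return keywords_by_headline(json.load(response))


# Hypothetical payload, shaped like the assumed response.
SAMPLE_PAYLOAD = {
    "items": [
        {"headline": "Trip Report", "keywords": ["los angeles", "liferay"]},
        {"headline": "Empty Post"},
    ]
}

if __name__ == "__main__":
    print(keywords_by_headline(SAMPLE_PAYLOAD))
```

A call such as `fetch_keywords("http://localhost:8080", "20121", "test@liferay.com", "password")` uses placeholder values only; substitute your own host, site ID, and credentials.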