Data Tagging lets you organize your data quickly without modifying the original records.
Add Data Tagging to any node in your Dataflow. Select a node, click the Transform (+) button to open the list of options, and select Data Tagging. You can work with the node in the Dataflow tab or in the Properties panel.
{video}
The Data Tagging node is broken up into two sections:
You first need to choose what data type you would like to use as a tag. Click the dropdown under Create Column to select from the following list of data types:
For list and multiselect tags, type in the values for each option, separated by commas. Once your tag values are set up, type in a name for the new column.
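If you prepare your tag values outside the app, it can help to check how a comma-separated string breaks down into individual options. The sketch below is only an illustration in Python, not the app's own logic: the `parse_tag_values` helper is hypothetical, and trimming spaces and skipping empty or repeated entries are assumptions about how you might want to clean the list.

```python
def parse_tag_values(raw: str) -> list[str]:
    """Split a comma-separated string into a clean list of tag values."""
    values = []
    for part in raw.split(","):
        value = part.strip()  # assumption: surrounding spaces are not meaningful
        # Skip empty entries (for example, from a trailing comma) and duplicates.
        if value and value not in values:
            values.append(value)
    return values


# Example: four quarterly tags typed with inconsistent spacing.
tag_values = parse_tag_values("2014-Q1, 2014-Q2,2014-Q3 , 2014-Q4,")
print(tag_values)
```

Checking the list this way before pasting it in makes stray commas or duplicated options easy to spot.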
Here's a quick tutorial on how to do this:
To make data tagging easy, here are a few simple steps to keep in mind:
Before creating conditions to assign tags, you can enter a default value. If no condition is true for a record, the default tag is assigned to that record in the new column.
Use conditional statements to assign tags to records that meet a set of criteria. The best part is that your original data remains untouched.
In the tutorial above, the list data type is used to tag records by year and quarter (e.g., 2014-Q1). The video shows the first two conditions, which tag values in the year 2014 for Q1 and Q2.
In the first condition, we assign the tag 2014-Q1 when records match the following conditions:
Use the + Add Condition button to add conditions. Click the OR button to switch to an AND conditional statement.
Use the + Add Tag button to create new tags. To remove a tag, click the "x" button on the right, in line with the Tag Value.
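To see how a default value, tags, and AND/OR conditions fit together, here is a rough sketch of the same idea in plain Python. It is not how the Data Tagging node is implemented. The `order_date` field, the `period` column name, the "Other" default, the date ranges, and the rule that the first matching tag wins are all assumptions made for illustration.

```python
from datetime import date

# Hypothetical records; the "order_date" field is an assumption.
records = [
    {"id": 1, "order_date": date(2014, 2, 10)},
    {"id": 2, "order_date": date(2014, 5, 3)},
    {"id": 3, "order_date": date(2015, 1, 20)},
]

# Each tag has a value, a mode ("AND" or "OR"), and a list of conditions.
tags = [
    {"value": "2014-Q1", "mode": "AND", "conditions": [
        lambda r: r["order_date"] >= date(2014, 1, 1),
        lambda r: r["order_date"] <= date(2014, 3, 31),
    ]},
    {"value": "2014-Q2", "mode": "AND", "conditions": [
        lambda r: r["order_date"] >= date(2014, 4, 1),
        lambda r: r["order_date"] <= date(2014, 6, 30),
    ]},
]

DEFAULT_TAG = "Other"  # used when no tag's conditions are true


def assign_tag(record, tags, default):
    """Return the first tag whose conditions match, else the default."""
    for tag in tags:
        results = [condition(record) for condition in tag["conditions"]]
        if not results:
            continue  # a tag without conditions never matches
        matched = all(results) if tag["mode"] == "AND" else any(results)
        if matched:
            return tag["value"]
    return default


def tag_records(records, tags, default, column="period"):
    """Build new records with an extra tag column; originals stay untouched."""
    return [{**r, column: assign_tag(r, tags, default)} for r in records]


tagged = tag_records(records, tags, DEFAULT_TAG)
for row in tagged:
    print(row["id"], row["period"])
```

Because `tag_records` builds new dictionaries instead of editing the input, the original records keep their shape, which mirrors the non-destructive behavior described above.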
Absolutely. You can connect one or many Data Tagging nodes; each one creates a new column with its own set of data tags.