The Trim/Normalize Node is built to help you quickly clean your data pipelines. Like the Column Renamer, it is built to make maintaining data pipelines simple, not complicated, and above all easy to repeat.

AT TIMES WE NEED CAPITAL LETTERS! Perhaps you-have-a-lot-of-this-happening (special characters you don’t need).

then there are times we aren’t trying to scream, and perhaps lowercase is a requirement for user names or emails. okay, you’re in a good place. case sensitivity is here too. AlongWithTrimmingWhiteSpace.

ET1’s Trim/Normalize Node helps people quickly clean their data.

You can select more than one column to clean, or just choose 1 column to normalize.

The Trim/Normalize Node was created to help you quickly clean data pipelines and improve data quality across your data environment (a data environment might be a grouping of individual solutions that look and feel similar).

Whether you are cleaning dirty unstructured text for sentiment analysis, parsing HTML, or optimizing pipelines for data visualization, this node helps turn your pipelines into what some consider a piece of their overarching data governance.

Using the Trim/Normalize Node in ET1

Using this node is easy and intuitive. Checkboxes, drop-downs, and nothing crazy.

Connect data downstream to your node, adjust the settings, and keep solving.

  1. Connect data
  2. Choose column(s)
  3. Decide to trim ends – space(s) on the left and right only
  4. Decide to remove whitespace – any and all space(s)
  5. Decide to remove special characters – any special characters, including spaces
  6. Choose the case sensitivity
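
To make the options above concrete, here is a minimal Python sketch of what each setting might do to a single text value. It is not ET1's actual implementation: the function name `normalize_text`, its parameters, the order in which the options are applied, and the rule that "special characters" means anything other than ASCII letters and digits are all assumptions made for illustration.

```python
import re


def normalize_text(value, trim_ends=False, remove_whitespace=False,
                   remove_special=False, case=None):
    """Apply trim/normalize style options to one string.

    Hypothetical helper for illustration only; not ET1's code.
    """
    if trim_ends:
        # Strip whitespace on the left and right only; inner spaces stay.
        value = value.strip()
    if remove_whitespace:
        # Remove every whitespace character, wherever it appears.
        value = re.sub(r"\s+", "", value)
    if remove_special:
        # Assumption: "special" means anything that is not an ASCII
        # letter or digit, so spaces and punctuation are removed too.
        value = re.sub(r"[^A-Za-z0-9]", "", value)
    if case == "upper":
        value = value.upper()
    elif case == "lower":
        value = value.lower()
    return value


print(normalize_text("  User@Example.COM ", trim_ends=True, case="lower"))
print(normalize_text("you-have-a-lot-of-this", remove_special=True, case="upper"))
```

Under these assumptions the first call prints `user@example.com` and the second prints `YOUHAVEALOTOFTHIS`. Selecting more than one column would simply mean applying the same function to each chosen column in every row.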

Real-world use case: Trim/Normalize Node

In this example we are receiving a file from an end user who needs help capitalizing everything in the Address column.

Someone sends us this CSV. We open it with the CSV Input Node in ET1, then we want to trim/normalize.

Supplier_ID,Supplier_Name,Address,Email
SUP001,Supplier X,123 Main Street|Suite 100|Anytown|CA 90210,supplierx@example.com
SUP002,Supplier Y,456 Oak Avenue|Building B|Sometown|NY 10001,suppliery@example.com
SUP003,Supplier Z,789 Pine Road|Floor 3|Othercity|TX 75001,supplierz@example.com

We are going to add trim ends, in case future data has padded spaces (thinking ahead), and swap the case to upper to follow internal best practices.

Upper case for Address fits this user’s current data strategy. Their reasoning: some data inputs are not automatically converted to uppercase when the software writes to the database, and the software engineers don’t have time to optimize this part of the software.
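
For readers who want to see the effect of this use case outside of ET1, the following Python sketch applies trim ends plus upper case to the Address column of the sample file above. It only approximates the node's behavior; the helper name `trim_and_upper` is hypothetical, and the CSV is embedded as a string so the example is self-contained.

```python
import csv
import io

# The sample file from the use case, embedded so the sketch runs on its own.
CSV_TEXT = """Supplier_ID,Supplier_Name,Address,Email
SUP001,Supplier X,123 Main Street|Suite 100|Anytown|CA 90210,supplierx@example.com
SUP002,Supplier Y,456 Oak Avenue|Building B|Sometown|NY 10001,suppliery@example.com
SUP003,Supplier Z,789 Pine Road|Floor 3|Othercity|TX 75001,supplierz@example.com
"""


def trim_and_upper(rows, columns):
    """Trim both ends and upper-case the chosen columns.

    Hypothetical helper; columns not listed pass through unchanged.
    """
    cleaned = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are not modified
        for col in columns:
            new_row[col] = new_row[col].strip().upper()
        cleaned.append(new_row)
    return cleaned


rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))
cleaned = trim_and_upper(rows, ["Address"])

for row in cleaned:
    print(row["Supplier_ID"], row["Address"])
```

The first printed line is `SUP001 123 MAIN STREET|SUITE 100|ANYTOWN|CA 90210`. The Email column is left as-is because only Address was selected, which mirrors choosing a single column in the node.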

Thanks for learning more about ET1. If you have any questions, please contact us.

We would love to hear about your use cases.

Return to ET1 Overview to learn more.