Parser Transformation
Parsing refers to an activity in which a system analyzes information and breaks it down into
meaningful chunks. This Data Quality transformation does exactly that.
IDQ parser is meant to identify one or more data elements in an input field and to write each element to a
different output field.
Parsing gives you greater control over the information in each column. For example:
- A source file name encodes a generic file name, date, country, and option type (say,
  coseorder_xxxxsecurity.20121016113056.EU.CNC.dat), and you may have to preserve each of these as
  an independent column in the audit.
- A data field contains a person's full name, such as William Shakespeare. You can use the Parser
  transformation to split the full name into separate data columns for the first name and last name.
- Any data field whose structure can be described by a regular expression. VAT numbers, SSNs, PANs,
  credit card numbers, and so on are all valid candidates.
After the data is parsed into new columns, one can create custom data quality operations for each column.
There are primarily two types of Parser transformation.
Custom token sets or expressions: this type parses data that matches reference table entries or custom
regular expressions that you enter. The first example above (the source file name) falls into this category.
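To make the idea of matching against reference table entries or custom regular expressions concrete, here is a minimal Python sketch. It is only an illustration of the concept, not how IDQ implements it. The reference table contents, the timestamp pattern, the output names, and the function parse_tokens are all hypothetical.

```python
import re

# Hypothetical reference table of country codes (not taken from the source).
COUNTRY_CODES = {"EU", "US", "IN"}
# Hypothetical custom regular expression: a 14-digit timestamp.
TIMESTAMP = re.compile(r"\d{14}")


def parse_tokens(value):
    """Route each whitespace-separated token to an output field."""
    outputs = {"country": None, "timestamp": None, "unparsed": []}
    for token in value.split():
        if outputs["country"] is None and token in COUNTRY_CODES:
            # Token matches a reference table entry.
            outputs["country"] = token
        elif outputs["timestamp"] is None and TIMESTAMP.fullmatch(token):
            # Token matches the custom regular expression.
            outputs["timestamp"] = token
        else:
            # Anything unrecognized is kept aside rather than dropped.
            outputs["unparsed"].append(token)
    return outputs
```

Each recognized token lands in its own output, and anything that matches neither rule is collected separately so that no input is silently lost.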
R1. The full name is to be parsed into first and last names. Do not process the record if the full name
format is anything other than First_Name Last_Name; such records need to be written to the reject file.
R2. The file name is to be parsed, and the data should be preserved in the audit file.
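The logic behind R1 can be sketched outside IDQ in a few lines of Python. This is a minimal sketch, assuming a name part consists of letters only and that exactly one space separates the two parts. The pattern and the function split_full_names are hypothetical and stand in for whatever rule the transformation is actually configured with.

```python
import re

# Assumption: a valid full name is exactly two alphabetic words
# separated by a single space (First_Name Last_Name).
FULL_NAME = re.compile(r"([A-Za-z]+) ([A-Za-z]+)")


def split_full_names(names):
    """Return (parsed, rejected) for a list of full-name strings."""
    parsed, rejected = [], []
    for name in names:
        match = FULL_NAME.fullmatch(name.strip())
        if match:
            parsed.append(
                {"first_name": match.group(1), "last_name": match.group(2)}
            )
        else:
            # Any other format goes to the reject list unprocessed.
            rejected.append(name)
    return parsed, rejected
```

Given William Shakespeare, the sketch yields one parsed record, while a single-word name or a three-part name ends up in the reject list.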
Select Regular Expression and click Choose (this opens the Regex editor window).
Set Number of outputs to 5.
Click Finish.
Done.
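For R2, a regular expression with five capture groups could look like the Python sketch below. It is a hedged illustration, not the expression used in the original setup. In particular, the text names only four parts of the file name (generic file name, date, country, option type), so treating the file extension as the fifth output is an assumption, as are the group names and the function parse_file_name.

```python
import re

# Five groups, one per output. The fifth (extension) is an assumption.
FILE_NAME = re.compile(
    r"(?P<generic_name>[^.]+)"         # e.g. coseorder_xxxxsecurity
    r"\.(?P<timestamp>\d{14})"         # e.g. 20121016113056
    r"\.(?P<country>[A-Z]{2})"         # e.g. EU
    r"\.(?P<option_type>[A-Z]+)"       # e.g. CNC
    r"\.(?P<extension>[A-Za-z0-9]+)"   # e.g. dat
)


def parse_file_name(file_name):
    """Return the five parts as a dict, or None if the name does not match."""
    match = FILE_NAME.fullmatch(file_name)
    return match.groupdict() if match else None
```

Calling parse_file_name on coseorder_xxxxsecurity.20121016113056.EU.CNC.dat returns the five parts keyed by group name, which can then be written to the audit file as independent columns.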