Data Masking Components: City Names
The purpose of the component is to mask values of city names with other realistic substituted values of the city names. The component contains about 700 possible substitution values. City names are the part of the PHI data requirements for all geographic subdivisions smaller than a state of 18 data elements per HIPAA'a Safe Harbor.
Variation of Random Substitution data masking algorithm.
|City Name data masking component takes the field value of city name in one of the character data types, both ASCII and unicode (char, nchar, varchar, nvarchar, text, ntext) as an input and produces a city name in the same format as an output. The output value of the City Name data masking component is random. It does not guarantee uniqueness of the names that are produced as the result of the application of the algorithm. City Name can mask at the same time more than one column.|
|1. Configure a source that contains the column with the element of city name. A source could be connected to any database, including but not limited to SQL Server , Oracle, Sybase, mySQL, as well as a file.|
|2. Drag and Drop "Masking City Name" data masking component, then connect the source and the "Masking City Name" data masking component with the data flow path:|
|3. Now, the data flow path (the blue arrow) passes proper meta-data to the "Masking City Name" data masking component. If you click on the data flow path, you will see:|
|4. Now that the metadata for the "Masking City Name" component exists, and values are passed into the data masking component, please open the component editor:|
|5. In the second tab, there are input columns. Please check-mark only the column(s) that you will be masking with the City Name algorithm:|
|6. This will create an extra column(s) with the prefix “Masked_”.|
|7. Create a connection manager for the destination and configure source component for the destination. In the connection manager, in the tab “Mappings”, specify that you want newly created Field(s)_Masked to be a field(s) replacing the original value. For that, just click on the available input columns, choose the masked value, and map to the “Available Destination Columns”|
|8. Now, all the configurations are complete for the valid values. You can run the package with the City Name Data Masking component, and see the results of data masking:|
|9. If, however, there are invalid values in the package's source, one would need to configure error handling. Invalid values are those that are not conforming to the rules of the entity. To handle invalid values, each data masking component has error handling precedence constraint. One needs to create error destination connection and connect red arrow (error handling constraint) with this destination. As the connection is made, one needs to configure the state of failure: “Fail”,”Ignore” or “Redirect”.|
|10. It is recommended that one re-directs the output into the error destination, so that later one be able to analyze and process data for quality purposes.||>|