Not logged in - Login


R
e
q
u
e
s
t

a

d
e
m
o
< back

Row Vs. Column Integrity

{TOC} TEXT

ROW VS. COLUMN INTEGRITY

Data exists in multiple databases and potentially multiple columns within those databases. In order to mask data, those columns need to be identified.

THE RELATIONSHIP BETWEEN ROWS AND COLUMNS

The content of columns vary according to the context of the row. For example, the data contained in the Address row will be vastly different to the data contained in the Phone Number row. Similarly, the data in one row may directly relate to the content of another, for example the content the Title column will directly correlate to the data in the Name column.

It is for this reason that no single row can be masked or de-identified without first taking into consideration the content of the other rows. A masking tool will need to examine a table of columns and rows in order to correctly change values in a row to maintain column integrity.

It is important to note that if you alter unique identifiers within a column, there is a risk of affecting the referential integrity of the affected column as well as the database as a whole. A unique identifier is also known as a primary key.

CREATING TEST SETS FROM SCRATCH**

Random components allow you to create test sets when the data in production does not exist. In text-based components such as names, you can simply provide a “dummy” value in the necessary fields, such as “John” or “abc” – the component will create a random set of values.

In this case, use a SQL query to create the templated value. We suggest the following:

● Use any value for input for the strings type components.

● Cast or convert it into one of the character data types with the predefined length, i.e.

varchar(10) or nvarchar(10).

● Create an Alias Name (as shown in our example “AS FirstName”) for the column so that metadata created by the source component will have a name for this column.

Please note that not assigning the type to the column in SQL might create issues when converting to the expected data type.

Download a Trial