Skip to content
English
  • There are no suggestions because the search field is empty.

6. Create scanning rules

See how to create and edit data lists, configure and alter property values and create scanning submit forms to set your scan rules. This section includes Exercise 6.

To build the scanning rules that the profiling will work with: 

  1. Click on ‘Create Category List’

    By default, the Curiosity Platform comes with a set of rules that will be used as a starting point. By  attaching these to the activity they are exposed, allowing customisation if desired.

  2. Click ‘OK’ when you’re ready, and the data lists will be created against the activity. Here’s an example: 

    If you click on one of the lists you can view the categories of data we will scan for. Below is an example of what you’ll see when you click:

    Each list will provide a different type of search type, from RegEx patterns analysis to specific data.

    ‘Data Lists’ are searchable and editable from the left-hand menu. 

  3. Click on one of the lists to view and alter the types of records that are being searched. Below you can see the regular expression being used: 

  4. Now it’s time to customise the property values. To do this, first click on the 'Configuration' tab.

  5. The 'Property' column will show some of the things you can customise. For example: including views to be scanned, counting rows in tables or finding distinct values. The default parameters are optimally configured, but alter them as your requirements need. Click ‘Edit’.

    You can also toggle values on and off. Click ‘OK’ when finished. 

     

  6. From the Data Activity you will now create a Data Scanning Submit Form, which will let you run the job.

    Click on the ‘Data Scanning Submit Form’ action.

  7.  The form requires a Name Group 
  8. The group can be an existing group from the ‘Self-service Data’page or a new group.

    If you are updating an existing process, pick it from the bottom drop down list.

    Click ‘Execute’ when ready.

Exercise 6

  1. Create the starter lists
  2. For the Regex Data list try adding an additional rule
  3. For the Regex Column list try adding an additional rule

Need help or want to check your work? Check the solution video here.

Proceed to Section 7 - Run the data profiling activity >