Self-led training Module 1 - Profiling and Discovery - Introduction
Profiling and discovery with Curiosity's Enterprise Test Data® Platform
Introduction
The Curiosity Platform provides a corporate dictionary that helps users understand their data across the whole data ecosystem. At the heart of this dictionary are definitions, which track and store detailed information about the databases and files you work with. The platform can also profile and deep-scan your data to populate the corporate dictionary.
In this module, we’ll walk through setting up the corporate dictionary and configuring scans to populate it with meaningful data that you can then leverage in your data activities and test data pipelines.
Training Overview:
This training course takes you through setting up a connection to your data source and scanning your data. It also covers how to set up a profiling activity and how to leverage our AI technology to take your profiling to the next level.
By the end of this self-led training, you will be able to do the following:
- Create a connection to a database server
- Scan a database
- Create a definition from the scan
- Set up a Scan Data Activity
- Use AI to profile your data and set the profiling up as a regular task
Key capabilities:
- Catalogue: Automatically map, catalogue, and visualise all data assets across your organisation.
- Sensitive data identification: Detect and classify sensitive data such as Personally Identifiable Information and Protected Health Information. Understand where your most critical data is stored and how it’s being used to ensure compliance with regulations.
- Relationship Mapping: Uncover how your data assets are interconnected. Automatically map relationships between tables, fields, and systems, providing a complete picture of how data flows through your organisation.
- Data gap analysis: Identify gaps and missing data elements that could hinder your analytics, testing, or compliance strategies. Our platform provides detailed reports and synthetic data generation capabilities to ensure that every scenario is accounted for.
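To make the idea of profiling and sensitive data identification concrete, here is a minimal sketch in Python. It is not the Curiosity Platform's implementation or API: the `profile_table` function, the two regex patterns, and the `customers` table are all hypothetical, chosen only to illustrate what a column-level profile with a simple sensitive-data flag might look like.

```python
import re
import sqlite3

# Hypothetical patterns for illustration only; real detection rules are far broader.
PII_PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "us_ssn": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
}


def profile_table(conn, table):
    """Return per-column row count, null count, distinct count, and a PII guess."""
    columns = [row[1] for row in conn.execute(f'PRAGMA table_info("{table}")')]
    profile = {}
    for col in columns:
        values = [r[0] for r in conn.execute(f'SELECT "{col}" FROM "{table}"')]
        non_null = [str(v) for v in values if v is not None]
        # Flag a column only when every non-null value matches one pattern.
        pii = next(
            (name for name, pat in PII_PATTERNS.items()
             if non_null and all(pat.match(v) for v in non_null)),
            None,
        )
        profile[col] = {
            "rows": len(values),
            "nulls": len(values) - len(non_null),
            "distinct": len(set(non_null)),
            "pii": pii,
        }
    return profile


def demo():
    # Small in-memory table standing in for a scanned data source (assumed data).
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (id INTEGER, email TEXT, ssn TEXT)")
    conn.executemany(
        "INSERT INTO customers VALUES (?, ?, ?)",
        [(1, "ann@example.com", "123-45-6789"),
         (2, "bob@example.com", None),
         (3, "ann@example.com", "987-65-4321")],
    )
    return profile_table(conn, "customers")


if __name__ == "__main__":
    for column, stats in demo().items():
        print(column, stats)
```

The null and distinct counts hint at the kind of gaps a data gap analysis might surface, while the pattern match is a deliberately naive stand-in for sensitive data classification.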
Pre-requisites for profiling and discovery training:
- Access to the Curiosity Platform
- Connectivity between Curiosity and your data source
You can follow this high-level process diagram for setting up a connection and database scan activity:
Before you begin:
- Please download a copy of the Data Profiling and Discovery self-led training guide, which will act as your workbook for the training.
- Confirm your progress in the training form and submit it at the end of your training to ensure you receive your certificate of completion. The training form is here.
This training module should take approximately 1 day to complete. It is structured into 7 key sections, each followed by exercises based on the information you've just read. The sections should be completed in order.
You can find the solutions to each exercise in the 'Exercise Solutions' page of this training course.
Proceed to Section 1 - Set up a server and database connections >