Skip to the main content.

Curiosity Modeller

Design Complex Systems, Create Visual Models, Collaborate on Requirements, Eradicate Bugs and Deliver Quality! 

Product Overview Solutions
Success Stories Integrations
Book a Demo Release Notes
Free Trial Brochure
Pricing  

Enterprise Test Data

Stream Complete and Compliant Test Data On-Demand, Removing Bottlenecks and Boosting Coverage!

Explore Curiosity's Solutions

Our innovative solutions help you deliver quality software earlier, and at less cost!

robot-excited copy-1              AI Accelerated Quality              Scalable AI accelerated test creation for improved quality and faster software delivery.

palette copy-1                      Test Case Design                Generate the smallest set of test cases needed to test complex systems.

database-arrow-right copy-3          Data Subsetting & Cloning      Extract the smallest data sets needed for referential integrity and coverage.

cloud-cog copy                  API Test Automation              Make complex API testing simple, using a visual approach to generate rigorous API tests.

plus-box-multiple copy-1         Synthetic Data Generation             Generate complete and compliant synthetic data on-demand for every scenario.

file-find copy-1                                     Data Allocation                  Automatically find and make data for every possible test, testing continuously and in parallel.

sitemap copy-1                Requirements Modelling          Model complex systems and requirements as complete flowcharts in-sprint.

lock copy-1                                 Data Masking                            Identify and mask sensitive information across databases and files.

database-sync copy-2                   Legacy TDM Replacement        Move to a modern test data solution with cutting-edge capabilities.

Explore Curiosity's Resources

See how we empower customer success, watch our latest webinars, read our newest eBooks and more.

video-vintage copy                                      Webinars                                Register for upcoming events, and watch our latest on-demand webinars.

radio copy                                   Podcasts                                  Listen to the latest episode of the Why Didn't You Test That? Podcast and more.

notebook copy                                           eBooks                                Download our latest research papers and solutions briefs.

calendar copy                                       Events                                          Join the Curiosity team in person or virtually at our upcoming events and conferences.

book-open-page-variant copy                                          Blog                                        Discover software quality trends and thought leadership brought to you by the Curiosity team.

face-agent copy                               Help & Support                            Find a solution, request expert support and contact Curiosity. 

bookmark-check copy                            Success Stories                            Learn how our customers found success with Curiosity's Modeller and Enterprise Test Data.

file-document-multiple (1) copy                                 Documentation                            Get started with the Curiosity Platform, discover our learning portal and find solutions. 

connection copy                                  Integrations                              Explore Modeller's wide range of connections and integrations.

Better Software, Faster Delivery!

Curiosity are your partners for designing and building complex systems in short sprints!

account-supervisor copy                            Meet Our Team                          Meet our team of world leading experts in software quality and test data.

calendar-month copy                                         Our History                                Explore Curiosity's long history of creating market-defining solutions and success.

check-decagram copy                                       Our Mission                                Discover how we aim to revolutionize the quality and speed of software delivery.

handshake copy                            Our Partners                            Learn about our partners and how we can help you solve your software delivery challenges.

account-tie-woman copy                                        Careers                                    Join our growing team of industry veterans, experts, innovators and specialists. 

typewriter copy                             Press Releases                          Read the latest Curiosity news and company updates.

bookmark-check copy                            Success Stories                          Learn how our customers found success with Curiosity's Modeller and Enterprise Test Data.

book-open-page-variant copy                                                  Blog                                                Discover software quality trends and thought leadership brought to you by the Curiosity team.

phone-classic copy                                      Contact Us                                           Get in touch with a Curiosity expert or leave us a message.

7 min read

8 Criteria for a Modern Test Data Solution

8 Criteria for a Modern Test Data Solution

In 2023, (test) data availability, quality, and compliance risks remain a major headache for software development.

Parallel teams, pipelines, and frameworks remain dependent on outdated Test Data Management (TDM) practices, particularly those focused on copying unwieldy data sets periodically to non-production. These obsolete TDM practices are incapable of supplying compliant data of sufficient variety, at the speeds demanded by “agile”, DevOps and CI/CD.

These perennial TDM challenges are not going to go away in a hurry; in fact, several trends are set to add demand for test data of growing complexity, volume, and variety. This article explores several of these trends, deriving 8 criteria for designing an effective, modern test data solution.

Overall, these criteria point to one broad trend: The need to shift focus from Test Data “Management” and copying data, to Test Data Automation that streams data on-the-fly. In particular, this article calls for a move to Test Data Automation that is real-time and event-based.

Modernising test data in this way shifts the paradigm from slow and manual “provisioning”, to parallel teams and frameworks streaming rich data on-the-fly. To learn more about this bottleneck-free approach to test data, check our Curiosity’s Test Data-as-a-Service Solution Brief.

Download the Solution Brief

1.   Enable “Agile”, DevOps and CI/CD

In an era of “agile”, DevOps, and automated pipelines, any manual intervention to make or supply test data is simply too slow. It cannot scale to meet the sheer volume and variety of data needed, as ever-changing tests are executed at speed by automated CI/CD pipelines.

Copying data periodically to non-production presents a particularly clear mismatch with the speed of system evolution promised by greater agility, DevOps and CI/CD:

Test Data Bottlenecks in DevOps Pipelines

Even today, it can take organisations weeks or months to refresh data in a non-production environment. This is often performed using a combination of manual processes, scripts and outdated tooling. With such wait times, how can test data provisioning provide up-to-date test data for a system that is updated in days, hours, and minutes?

Once data is available in non-production environments, too much time is then spent finding, making, and hacking data for diverse and fast-evolving tests. Even supposedly “on demand” approaches to test data provisioning tend to be brittle as tests and environments change, as they require re-configuration for different scenarios and environments.

Criteria #1 of a modern test Data solution:

To achieve rigorous testing at speed, a test data solution must make accurately matched data available “just in time” as different tests are executed within CI/CD pipelines.

2.   Support for Complex, Hybrid Architectures

Any modern test data solution must provide integrated data that’s more complex than ever, in parallel and at speed.

Today, test data must link seamlessly across diverse technologies in evolving hybrid architectures. While 70% of organisations have at least 2 database systems [1], test data must further link across messages, APIs, cloud-based components, mainframe systems, applications, and more.

Legacy systems do not go away as new technologies are adopted, and test data must reflect the co-existence of the old and new technologies. This must include the technologies used today in development, including containerisation and microservices architecture:

Interrelated test data for hybrid architectures

At its core, end-to-end testing today involves firing data into a system, and measuring its impact as it flows through integrated technologies. Test cases have converged with test data. If test data does not link consistently across integrated technologies, tests will fail even if there is no genuine defect. But then, what’s the value of running the tests?

Criteria #2 of a modern test data solution:

A robust test data solution today must be capable of connecting into numerous source technologies, and pushing data to a range of different technologies. It must also retain referential integrity as it manipulates or creates cross-system data, including during data masking, generation, cloning, and beyond.

3.   Acceleration of High-Investment Transformation Projects

Enterprises today are engaged in a range of high-investment, long-term transformation projects. This often includes (cloud) migrations and DevOps modernisation, along with renewed attempts to boost agility in software development.

These projects are not only high-value and high-cost; they also add to the demand for rich test data.

Any test data solution must support and accelerate these transformation projects, and not drag behind and block it. Let’s consider two examples:

  1. If parallelised development teams have adopted Kubernetes and containers as part of a DevOps modernisation, then any test data solution must push the right data to containerized databases and clusters on demand.
  2. If an organisation is migrating a Mainframe System to cloud-based architecture, then a test data solution must be capable to producing rich and compliant data in parallel for both the legacy and migrated systems.

Test data can introduce negative risk to these costly and lengthy projects, which already have an uncomfortably high failure rate. These risks include gaps in test data coverage and a lack of support for new tools and techniques, as well as compliance risks and test data bottlenecks.

Criteria #3 of a modern test data solution:

A test data solution must provide rich and compliant data at speed for modern transformation projects, even as these projects at to the demand and complexity of test data.

4.   Affordability in the Face of Massive Data Growth

The sheer volumes of data produced by organisations continue to grow at astonishing rates: 1/10 of data science and engineering professionals in North America reported in 2021 that data volumes at their organisation grow by over 100% per month. The average monthly data growth is a whopping 63% [2].

This growth reflects trends like cloud migrations, as well as the adoption of new tools and technologies. Add in the demand for parallelised non-production data, and the scope for runaway test data infrastructure costs becomes clear.

Making large, physical copies of complex production data for parallel teams, frameworks and environments is simply not viable today. It can also present a compliance risk, given rules around “data minimisation” and purpose limitation.

Criteria #4 of a modern test data solution:

A test data solution must continuously manage growing data volumes, providing optimised and affordable data for parallelised testing and development. Techniques like subsetting, cloning, and “covered” test data generation help by only creating as much data as needed in testing. Data Virtualisation furthermore copies data in parallel, at a fraction of the time and cost of making physical copies.

5.   Evolving Privacy Legislation

Data Privacy legislation continues to grow more stringent and complex globally. This can add risk to outdated TDM practices, particularly those that copy (raw) production data to less-secure environments.

A test data solution must be compliant with evolving global privacy legislation. This might include requirements like demonstrating legitimate grounds for data processing in non-production, while showing that only as much data as needed is being used to fulfil those grounds.

Organisations might further need to demonstrate that only as many people as necessary have processed data, and might furthermore need to delete or copy every instance of a person’s data “without delay”.

Criteria #5 of a modern test data solution:

A test data solution must be compliant with evolving global privacy legislation.

Minimising the use of sensitive data should be a priority, given the risk and complexity of complying with rules for using production data in non-production. A combination of masking and synthetic data generation enables a shift away from production data, in time creating simulated versions of production for testing and development.

6.   Centralised-but-Democratised Test Data

Organisations today require greater visibility and control over their data and infrastructure, to maximise efficiency and ensure compliance. However, this centralisation cannot come at the cost of introducing blockers to teams who require continuous data access.

A test data solution should emphasise reusability, centralising and distributing core competencies. Processes set up by a small team of skilled test data engineers should be made reusable by parallel teams and frameworks. This provides sufficient centralisation, while enabling teams and frameworks to parametrise and trigger processes on-the-fly:

Test Data-as-a-Service

Criteria #6 of a modern test data solution:

A test data solution should centralise skills and processes, while making configurable processes reusable on demand by parallel teams and frameworks. Test data jobs should be automated and parameterizable. They should be exposed to human and automated requesters on demand, for instance via self-service forms, API calls, and functions embedded in test automation frameworks:

Automated Test Data Find and Makes

7.   Parallelisation and Shorter Release Cycles

As organisations seek to shorten release cycles, demand has increased for rapidly changing data. A test data solution must make up-to-date data available to a growing number of parallel teams and frameworks, during sprints that are becoming increasingly short. Test data must furthermore be available for several different system versions at the same time, reflecting the parallelisation of modern development practices:

Test Data Automation for Parallel Teams and Frameworks

Criteria #7 of a modern test data solution:

A test data solution must make versioned data available in parallel, at ever-faster speeds, and to a growing number of parallel data requesters.

8.   Automated Data Requesters

Test Data must furthermore be made available to automated data requesters. This includes test automation frameworks and CI/CD pipelines, which are less forgiving than humans.

If data’s incomplete or inaccurate, a human tester might process and adjust the data manually. By contrast, an automated test is likely to fail or deliver a false positive if data is out-of-date, misaligned, or mismatched.

Automated tests furthermore increase demand for parallelisation. Test automation frameworks might run high volumes of parallelised tests. If two tests require the same data, they require this data combination in parallel. One test can furthermore not consume or edit data that will lead other tests to fail.

The pace of testing enabled by automation and CI/CD further increases the demand for data, as different tests are executed at speeds unimaginable with manual testing.

Criteria #8 of a modern test data solution:

A test data solution must make data available on demand to parallelised automated tests, even as evolving scenarios are executed at speed.

Automating Test Data

This article has now identified 8 criteria for a modern test data solution, drawing on observations regarding the nature of software delivery today. To summarise, a modern test data solution must:

  1. Enable “Agile”, DevOps, and CI/CD.
  2. Seamlessly support complex, hybrid architectures.
  3. Accelerate high-investment transformation projects.
  4. Continuously manage exponential data growth.
  5. Comply with evolving privacy legislation.
  6. Centralise, but distribute, test data skills and processes.
  7. Make versioned data in parallel, faster than ever, and to a growing number of requesters.
  8. Make data available on demand to automated requesters like test automation frameworks and CI/CD pipelines.

“Managing” or copying data periodically to non-production environments will not suffice in the face of these criteria. Meeting the demand for data today instead requires Test Data Automation. A test data solution must not only offer all the techniques needed for finding, anonymising, making, and allocating data; these same processes must be automated, reusable, and parameterizable on-the-fly.

This in turn provides Test Data-as-a-Service, allocating rich data “just in time” as testers, automated tests, and developers seamlessly trigger the reusable processes. Instead of a bottleneck, test data then becomes an accelerator of rapid testing and development. To learn more about this bottleneck-free approach to test data, check our Curiosity’s Test Data-as-a-Service Solution Brief.

Download the Solution Brief

References:

[1] Redgate (2021), Ten insights from the 2021 State of Database DevOps. Retrieved from https://www.red-gate.com/solutions/database-devops/entrypage/report-2021-infographic on 28/06/2021.

[2] Matillion (2022), Matillion and IDG Survey: Data Growth is Real, and 3 Other Key Findings. Retrieved from https://www.matillion.com/resources/blog/matillion-and-idg-survey-data-growth-is-real-and-3-other-key-findings on 28/06/2022.

We Need to Talk About Test Data “Strategy”

We Need to Talk About Test Data “Strategy”

For many organisations, test data “best practices” start and end with compliance. This reflects a tendency to focus on the problem immediately in...

Read More
Test Data is make or break for parallel testing and development

Test Data is make or break for parallel testing and development

Today, there is a greater-than-ever need for parallelisation in testing and development. “Agile” and iterative delivery practices hinge on teams...

Read More
The Democratisation of (Test) Data

The Democratisation of (Test) Data

A glance at industry research from recent years shows that test data remains one of the major bottlenecks to fix in DevOps and CI/CD:

Read More
Removing Quality Bottlenecks in CI/CD and DevOps

Removing Quality Bottlenecks in CI/CD and DevOps

Curiosity often discuss barriers to “in-sprint testing”, focusing on techniques for reliably releasing fast-changing systems. These solutions...

Read More
If testing was a race, data would win every time

If testing was a race, data would win every time

Okay, so that title doesn’t make complete sense. However, if you read to the end of this article, all will become clear. I’m first going to discuss...

Read More
5 Test Data Challenges That Every CTO Should Know About

5 Test Data Challenges That Every CTO Should Know About

At Curiosity, we talk about test data extensively, because we believe test data is repeatedly neglected in testing and development discussions....

Read More
Is test data the engineering problem to solve in 2024?

Is test data the engineering problem to solve in 2024?

It’s 2024 and the risks associated with poor test data practices show no signs of abating.

Read More
Quality Testing Requires Quality Data

Quality Testing Requires Quality Data

My two most recent blogs have made the case for a new TDM paradigm called “Test Data Automation”. The first article considered how a logistical...

Read More
Automate Test Data Bottlenecks out of CI/CD and DevOps - Infographic

Automate Test Data Bottlenecks out of CI/CD and DevOps - Infographic

Discover how Test Data Automation can help you automate your test data management by reading the infographic below! Curiosity's Test Data Automation...

Read More