Perfect Pet synthetic relational database instances - version2

Description

Database instances generated and polluted using the Perfect Pet open-source software (https://github.com/mathildemarcy/perfect_pet).

These databases contain instances of the Perfect Pet database of different sizes, polluted with various artificial unicity factors.

  • file perfect_pet_5000.sql includes schemas clean_db and polluted_db_100 (polluted with a factor of 100%).
  • file perfect_pet_10000.sql includes schemas clean_db, polluted_db_au_25polluted_db_au_50polluted_db_au_75, and polluted_db_au_100 (polluted with factors of 25%, 50%, 75%, and 100%).
  • file perfect_pet_25000.sql includes schemas clean_db and polluted_db_100 (polluted with a factor of 100%).
  • file perfect_pet_50000.sql includes schemas clean_db and polluted_db_100 (polluted with a factor of 100%).
  • file perfect_pet_100000.sql includes schemas clean_db, polluted_db_au_25polluted_db_au_50polluted_db_au_75, and polluted_db_au_100 (polluted with factors of 25%, 50%, 75%, and 100%).
  • file perfect_pet_250000.sql includes schemas clean_db, polluted_db_au_25polluted_db_au_50polluted_db_au_75, and polluted_db_au_100 (polluted with factors of 25%, 50%, 75%, and 100%).

The clean schemas contain 13 relations: animal, animal_owner, animal_weight, appointment, appointment_service, appointment_slot, doctor, doctor_historization, microchip, microchip_code, owner, service, slot.

The polluted schemas contain 9 relations: animal, appointment, appointment_slot, doctor, microchip, microchip_code, owner, service, slot.

More information on these databases and their generation and pollution is available at https://github.com/mathildemarcy/perfect_pet.

Download instructions

The files contain full dumps of PostgreSQL databases.

Download from
Licence
Publication date
18/06/2025
Author(s)
Mathilde MARCY, Jean-Marc PETIT
Version
version2
Dataset size
9Go (6 files)