This collection provides database instances generated and polluted using the Perfect Pet open-source software (https://github.com/mathildemarcy/perfect_pet).
Each file holds an instance of the Perfect Pet database of a different size, polluted with one or more artificial unicity factors.
- file perfect_pet_5000.sql includes schemas clean_db and polluted_db_100 (polluted with a factor of 100%).
- file perfect_pet_10000.sql includes schemas clean_db, polluted_db_au_25, polluted_db_au_50, polluted_db_au_75, and polluted_db_au_100 (polluted with factors of 25%, 50%, 75%, and 100%).
- file perfect_pet_25000.sql includes schemas clean_db and polluted_db_100 (polluted with a factor of 100%).
- file perfect_pet_50000.sql includes schemas clean_db and polluted_db_100 (polluted with a factor of 100%).
- file perfect_pet_100000.sql includes schemas clean_db, polluted_db_au_25, polluted_db_au_50, polluted_db_au_75, and polluted_db_au_100 (polluted with factors of 25%, 50%, 75%, and 100%).
- file perfect_pet_250000.sql includes schemas clean_db, polluted_db_au_25, polluted_db_au_50, polluted_db_au_75, and polluted_db_au_100 (polluted with factors of 25%, 50%, 75%, and 100%).
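For scripted use, the file list above can be captured as a small lookup table. The following is a minimal sketch in Python: the file and schema names are copied as written in the list, while the names SCHEMAS_BY_FILE and files_with_factor are hypothetical helpers introduced only for illustration.

```python
# Schema names are copied verbatim from the file list above.
FULL_SET = [
    "clean_db",
    "polluted_db_au_25",
    "polluted_db_au_50",
    "polluted_db_au_75",
    "polluted_db_au_100",
]
SINGLE_SET = ["clean_db", "polluted_db_100"]

# Hypothetical lookup table: dump file -> schemas it includes.
SCHEMAS_BY_FILE = {
    "perfect_pet_5000.sql": SINGLE_SET,
    "perfect_pet_10000.sql": FULL_SET,
    "perfect_pet_25000.sql": SINGLE_SET,
    "perfect_pet_50000.sql": SINGLE_SET,
    "perfect_pet_100000.sql": FULL_SET,
    "perfect_pet_250000.sql": FULL_SET,
}


def files_with_factor(percent):
    """Return the dump files that include a schema polluted with the given factor."""
    suffix = f"_{percent}"
    return [
        dump_file
        for dump_file, schemas in SCHEMAS_BY_FILE.items()
        if any(name.endswith(suffix) for name in schemas)
    ]


if __name__ == "__main__":
    print(files_with_factor(25))
```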
The clean schemas contain 13 relations: animal, animal_owner, animal_weight, appointment, appointment_service, appointment_slot, doctor, doctor_historization, microchip, microchip_code, owner, service, slot.
The polluted schemas contain 9 relations: animal, appointment, appointment_slot, doctor, microchip, microchip_code, owner, service, slot.
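To make the difference between the two schema types explicit, the relation lists above can be compared as sets. This is a minimal sketch in Python that uses only the relation names listed above; the variable names are chosen for illustration.

```python
# Relation names copied from the two lists above.
CLEAN_RELATIONS = {
    "animal", "animal_owner", "animal_weight", "appointment",
    "appointment_service", "appointment_slot", "doctor",
    "doctor_historization", "microchip", "microchip_code",
    "owner", "service", "slot",
}
POLLUTED_RELATIONS = {
    "animal", "appointment", "appointment_slot", "doctor",
    "microchip", "microchip_code", "owner", "service", "slot",
}

# Relations present in the clean schemas but absent from the polluted ones.
ONLY_IN_CLEAN = sorted(CLEAN_RELATIONS - POLLUTED_RELATIONS)

if __name__ == "__main__":
    print(ONLY_IN_CLEAN)
    # prints ['animal_owner', 'animal_weight', 'appointment_service', 'doctor_historization']
```

Every relation of the polluted schemas also appears in the clean schemas; the four relations printed above exist only in the clean ones.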
More information on these databases, their generation, and their pollution is available at https://github.com/mathildemarcy/perfect_pet.
The files contain full dumps of PostgreSQL databases.
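One possible way to load a dump is with the psql command-line client. The sketch below, in Python, only builds the command and assumes that the files are plain-text SQL dumps (as the .sql extension suggests) and that an empty target database already exists. The database name perfect_pet and the helper build_restore_command are hypothetical and not part of the Perfect Pet software.

```python
import shlex


def build_restore_command(dump_file, dbname, user=None, host=None):
    """Build a psql invocation that executes a plain-text SQL dump.

    Assumes the target database `dbname` has already been created.
    """
    # ON_ERROR_STOP makes psql abort on the first failing statement.
    cmd = ["psql", "-v", "ON_ERROR_STOP=1", "-d", dbname, "-f", dump_file]
    if user is not None:
        cmd += ["-U", user]
    if host is not None:
        cmd += ["-h", host]
    return cmd


if __name__ == "__main__":
    # "perfect_pet" is a placeholder database name.
    command = build_restore_command("perfect_pet_5000.sql", "perfect_pet")
    print(shlex.join(command))
```

The resulting list can be passed to subprocess.run(command, check=True) to execute the restore. If the dumps turn out to be in a non-text format, pg_restore would be needed instead.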