For a while I have wanted to set up reusable scrubbed database dumps like the ones I had at a former employer. I never learned the specifics of that setup, but the gist is the same. We take the following steps:
- Create scrubbed/pruned database within internal network
- Upload database to private yet externally accessible location (in our case S3)
- Download and install uploaded scrubbed/pruned database for development
- Result: accurate yet non-identifiable data for development
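To make the scrubbing step concrete, here is a minimal sketch in Python of what replacing identifying columns could look like. The column names (`id`, `email`, `name`), the salt value, and both helper functions are hypothetical; the post does not describe the real scrubbing rules, so treat this as an illustration only.

```python
import hashlib


def scrub_email(email: str, salt: str = "dev-dump") -> str:
    """Map an email to a stable placeholder address (hypothetical helper)."""
    # Hashing keeps the mapping consistent across tables, so joins on email
    # still line up after scrubbing. The salt is an assumed secret value.
    digest = hashlib.sha256((salt + email.lower()).encode("utf-8")).hexdigest()
    return f"user-{digest[:12]}@example.com"


def scrub_row(row: dict) -> dict:
    """Return a copy of a row with the identifying columns replaced."""
    scrubbed = dict(row)
    scrubbed["email"] = scrub_email(row["email"])
    scrubbed["name"] = "User " + str(row["id"])
    # Non-identifying columns are left untouched so the data stays realistic.
    return scrubbed
```

Columns that carry no personal information pass through unchanged, which is what keeps the dump useful for development.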
Side note: We also store multiple versions and use a hardcoded file (i.e. latest-version) as a pointer for which to use.
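The pointer idea can be sketched roughly like this in Python. A local directory stands in for the S3 bucket, and the dump file names and the `publish_dump` / `resolve_latest_dump` helpers are assumptions for illustration; only the `latest-version` file name comes from the setup described above.

```python
from pathlib import Path

POINTER_NAME = "latest-version"  # the hardcoded pointer file


def publish_dump(store: Path, name: str, data: bytes) -> None:
    """Store a new dump, then repoint latest-version at it."""
    (store / name).write_bytes(data)
    # Update the pointer last so it never names a dump that is not there yet.
    (store / POINTER_NAME).write_text(name + "\n")


def resolve_latest_dump(store: Path) -> Path:
    """Return the path of the dump that the pointer file names."""
    # The pointer file is assumed to hold one line: the dump's file name.
    version = (store / POINTER_NAME).read_text().strip()
    if not version:
        raise ValueError("pointer file is empty")
    dump = store / version
    if not dump.is_file():
        raise FileNotFoundError(f"pointer names a missing dump: {version}")
    return dump
```

Older dumps stay in place, so rolling back would only mean rewriting the pointer file.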
The corresponding scripts are run like this: