What is taking up so much space in /var/lib/pulp?
Greetings, hope someone can share some insight here. Running Red Hat Satellite 6.2.9.
The installation doc states that with RHEL 5,6 and 7 synchronized, the runtime size is 500GB. I understand that is an estimate. I only have RHEL 6.3-6.9 and 7.1-7.3 synchronized, I only have optional/debug repos enabled for 7.2 and 7.3, some HA/RS repos and capsule server. Yet /var/lib/pulp is pushing 600GB and I can't expand it right now, my hardware is maxed out. I have disabled the repos I don't need and I've manually ran the cron job to clear orphaned content, yet I was only able to free up about 1GB. Is there any other tool I can run to find unnecessary content, or a certain configuration to avoid that could be wasting space? Just looking for some guidance, thank you.
Responses
Historically debug repos are huge. We have at least 1 TB allocated for /var/lib/pulp on our primary prod satellite. (RHEL5, 6, 7 {optional,extras,supplmentary} and each major kickstart trees as well as RHV, RHV-M, and Satellite repos) I think the docs fail to provide an accurate approximation for storage. It is pretty emphatic about using LVM storage and making sure that pulp is expandable though.
I inadvertently synched a LifeCycle Environment to a capsule the other day, and had to work pretty hard to back it out using the KB's and Discussions on clearning orphaned rpms, but in the end, I don't think it completely frees up space for RPMs or content that are no longer in use.
Pulp default is to retain content by default.
A bugzilla was raised for /var/lib/pulp growth on Sat 6.1.10.
In Satellite 6.2, there are a couple of features that may be set on repositories that may be useful in managing space: 1. 'Mirror on Sync': With this feature, each time a repository is synchronized, if a new version of a package is downloaded, the old version is removed. 2. 'On Demand' (Download Policy): With this feature, the content will only get downloaded to the filesystem when a client requests it.
Mirror on sync means 'make the repository on disk match what is in the upstream repository'. It is useful when there are issues with a repo, such as if Red Hat mistakenly releases a package with the Beta repo, you need a means to fix the downstream repos (in your Satellite) after we fix it upstream.
Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.
