Skip to content

Instantly share code, notes, and snippets.

@dsc
Created June 13, 2012 17:53
Show Gist options
  • Save dsc/2925497 to your computer and use it in GitHub Desktop.
Save dsc/2925497 to your computer and use it in GitHub Desktop.
Permanently remove crap from a git repo.
#!/bin/bash
#set -x
# Shows you the largest objects in your repo's pack file.
# Written for osx.
#
# @see http://stubbisms.wordpress.com/2009/07/10/git-script-to-show-largest-pack-objects-and-trim-your-waist-line/
# @author Antony Stubbs
# set the internal field spereator to line break, so that we can iterate easily over the verify-pack output
IFS=$'\n';
# list all objects including their size, sort by size, take top 10
objects=`git verify-pack -v .git/objects/pack/pack-*.idx | grep -v chain | sort -k3nr | head`
echo "All sizes are in kB's. The pack column is the size of the object, compressed, inside the pack file."
output="size,pack,SHA,location"
for y in $objects
do
# extract the size in bytes
size=$((`echo $y | cut -f 5 -d ' '`/1024))
# extract the compressed size in bytes
compressedSize=$((`echo $y | cut -f 6 -d ' '`/1024))
# extract the SHA
sha=`echo $y | cut -f 1 -d ' '`
# find the objects location in the repository tree
other=`git rev-list --all --objects | grep $sha`
#lineBreak=`echo -e "\n"`
output="${output}\n${size},${compressedSize},${other}"
done
echo -e $output | column -t -s ', '
#!/bin/bash
### Rewrite history (all commits in all branches) without the files that match the pattern
DELETE_PAT="$1"
git filter-branch --tree-filter "rm -rf $1" --prune-empty --tag-name-filter cat -- --all
### filter-branch saves the original history to refs/original
### because we aim to reclaim space, we need to remove it and then gc the refs
git for-each-ref --format="%(refname)" refs/original/| xargs -n 1 git update-ref -d
git reflog expire --expire=now --all
git gc --prune=now
### before removing any old remotes, you'll want to check out the branches
for b in $(git branch -r | grep -v -e '->'); do
git checkout $(cut -d/ -f2)
done
### remove whatever old remotes are scary
git remote rm origin
### finally, you'll want to push this repo with modified history someplace new
git remote add origin NEW_REMOTE
git push origin --all
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment