Merge lp:~wgrant/launchpad/gc-dupe-lighter into lp:launchpad
Proposed by William Grant
| Status: | Merged |
|---|---|
| Merged at revision: | 18113 |
| Proposed branch: | lp:~wgrant/launchpad/gc-dupe-lighter |
| Merge into: | lp:launchpad |
| Diff against target: | 149 lines (+42/-38), 2 files modified: lib/lp/services/librarianserver/librariangc.py (+26/-33), lib/lp/services/librarianserver/tests/test_gc.py (+16/-5) |
| To merge this branch: | bzr merge lp:~wgrant/launchpad/gc-dupe-lighter |
| Related bugs: | |
| Reviewer | Review Type | Date Requested | Status |
|---|---|---|---|
| Colin Watson (community) | Approve | | |

Review via email: mp+297878@code.launchpad.net
Commit message
Don't hash duplicates in librarian-gc.
Description of the change
Don't hash duplicates in librarian-gc.
Instead of comparing every file, just check that the original file's hash matches the one recorded in the DB, and trust that the remaining duplicates haven't collided. Pedantically verifying that every duplicate matches the original bit by bit is very slow now that we have a lot of duplicates, most of which have to be downloaded from Swift.
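A rough sketch of the idea described above, not the actual code from librariangc.py: only the original file is read and hashed, and the duplicates are never opened. The names `digest_of`, `can_merge_duplicates`, and `open_content` are hypothetical, and the default of SHA-1 is an assumption made for illustration.

```python
import hashlib
import io


def digest_of(stream, algorithm="sha1", chunk_size=256 * 1024):
    """Return the hex digest of a binary file-like object, read in chunks."""
    hasher = hashlib.new(algorithm)
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        hasher.update(chunk)
    return hasher.hexdigest()


def can_merge_duplicates(open_content, original_id, db_digest,
                         algorithm="sha1"):
    """Decide whether duplicates may be merged into the original.

    Only the original is opened and hashed. The duplicates are trusted
    to match because they share the digest recorded in the database.
    `open_content` is a hypothetical callable that returns a binary
    stream for a content ID (from local disk or from Swift).
    """
    with open_content(original_id) as stream:
        return digest_of(stream, algorithm) == db_digest
```

The cost is one read of the original per set of duplicates, rather than one read per duplicate, which matters when each read is a download. The trade-off is that a hash collision among the duplicates would go unnoticed.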
I guess we have little alternative if librarian-gc can't keep up.
We really need to move to SHA-256 here sooner rather than later, though.