Bazaar

Merge lp:~jameinel/bzr/2.0-490228-gc-segfault into lp:bzr/2.0

2.0-490228-gc-segfault
Merge into 2.0

Proposed by John A Meinel on 2009-12-14

Status:

Merged

Approved by:

Vincent Ladeuil on 2009-12-14

Approved revision:

not available

Merged at revision:

not available

Proposed branch:

lp:~jameinel/bzr/2.0-490228-gc-segfault

Merge into:

lp:bzr/2.0

Diff against target:

73 lines (+20/-10)

2 files modified

NEWS (+5/-0)
bzrlib/diff-delta.c (+15/-10)

To merge this branch:

bzr merge lp:~jameinel/bzr/2.0-490228-gc-segfault

High

Fix Released

Link a bug report

Reviewer	Review Type	Date Requested	Status
Vincent Ladeuil		2009-12-14	Approve on 2009-12-14
Review via email: mp+16140@code.launchpad.net

Revision history for this message

John A Meinel (jameinel) wrote on 2009-12-14:

See the mp here:
https://code.edge.launchpad.net/~jameinel/bzr/2.0-490228-gc-segfault/+merge/16139

I meant to submit this for 2.0 inclusion, and I just forgot to mark it as such.

Revision history for this message

Vincent Ladeuil (vila) wrote on 2009-12-14:

Unless you can demonstrate that you don't include regression here, that clearly a controversial bug
to land.

My feeling here is that we should rely on your judgment (as I mentioned in the related thread
on the bazaar list).

So, since you're still RM for 2.0.3 and you feel comfortable landing this, go ahead !

review: Approve

Revision history for this message

John A Meinel (jameinel) wrote on 2009-12-14:

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Vincent Ladeuil wrote:
> Review: Approve
> Unless you can demonstrate that you don't include regression here, that clearly a controversial bug
> to land.
>
> My feeling here is that we should rely on your judgment (as I mentioned in the related thread
> on the bazaar list).

I'm quite confident it isn't going to introduce a regression. I did a
fairly careful analysis of the code and structures (since I didn't have
test cases to rely on :).

A just-as-correct-but-more-controversial fix would have been to remove
the '->ptr == NULL' check. Specifically,

if (cur_entry == next_bucket_entry) {

The data structures *should* work such that if there isn't a slot to
hold the new pointer, cur_entry == next_bucket_entry after the while
loop. I was being conservative by leaving the rest of the structures alone.

>
> So, since you're still RM for 2.0.3 and you feel comfortable landing this, go ahead !

If you look at the bug report, you'll see the analysis I did. I'll land
it in 2.0.*

John
=:->
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (Cygwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAksmcr0ACgkQJdeBCYSNAANVuQCgmS+waehHRvbZ08thk1LKs+Qc
mXMAoMlH2cwwCtIqk/baLPiHDErzJchT
=5nzA
-----END PGP SIGNATURE-----

Revision history for this message

Vincent Ladeuil (vila) wrote on 2009-12-15:

>>>>> "jam" == John A Meinel <email address hidden> writes:

<snip/>

jam> If you look at the bug report, you'll see the analysis I
jam> did.

I did look and the analysis as well as the commit message
are... great. You did a really good job there for sure.

Vincent

Preview Diff

[H/L] Next/Prev Comment, [J/K] Next/Prev File, [N/P] Next/Prev Hunk

Subscribers

People subscribed via source and target branches

to all changes:

Alexander Belchenko

Bazaar Codereview Subscribers

Benoit Pierre

John A Meinel

Martin Pool

Matt Nordhoff

bzr PQM

pascalprost

 === modified file 'NEWS'
 --- NEWS	2009-12-02 01:30:35 +0000
 +++ NEWS	2009-12-14 16:02:17 +0000
@@ -27,6 +27,11 @@
  * Content filters are now applied correctly after pull, merge and switch.
    (Ian Clatworthy, #385879)
++* Fix a potential segfault in the groupcompress hash map handling code.
++  When inserting new entries, if the final hash bucket was empty, we could
++  end up trying to access if ``(last_entry+1)->ptr == NULL``.
++  (John Arbash Meinel, #490228)
++
  * Improve "Binary files differ" hunk handling.  (Aaron Bentley, #436325)
  Improvements
 === modified file 'bzrlib/diff-delta.c'
 --- bzrlib/diff-delta.c	2009-08-03 16:54:36 +0000
 +++ bzrlib/diff-delta.c	2009-12-14 16:02:17 +0000
@@ -688,7 +688,7 @@
      const unsigned char *data, *buffer, *top;
      unsigned char cmd;
      struct delta_index *new_index;
--    struct index_entry *entry, *entries, *old_entry;
++    struct index_entry *entry, *entries;
      if (!src->buf || !src->size)
          return NULL;
@@ -789,6 +789,7 @@
      entry = entries;
      num_inserted = 0;
      for (; num_entries > 0; --num_entries, ++entry) {
++        struct index_entry *next_bucket_entry, *cur_entry, *bucket_first_entry;
          hash_offset = (entry->val & old_index->hash_mask);
          /* The basic structure is a hash => packed_entries that fit in that
           * hash bucket. Things are structured such that the hash-pointers are
@@ -797,15 +798,19 @@
           * forward. If there are no NULL targets, then we know because
           * entry->ptr will not be NULL.
           */
--        old_entry = old_index->hash[hash_offset + 1];
--        old_entry--;
--        while (old_entry->ptr == NULL
--               && old_entry >= old_index->hash[hash_offset]) {
--            old_entry--;
++        // The start of the next bucket, this may point past the end of the
++        // entry table if hash_offset is the last bucket.
++        next_bucket_entry = old_index->hash[hash_offset + 1];
++        // First entry in this bucket
++        bucket_first_entry = old_index->hash[hash_offset];
++        cur_entry = next_bucket_entry - 1;
++        while (cur_entry->ptr == NULL && cur_entry >= bucket_first_entry) {
++            cur_entry--;
+         }
--        old_entry++;
--        if (old_entry->ptr != NULL
--            || old_entry >= old_index->hash[hash_offset + 1]) {
++        // cur_entry now either points at the first NULL, or it points to
++        // next_bucket_entry if there were no blank spots.
++        cur_entry++;
++        if (cur_entry >= next_bucket_entry || cur_entry->ptr != NULL) {
              /* There is no room for this entry, we have to resize */
              // char buff[128];
              // get_text(buff, entry->ptr);
@@ -822,7 +827,7 @@
              break;
+         }
          num_inserted++;
--        *old_entry = *entry;
++        *cur_entry = *entry;
          /* For entries which we *do* manage to insert into old_index, we don't
           * want them double copied into the final output.
           */

Bazaar

Merge lp:~jameinel/bzr/2.0-490228-gc-segfault into lp:bzr/2.0

Commit message

Description of the change

Preview Diff

Subscribers