| Date: Fri, 19 Dec 2008 00:45:19 -0800 |
| From: Linus Torvalds <torvalds@linux-foundation.org>, Junio C Hamano <gitster@pobox.com> |
| Subject: Re: Odd merge behaviour involving reverts |
| Abstract: Sometimes a branch that was already merged to the mainline |
| is later found to be faulty. Linus and Junio give guidance on |
| recovering from such a premature merge and continuing development |
| after the offending branch is fixed. |
| Message-ID: <7vocz8a6zk.fsf@gitster.siamese.dyndns.org> |
| References: <alpine.LFD.2.00.0812181949450.14014@localhost.localdomain> |
| |
| Alan <alan@clueserver.org> said: |
| |
| I have a master branch. We have a branch off of that that some |
| developers are doing work on. They claim it is ready. We merge it |
| into the master branch. It breaks something so we revert the merge. |
| They make changes to the code. they get it to a point where they say |
| it is ok and we merge again. |
| |
| When examined, we find that code changes made before the revert are |
| not in the master branch, but code changes after are in the master |
| branch. |
| |
| and asked for help recovering from this situation. |
| |
| The history immediately after the "revert of the merge" would look like |
| this: |
| |
| ---o---o---o---M---x---x---W |
| / |
| ---A---B |
| |
| where A and B are on the side development that was not so good, M is the |
| merge that brings these premature changes into the mainline, x are changes |
| unrelated to what the side branch did and already made on the mainline, |
| and W is the "revert of the merge M" (doesn't W look M upside down?). |
| IOW, "diff W^..W" is similar to "diff -R M^..M". |
| |
| Such a "revert" of a merge can be made with: |
| |
| $ git revert -m 1 M |
| |
| After the developers of the side branch fix their mistakes, the history |
| may look like this: |
| |
| ---o---o---o---M---x---x---W---x |
| / |
| ---A---B-------------------C---D |
| |
| where C and D are to fix what was broken in A and B, and you may already |
| have some other changes on the mainline after W. |
| |
| If you merge the updated side branch (with D at its tip), none of the |
| changes made in A nor B will be in the result, because they were reverted |
| by W. That is what Alan saw. |
| |
| Linus explains the situation: |
| |
| Reverting a regular commit just effectively undoes what that commit |
| did, and is fairly straightforward. But reverting a merge commit also |
| undoes the _data_ that the commit changed, but it does absolutely |
| nothing to the effects on _history_ that the merge had. |
| |
| So the merge will still exist, and it will still be seen as joining |
| the two branches together, and future merges will see that merge as |
| the last shared state - and the revert that reverted the merge brought |
| in will not affect that at all. |
| |
| So a "revert" undoes the data changes, but it's very much _not_ an |
| "undo" in the sense that it doesn't undo the effects of a commit on |
| the repository history. |
| |
| So if you think of "revert" as "undo", then you're going to always |
| miss this part of reverts. Yes, it undoes the data, but no, it doesn't |
| undo history. |
| |
| In such a situation, you would want to first revert the previous revert, |
| which would make the history look like this: |
| |
| ---o---o---o---M---x---x---W---x---Y |
| / |
| ---A---B-------------------C---D |
| |
| where Y is the revert of W. Such a "revert of the revert" can be done |
| with: |
| |
| $ git revert W |
| |
| This history would (ignoring possible conflicts between what W and W..Y |
| changed) be equivalent to not having W nor Y at all in the history: |
| |
| ---o---o---o---M---x---x-------x---- |
| / |
| ---A---B-------------------C---D |
| |
| and merging the side branch again will not have conflict arising from an |
| earlier revert and revert of the revert. |
| |
| ---o---o---o---M---x---x-------x-------* |
| / / |
| ---A---B-------------------C---D |
| |
| Of course the changes made in C and D still can conflict with what was |
| done by any of the x, but that is just a normal merge conflict. |
| |
| On the other hand, if the developers of the side branch discarded their |
| faulty A and B, and redone the changes on top of the updated mainline |
| after the revert, the history would have looked like this: |
| |
| ---o---o---o---M---x---x---W---x---x |
| / \ |
| ---A---B A'--B'--C' |
| |
| If you reverted the revert in such a case as in the previous example: |
| |
| ---o---o---o---M---x---x---W---x---x---Y---* |
| / \ / |
| ---A---B A'--B'--C' |
| |
| where Y is the revert of W, A' and B' are rerolled A and B, and there may |
| also be a further fix-up C' on the side branch. "diff Y^..Y" is similar |
| to "diff -R W^..W" (which in turn means it is similar to "diff M^..M"), |
| and "diff A'^..C'" by definition would be similar but different from that, |
| because it is a rerolled series of the earlier change. There will be a |
| lot of overlapping changes that result in conflicts. So do not do "revert |
| of revert" blindly without thinking.. |
| |
| ---o---o---o---M---x---x---W---x---x |
| / \ |
| ---A---B A'--B'--C' |
| |
| In the history with rebased side branch, W (and M) are behind the merge |
| base of the updated branch and the tip of the mainline, and they should |
| merge without the past faulty merge and its revert getting in the way. |
| |
| To recap, these are two very different scenarios, and they want two very |
| different resolution strategies: |
| |
| - If the faulty side branch was fixed by adding corrections on top, then |
| doing a revert of the previous revert would be the right thing to do. |
| |
| - If the faulty side branch whose effects were discarded by an earlier |
| revert of a merge was rebuilt from scratch (i.e. rebasing and fixing, |
| as you seem to have interpreted), then re-merging the result without |
| doing anything else fancy would be the right thing to do. |
| |
| However, there are things to keep in mind when reverting a merge (and |
| reverting such a revert). |
| |
| For example, think about what reverting a merge (and then reverting the |
| revert) does to bisectability. Ignore the fact that the revert of a revert |
| is undoing it - just think of it as a "single commit that does a lot". |
| Because that is what it does. |
| |
| When you have a problem you are chasing down, and you hit a "revert this |
| merge", what you're hitting is essentially a single commit that contains |
| all the changes (but obviously in reverse) of all the commits that got |
| merged. So it's debugging hell, because now you don't have lots of small |
| changes that you can try to pinpoint which _part_ of it changes. |
| |
| But does it all work? Sure it does. You can revert a merge, and from a |
| purely technical angle, git did it very naturally and had no real |
| troubles. It just considered it a change from "state before merge" to |
| "state after merge", and that was it. Nothing complicated, nothing odd, |
| nothing really dangerous. Git will do it without even thinking about it. |
| |
| So from a technical angle, there's nothing wrong with reverting a merge, |
| but from a workflow angle it's something that you generally should try to |
| avoid. |
| |
| If at all possible, for example, if you find a problem that got merged |
| into the main tree, rather than revert the merge, try _really_ hard to |
| bisect the problem down into the branch you merged, and just fix it, or |
| try to revert the individual commit that caused it. |
| |
| Yes, it's more complex, and no, it's not always going to work (sometimes |
| the answer is: "oops, I really shouldn't have merged it, because it wasn't |
| ready yet, and I really need to undo _all_ of the merge"). So then you |
| really should revert the merge, but when you want to re-do the merge, you |
| now need to do it by reverting the revert. |