Differences between revisions 15 and 16
Revision 15 as of 2015-12-03 05:58:55
Size: 7690
Comment:
Revision 16 as of 2015-12-14 19:08:36
Size: 7729
Editor: lcharignon
Comment:
Deletions are marked like this. Additions are marked like this.
Line 3: Line 3:
Line 11: Line 10:

The simplest way to help is to grab one of: [[http://bz.selenic.com/buglist.cgi?keywords=easy%2C%20&keywords_type=anywords&order=Bug%20Number&resolution=---&query_format=advanced&bug_status=UNCONFIRMED&bug_status=CONFIRMED&bug_status=NEED_EXAMPLE&bug_status=IN_PROGRESS&component=evolution&list_id=5014|list of easy bug]]
The simplest way to help is to grab one of: [[http://bz.selenic.com/buglist.cgi?keywords=easy,%20&keywords_type=anywords&order=Bug%20Number&resolution=---&query_format=advanced&bug_status=UNCONFIRMED&bug_status=CONFIRMED&bug_status=NEED_EXAMPLE&bug_status=IN_PROGRESS&component=evolution&list_id=5014|list of easy bug]]
Line 17: Line 15:
  * detection that part of the rebase set is already in the destination
   * warning about divergence creation.
  * detection that part of the rebase set is already in the destination
  * warning about divergence creation.
Line 37: Line 35:
  * Continue/stop/abort for all command including evolve
   * In transaction content non exchangeable.
  * Continue/stop/abort for all command including evolve
  * In transaction content non exchangeable.
Line 45: Line 43:
Line 53: Line 50:
Line 57: Line 53:
Line 75: Line 70:
Line 89: Line 83:
Line 91: Line 84:
Line 94: Line 86:
- It is easy to create divergence,
- You get a lot of conflict when rebasing an obsolete stack on it's successors version.
- It is easy to create divergence, - You get a lot of conflict when rebasing an obsolete stack on it's successors version.
Line 97: Line 88:
To enable the current implementation set the config '''experimental.rebaseskipobsolet'''e to true
Line 99: Line 91:
 * {X} same logic as above, handling prune,
 * {X} same logic as above, handling split,
 * {X} ability to detect divergence creation and bail out,
 * (./) same logic as above, handling prune,
 * (./) same logic as above, handling split,
 * (./) ability to detect divergence creation and bail out,
Line 107: Line 99:
Line 109: Line 100:

Markers are stored in an append-only file stored in
'.hg/store/obsstore'.
Markers are stored in an append-only file stored in '.hg/store/obsstore'.
Line 114: Line 103:
Line 118: Line 106:
Line 123: Line 110:
Line 125: Line 111:
Line 129: Line 114:
Line 138: Line 122:
   obsolete marker operations, to avoid repeated decoding of metadata
  
entries.
  . obsolete marker operations, to avoid repeated decoding of metadata entries.
Line 146: Line 129:
   string contains a key and a value, separated by a colon ':', without
  
additional encoding. Keys cannot contain '\0' or ':' and values
  
cannot contain '\0'.
  . string contains a key and a value, separated by a colon ':', without additional encoding. Keys cannot contain '\0' or ':' and values cannot contain '\0'.
Line 151: Line 132:

Line 154: Line 133:
Line 157: Line 135:
  * date: There is currently *always* a date in the meta data. So storing it explicitly would be more space efficient. It would also open the way to quickly access the date for sorting purpose (no use case yet but not crazy to think about it)  * date: There is currently *always* a date in the meta data. So storing it explicitly would be more space efficient. It would also open the way to quickly access the date for sorting purpose (no use case yet but not crazy to think about it)
Line 159: Line 137:
  * parents: When a changesets is pruned (obsoleted, no successors) we needs to records its parents. This is necessary to link the markers chain to the push/pull operation it is relevant to.  * parents: When a changesets is pruned (obsoleted, no successors) we needs to records its parents. This is necessary to link the markers chain to the push/pull operation it is relevant to.
Line 161: Line 139:
  * We may want to extend the bit field to 2 bytes. We currently use 1 and can see use case for 3-5 others (tracking the type of changes introduce by the rewriting (desc, patches, content, etc) so we are running short  * We may want to extend the bit field to 2 bytes. We currently use 1 and can see use case for 3-5 others (tracking the type of changes introduce by the rewriting (desc, patches, content, etc) so we are running short
Line 163: Line 141:
  * We may also want to explicitly store the username of the marker's creator are they will always be ones. however there is no need for quick access of such information so it could stay in the metadata field.  * We may also want to explicitly store the username of the marker's creator are they will always be ones. however there is no need for quick access of such information so it could stay in the metadata field.
Line 166: Line 144:
Line 179: Line 156:
 2. Having an explicite field but store the number of parent in the bit fields (since we never have more than 2 parents)  1. Having an explicite field but store the number of parent in the bit fields (since we never have more than 2 parents)
Line 181: Line 158:
 3. Using the successors field. Having negative number of successors mean it is a prune.  1. Using the successors field. Having negative number of successors mean it is a prune.
Line 192: Line 169:
 

Changesets Evolution - development page.

/!\ This page is intended for developer

For a user perspective have a look at the ChangesetEvolution page.

1. Contributing

The simplest way to help is to grab one of: list of easy bug

There is also a multiple well defined topic that where idea exist but needs an implementation

  • Rebase could make more use of obsolescence marker:
    • detection that part of the rebase set is already in the destination
    • warning about divergence creation.
  • Obsolescence markers exchange (no really we have idea waiting to be implemented)

  • A hg split command

  • Prototype for a command bringing changeset back to life.

There is more complicated part that requires attention too.

  • Better storage//loading//cache for markers (depends on marker exchange)

  • Solving the problem of Topic Branch

  • Handling moving changesets around with the "plan" concept,
  • in memory merge (helps a lot of troubles resolution throught evolve)

  • Wide transaction capability
    • Continue/stop/abort for all command including evolve
    • In transaction content non exchangeable.
  • Making evolve capable of solving all troubles that user can encounter
  • Computing UI message about troubles from event who happened during the transaction.

2. Roadmap

Current status:

  • (./) Alpha Stage,

  • {X} Beta Stage,

  • {X} Release Stage,

2.1. Alpha Stage

Changeset Evolution is currently at Alpha Stage. If won't eat people data, but only handful of people knowns how to get out some situation..

2.2. Beta Stage

At beta state, the UI and experience will not be easy/pleasant enough for normal user, but advance user of Mercurial should find their mark. We may consider shipping it with Core Mercurial with an experimental flag.

Blocker to beta release:

  • Obsmarkers exchange should be good enough to define a final storage format for markers.
  • On disk format should be stable in the foreseeable future and fit performance//exchange need.
  • Evolve should be able to solve (or provide a way to) all troubles that a user may encounter, especially divergence,
  • Evolve should have predictable result (--rev options and co),
  • Evolve should be abortable (wide transaction),
  • Performance impact should be reasonable,

2.3. Release Stage

A which point we can merge changesets evolution into core.

  • UI offering a Solution to the N² markers creation when editing history (TopicPlan),

  • Commands set defined enough to be freezed for backward compatibility
  • No race condition when exchanging with server (bundle2 + repo layout allowing atomic transaction),

  • No impactful Performance Regression (including efficient exchange),
  • Concrete plan to handle high volume of markers (archiving or something),

3. In progress Features

3.1. Using Obsolescence Marker during Rebase

There is two big issues with rebasing a set containing obsolescence changeset:

- It is easy to create divergence, - You get a lot of conflict when rebasing an obsolete stack on it's successors version.

To enable the current implementation set the config experimental.rebaseskipobsolete to true

  • (./) rebase can skip obsolete changeset when rebased on successors,

  • (./) same logic as above, handling prune,

  • (./) same logic as above, handling split,

  • (./) ability to detect divergence creation and bail out,

  • {X} ability to rebase set with obsolescence inside the set (rebasing both precursors and successors) without creating divergence,

  • {X} official config to control these two behavior (either in one or two config)

  • {X} config on by default.

4. Archived Topic

4.1. Obsstore Format

Markers are stored in an append-only file stored in '.hg/store/obsstore'.

4.1.1. V1 (current) Format

(see in line document for latest data)

4.1.1.1. quick summary
  • <number-of-successors(=N)><metadata-lenght(=M)><bits-field><precursor>(<successor>*N)<metadata>

  • B, I, B, 20s, (20s*N), s*M

4.1.1.2. longer explanation

The file starts with a version header:

  • 1 unsigned byte: version number, starting at zero.

The header is followed by the markers. Each marker is made of:

  • 1 unsigned byte: number of new changesets "N", can be zero.
  • 1 unsigned 32-bits integer: metadata size "M" in bytes.
  • 1 byte: a bit field. It is reserved for flags used in common
    • obsolete marker operations, to avoid repeated decoding of metadata entries.
  • 20 bytes: obsoleted changeset identifier.
  • N*20 bytes: new changesets identifiers.
  • M bytes: metadata as a sequence of nul-terminated strings. Each
    • string contains a key and a value, separated by a colon ':', without additional encoding. Keys cannot contain '\0' or ':' and values cannot contain '\0'.

4.1.2. V2 (current) Format

4.1.2.1. motivation

There is two extra information we would like to see in a second version of the format:

  • date: There is currently *always* a date in the meta data. So storing it explicitly would be more space efficient. It would also open the way to quickly access the date for sorting purpose (no use case yet but not crazy to think about it)
  • parents: When a changesets is pruned (obsoleted, no successors) we needs to records its parents. This is necessary to link the markers chain to the push/pull operation it is relevant to.
  • We may want to extend the bit field to 2 bytes. We currently use 1 and can see use case for 3-5 others (tracking the type of changes introduce by the rewriting (desc, patches, content, etc) so we are running short
  • We may also want to explicitly store the username of the marker's creator are they will always be ones. however there is no need for quick access of such information so it could stay in the metadata field.

4.1.2.2. possible change

Date:

  • The date will be a 64bits float (for seconds since epoch) followed by a 16 bits integer (time zone)
  • It will make sense to put the date in front of the markers. that would give markers sorting some semantic.

Parents:

We have multiple option for storing parents:

  1. Having an explicite field similar to successors (one byte to know how many parents, then parents)
  2. Having an explicite field but store the number of parent in the bit fields (since we never have more than 2 parents)
  3. Using the successors field. Having negative number of successors mean it is a prune.

Option (3) is the most space saving but prevent use to store parent information for more changesets if needed in the future (We do not have a final exchange plan yet).

Option (1) and (2) takes 2 to 8 bits more than (3) but are more flexible.

bit field

If we extend the bit field to 2 Bytes, it makes sense to use option (2) for storing parent.

4.1.2.3. Proposed Format
  • <date><timezone><number-of-successors(=N)><metadata-lenght(=M)><bits-field+nb-parents(=P)><precursor>(<successor>*N)(<parents>*P)<metadata>

  • d, h, B, I, H, 20s, (20s*N), (20s*P), s*M

The P number would be hidden in the bit field. We need to store 4 possible values here: 0 parents, 1 parent, 2 parents, ø parents information stored. Possible assignement is 00, 01, 10, 11. this let both 0 parent and ø parent info to be 0 module 3.

ChangesetEvolutionDevel (last edited 2020-05-29 08:03:48 by aayjaychan)