Size: 8306
Comment:
|
Size: 22808
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 2: | Line 2: |
= States Plan = /!\ This page is intended for developers A set of proposed Mercurial features to cleanly and safely handle mutable history. |
= States Plan (aka Phases) = /!\ This page is intended for developers /!\ For more generic details check [[MutableHG]] page |
Line 11: | Line 7: |
== Introduction == This page aims to define features connected with the 'liquid hg' discussion in enough detail that they can be fine-tuned and implemented. They are intentionally presented in a proposed order of implementation. == Hidden changesets == Various features will require changesets to be hidden by default. So it makes sense to implement a centralized method of dealing with this property. Usage: Hidden changesets can be shown with the --hidden flag, eg 'hg log --hidden' will show all changesets. Implementation: The changelog object contains a _hidden set (using revision numbers) that log and other commands consult. To hide a changeset, add it to the set at startup. Contexts are given a .hidden() predicate that consults the set. == Changeset states == A 'changeset state' is an indicator that tells us how a changeset is manipulated and communicated. The details of each state is described below, here we describe their shared properties. Like bookmarks, states are not stored in history and thus are not permanent and leave no audit trail. First, no changeset can be in two states at once. States are also ordered, so they can be considered from lowest to highest. The default, lowest state is 'frozen' - this is the normal state of existing changesets. No child changeset can be in a lower state than its parents. The proposed states are: frozen < liquid < local < dead These states share a hierarchy of traits: || || mutable || unshared || hidden || GC-able|| ||frozen|| || || || || ||liquid|| x || || || || ||local || x || x || || || ||dead || x || x || x || x || These names are subject to change. Some of them have initials that collide with each other and with options like force and they don't have an obvious progression. |
== Abstract == The phase concept aims to introduce a clear distinction between the part of the history that can't be rewritten and the part of history that you can safely rewrite. A basic usage of phases will track changesets on which you have full control because they exist only in your repository, and will prevent you to rewrite the other changesets that have been shared. A more advanced usage of phases will allow to exchange a part of the history and yet keep it mutable. The actual rewritting of the existing changesets is covered by MutableHG. This will ease the setup of test or review processes. The phase concept also adds a layer to help people to control what changesets they exchange with others. == related discussion == State of changeset when no data are available: * http://selenic.com/pipermail/mercurial-devel/2012-January/036745.html Phase name and property name: * http://selenic.com/pipermail/mercurial-devel/2011-December/036632.html * this wiki page * constraint got property name here: http://selenic.com/pipermail/mercurial-devel/2012-January/036760.html == Changeset phases == A 'changeset phases' is an indicator that tells us how a changeset is manipulated and communicated. The details of each phase is described below, here we describe the properties they have in common. Like bookmarks, phases are not stored in history and thus are not permanent and leave no audit trail. First, no changeset can be in two phases at once. Phases are ordered, so they can be considered from lowest to highest. The default, lowest phase is 'public' - this is the normal phase of existing changesets. A child changeset can not be in a lower phase than its parents. The proposed phases are: public < draft < secret These phases share a hierarchy of traits: || ||immutable ||shared || ||public ||x||x|| ||draft || ||x|| ||secret || || || These names are subject to change. See the Naming section. |
Line 70: | Line 51: |
States are manipulated via the 'hg state' command, ie 'hg state -f x' to mark x as frozen. Each state puts its own constraints on how it can be manipulated. The current state of changesets is displayed in the log with a 'state:' header. The frozen state is not displayed at all. |
The current phase of changesets is displayed in the log with a 'phase:' header. The public phase is not displayed at all. There are corresponding revset predicates for each phase as well. Most phase change will be made automatically by standard mercurial command. Read each phase section for details. States are manipulated via the 'hg phase' command, ie 'hg phase -p x' to mark x as public. Each phase puts its own constraints on how it can be manipulated. |
Line 77: | Line 60: |
Contexts provide a state() method that returns the current state as a string. It may be useful to represent the frozen state with the empty string. == Frozen changesets == Frozen changesets are changesets that are considered permanently immutable. This matches the history model presented by legacy Mercurial: changesets cannot be changed or removed without using history editing features from extensions. This state can be thought of as 'published': once a changeset is published, it becomes very difficult to remove it from distributed history. |
Contexts provide a phase() method that returns the current phase as an integer. Contexts provide a phasestr() method that returns the current phase as a string. public phase is stored as 0, draft phase is stored 1, secret phase is stored 2 == Public changesets == Public changesets are changesets that are considered permanently immutable. This matches the history model presented by legacy Mercurial: changesets cannot be changed or removed without using history editing features from extensions. The name reflect the fact that once a changeset is published, it becomes very difficult to remove it from distributed history. |
Line 92: | Line 68: |
Changeset are moved to the frozen state via 'hg state --frozen X'. |
Changeset are automatically set in the public phase when they are known to exist outside the repository. This behaviour can be altered using the publish option describe in the draft changeset section. * changeset pulled are public * changeset pushed are public * changeset unbundled are public Changeset can be manually moved to the public phase via 'hg phase --public X'. Tagged changeset should probably be public === implementation === === Legacy clients === N/A == draft changesets == draft changesets are changesets that the user is permitted to use history-modifying operations like rebase or mq on. They may not be tagged. They may also be thought of as 'unpublished'. Various operations such as pushing to other repo will move changesets into the public phase. Changesets cannot be moved from public to draft without a forcing operation. A phase.publish option will allow exchange of changeset without moving them in the public phase. * Changeset adding changeset to a phase.publish=True server are set public. This is the default. * Changeset adding changeset to a phase.publish=False server do not alter their phase. This allows people to collaborate on work in progress before it becomes finalized. The phase of changesets are communicated between compatible servers and clients. This should generally be engineered such that users don't have to give any additional thought to draft vs public in their day-to-day usage. === Usage === Changesets must be in a draft phase when they are created. So new commits will start in a draft phase by default. |
Line 96: | Line 101: |
N/A |
The set of draft changesets is stored as a list of draft roots. All descendants of these roots are draft (or in a higher phase). This set is known as the 'draft barrier' and defines the 'draft set'. This barrier is intended only to advance. The draft barrier is communicated via the pushkey protocol to servers that support it. The client is responsible for advancing the barrier on both the client and server sides. On each operation, the client reduces the phase set on both sides to the intersection of the sets on the client and server. That is, if a changeset is frozen on either side, it becomes frozen on both sides. By default servers are configured as 'publishing servers'. Legacy servers are publishing servers by default. These are recognized by not having the 'phase' pushkey namespace. When pushing to a publishing server, all pushed changesets are moved into the public phase on the client. Similarly, all changesets pulled from a publishing server are treated as public See this conversation for details: http://selenic.com/pipermail/mercurial-devel/2011-December/036279.html |
Line 100: | Line 110: |
N/A == Liquid changesets == Liquid changesets are changesets that the user is permitted to use history-modifying operations like rebase or mq on. They may not be tagged. They may also be thought of as 'unpublished'. Various operations such as pushing to certain public servers will move changesets into the frozen state. Changesets cannot be moved from frozen to liquid without a forcing operation. The liquidity of changesets can be communicated between compatible servers and clients. This allows people to collaborate on work in progress before it becomes finalized. This should generally be engineered such that users don't have to give any additional thought to liquid vs frozen in their day-to-day usage. |
See above for pushing to legacy clients. Legacy clients are allowed to pull draft changesets, though a round trip will make them public. == secret changesets == secret changesets are changesets that are not visible to remote clients. This is useful to mark work private and to avoid inadvertently publishing changesets. |
Line 121: | Line 116: |
Changesets must be in a liquid state when they are created. So new commits will start in a liquid state by default. |
secret changesets (and their descendents) can be marked with 'hg phase --secret'. Some extension as mq may automatically set changeset as secret. |
Line 126: | Line 121: |
The set of liquid changesets is stored as a list of liquid roots. All descendants of these roots are liquid (or in a higher state). This set is known as the 'liquid barrier' and defines the 'liquid set'. This barrier is intended only to advance. The liquid barrier is communicated via the pushkey protocol to servers that support it. The client is responsible for advancing the barrier on both the client and server sides. On each operation, the client reduces the liquid set on both sides to the intersection of the sets on the client and server. That is, if a changeset is frozen on either side, it becomes frozen on both sides. Some servers are configured as 'publishing servers'. Legacy servers are publishing servers by default. These are recognized by not having the 'liquid' pushkey namespace. When pushing to a publishing server, all pushed changesets are moved into the frozen state on the client. Similarly, all changesets pulled from a publishing server are treated as frozen. |
Like draft changesets, secret changesets are implemented via a local barrier set. This set is used to filter changesets from remote clients for push/pull/hgweb. |
Line 147: | Line 124: |
See above for pushing to legacy clients. Legacy clients are allowed to pull liquid changesets, though a round trip will make them frozen. == Local changesets == Local changesets are changesets that are not visible to remote clients. This is useful to mark work private and to avoid inadvertently publishing changesets. === Usage === Local changesets (and their descendents) can be marked with 'hg state --local'. === Implementation === Like liquid changesets, local changesets are implemented via a local barrier set. This set is used to filter changesets from remote clients for push/pull/hgweb. This is probably best implemented via a _local set on changelog. === Legacy clients === Legacy clients cannot see local changesets so will not pull them. New clients will not push local changesets by default == Dead changesets == Dead changesets are changesets that are hidden and are eligible for garbage collection. Dead changesets themselves are never pushed or pulled between clients (they are a subset of local) but deadness of changesets can be communicated between clients. If a client has a dead changeset and happens to pull a remote changeset that is an ancestor of it, the changeset is becomes liquid or frozen as appropriate. Garbage collection consists of stripping all dead changesets. When garbage collection occurs is currently undefined. === Usage === Commits (and their descendents) can be manually marked dead with 'hg state --dead'. Some operations like strip, qpop, and rebase may choose to mark changesets dead rather than actually stripping them. === Implementation === The dead set is stored as a complete list of all revisions. This allows dead status to be propagated after garbage collection occurs. Dead status is communicated via pushkey. === Legacy clients === Like local changesets, old clients cannot see dead changesets. == Abandoned changesets == Abandoned changesets are changesets that have been marked as 'no longer relevant'. Like dead changesets, they are hidden, but abandoned changesets are implemented via markers in history so they are not part of the states concept. Abandoned changesets are not pushed or pulled (see the implementation of local) but are not subject to garbage collection. Abandoned changesets can be unabandoned by committing new descendants. === Usage === To mark a branch as abandoned, use 'hg commit --abandon' to abandon the working directory and its descendants. === Implementation === The abandon commit contains an explicit list of abandoned changesets. === Legacy clients === Old clients see abandoned changesets and their marker commits as normal changesets. == Obsolete changesets == Obsolete changesets are part of an advanced concept ('changeset evolution') used to automatically resolve and combine refactoring operations between collaborators using liquid changesets. Like dead changessets, these are also hidden, but obsolete markers are implemented as pointers from X' (new changeset) to X (obsolete changeset) so are again not part of the state concept. |
Legacy clients cannot see secret changesets so will not pull them. New clients will not push secret changesets by default == More phase == No more phase are planned, in particular no trash phase: The big issue with using phase mechanism to mark changeset as dead is that phase define set of node which are consistent regarding rules that are problematic for the dead semantic. A golden semantic of phase are """all descendant of a changeset in phase X have the same property that this changeset""". We **can not** alter this: (1) We created phase for this exact semantic and we need it for all other usage. (2) The way phase are store and exchanged rely on this [1]. Having a trash phase means that a Y changeset children of X may be seen dead "by mistake" just because X was marked as dead. This is very problematic if X is added **after** the data than X is dead was created. Small local example bellow: $ hg pull Getting changeset "X" You look "X" and decide it's a silly changeset $ hg trash "X" Getting patch-y.diff from your mailbox patch-y.diff is a nice bugfix and you want it by chance, patch-y.diff have been created above "X" $ hg import --exact patch-y.diff As "Y" is a descendant of "X", "Y" is trashed too! This example highlight that phase are not very well suited to store "dead" information. We may have dedicated logic that detect new changesets is added on top of trashed one and handle this situation. But phase for dead still have two major issue! 1) It prevents any sane synchronisation of this trash phase with the outside. As phase only store un-versionned data, you have no way to know if the trash phase information were created before or after a changeset was committed. This means that you can't decide if the trash data really applies to changeset when you transfers it. 2) Adding non-trashed changeset on top on trashed changeset means moving the boundary and loosing the information that the parents trashed changeset are "not welcome in the final history". To conclusion about trash phase: * You can not safely exchange data about "dead" changeset through phase. Loosing most of point of having this data. * Safely using phase locally mean loosing the "dead" information whenever there is a conflict. == Naming == /!\ Until these names are officially nailed down, please use the names above for discussion of the concepts. The 'state' name is somewhat problematic because it's rather overloaded already and conflicts with hg stat. Thus 'phase' is maybe a better choice. The 'frozen/liquid/local/dead' name set is also not ideal. The ideal set of state names will: * all be on an obvious continuum or theme * not have conflicting initials (liquid vs local) (why?) * not conflict with common options (frozen vs force) * not insist too much on a particular workflow (review) * imply that changesets states can be moved only in one direction (frozen vs public) * shouldn't conflict with the attributes of states (mutable, hidden) * have fairly obvious semantics Current proposals: * public < draft < secret < ? (public implicates publishing: push or pull) * finished < draft < private (a draft gets finished) * final < liquid < private (a liquid reaches a final state) Discarded proposals: * public < local < secret < ? (local is a bit misleading, doesn't imply mutability) * public < local < private (initials clash) * published < review or ready < draft < trash (review or ready is a bit too much workflow) * published < mutable < private < discarded (mutable isn't thematic, conflicts with attributes) * public < draft < private < deprecated (name clash) * public < liquid < private < deprecated (liquid strongly implies mutability and unfinishedness) (but name clash) From a UI perspective, transitions between states are done either implicitly or by setting a state. So there should be no 'freeze' or 'publish' verb in the UI. == Controlling and Hooking on phase movement == === Basic use case for control === I expect this use case to be a good mirror of how we would like to use phase at Logilab or for mercurial development. There is three kind of people working with three kind of repo. ==== People ==== This scenario includes three kind of people : :Developer: are part of a team that write code and create changesets. They are not allowed to validate their own changesets. :Admin: are part of the same team but are allowed to validate changeset. :External: are not part of the team and not allowed to validate their changesets either. ''Note: In this team, only a few people are allowed to validate changesets, but it could be different, for example with everyone but the interns allowed to validate their changesets.'' ==== Repositories ==== This scenario includes three kind of repositories: :Public: is the repository used as a reference for the project. The phases.publish option is set to True. Only people:admin people are allowed to push to it. For the mercurial project it is the repository available at http://selenic.com/hg :Devel: is a development repository used by the team. The phases.publish option is set to False. Both people:developer and people:admin can push to it but only people:admin are expected to move the phase boundary to mark changeset as validated and immutable. :External: repositories created and controlled by people:external where they work on their contributions. The core development team can only pull contribution from these repositories. ==== Vocabulary ==== * people:XXX refer to a mercurial user in the group XXX described above. * repo:XXX refer to a mercurial repositoty in the group XXX described above. * phase:Public refer to the 0 phase in which a changeset are immutable * ctx:X refer to a changeset "X" ==== Expectation ==== We need a way to make sure no changeset can be set in the phase:Public by something else than being pushed into repo:Public. Below are some variants: * We don't expect changeset of repo:Devel to in the public phase if they are not in repo:Public. * Phase movement on repo:Devel should be only accepted is triggered by * pull from repo:Public * push from people:admin * Changeset in a repo owned by people:devel of people:external should not be . set in public phase by something else than pulling from repo:Public or repo:Devel ==== Examples ==== ===== Example A ===== . (1) A people:Admin push ctx:A to repo:Public (2) A people:Developer pull ctx:A from repo:Public. The phase:Public boundary . move to ctx:A (3) The same people:Developer push ctx:A to repo:Devel. The public:Phase boundary move . to ctx:A * A.1 MUST be allowed * A.2 MUST be allowed * A.3 COULD be allowed ===== Example B ===== . (1) A people:Developer make ctx:B public locally. (2) The same people:Developer push ctx:B to Dev. Public phase boundary move . to B * B.1 COULD be denied. But we can't ensure it is. * B.2 MUST be denied ===== Example C ===== . (1) A people:Developer set ctx:C as phase:Public locally (2) A people:Admin pull from people:Developer which have publish=False. . phase:Public boundary move to ctx:C (2) The same people:Admin push to repo:Devel. The phase:Public . boundary move to ctx:C * C.1 COULD be denied. But we can't ensure it is * C.2 and C.3 are more about people:Admin making a mistake. But if the phase . movement is silent enough it won't be hard to make such mistake. ===== example D ===== . (1) A people:Developer set ctx:D as phase:Public locally (2) Another people:Developer pull from the first one. The phase:Public . boundary move to ctx:D (3) The second people:Developer push to Dev. The phase:Public boundary . move to ctx:D * D.1 COULD be denied. But we can't ensure it is. * D.2 CAN NOT be denied if we what to stay a ''Decentralized'' VCS (make it . clear by having a case where dev exchange valid public changeset) * D.3 SHOULD be denied (this conflict with A.1) ===== example E ===== . (1) A people:External push ctx:E to it's repo:External repo using old . mercurial version. (2) A people:Developer pull ctx:E from repo:External. The phase:Public . boundary move to ctx:E (3) The same people:Developer push ctx:E to repo:Devel for sharing. The . phase:Public boundary move to ctx:E. * E.1 Is not something we have control on. * E.2 SHOULD not deny the pull but COULD deny the phase movement (at least warn about it) * E.3 MUST be denied === Basic use case for hook === MQ: When mq managed changeset move from secret to ready (or public) mq will want to detect it and either: * qfinish those patches * Abort the transaction QA: When changeset are made public QA bot may trigger. SYNC: When changeset are made public in a devel repo, automatic sync with public repo may be wished. === Hook change === The way to controll this should be through hooks. Bellow is proposal ==== existing hook change ==== The following existing hooks familly might receive and additional argument about phase: * changegroup * commit * incoming * outgoing They would receive a new argument $IN_PHASE. This argument hold a generic information about the maximum phase the changeset may take. This would affect the following case: * hg unbundle into publish == True * '''changegroup''' IN_PHASE=0 * '''incoming''' IN_PHASE=0 * hg unbundle into publish == False * '''changegroup''' IN_PHASE=1 * '''incoming''' IN_PHASE=1 (I'm not sur bundle should be seens as public or not…) * hg pull into publish == True from publish == True * '''changegroup''' IN_PHASE=0 * '''incoming''' IN_PHASE=0 * '''remote outgoing''' IN_PHASE=0 * hg pull into publish == True from publish == False * '''changegroup''' IN_PHASE=0 * '''incoming''' IN_PHASE=1 * '''remote outgoing''' IN_PHASE=1 * hg pull into publish == False from publish == True * '''changegroup''' IN_PHASE=1 * '''incoming''' IN_PHASE=0 * '''remote outgoing''' IN_PHASE=0 * hg pull into publish == False from publish == False * '''changegroup''' IN_PHASE=1 * '''incoming''' IN_PHASE=1 * '''remote outgoing''' IN_PHASE=1 * hg push from publish == True to publish == True * '''remote changegroup''' IN_PHASE=0 * '''outgoing''' IN_PHASE=0 * '''remote incoming''' IN_PHASE=0 (how should it know about it ?) * hg push from publish == True to publish == False * '''remote changegroup''' IN_PHASE=1 * '''outgoing''' IN_PHASE=0 * '''remote incoming''' IN_PHASE=0 (how should it know about it ?) * hg push from publish == False to publish == True * '''remote changegroup''' IN_PHASE=0 * '''outgoing''' IN_PHASE=0 * '''remote incoming''' IN_PHASE=1 (how should it know about it ?) * hg push from publish == False to publish == False * '''remote changegroup''' IN_PHASE=1 * '''outgoing''' IN_PHASE=1 * '''remote incoming''' IN_PHASE=1 (how should it know about it ?) * hg commit * '''commit''' IN_PHASE=1 * hg commit --phase=secret * '''commit''' IN_PHASE=2 * hg commit --phase=public * '''commit''' IN_PHASE=0 ==== Introducing new hook ==== I can see two new possible famillies of hook: movephase: triggered when we move phase boundary locally. * reason (commit, pull, push, remotepush, addchangegroup, unbundle, other (eg manual)) * source * publishing_source (true or false) * user ? * boundaries (how to serialise this for shell…) pushphase: triggered when we send phase data to remote * source (aka remote) * user ? * publishing_source (true or false) * boundaries (how to serialise this for shell…) Stuff that move phase: * hg commit * hg pull * hg push * hg unbundle The more details we can get about phase movement is * hg commit: * premovephase with precommit * prexnmovephase with prexncommit * movephase with commit * hg pull * (Do we need something before discovery ?) * '''premovephase''' . before we pull changegroup (move computed using remote phase data) ? (phase did not moved yet) * '''prexnmovephase''' after we pull changegroup (move computed using remote phase data). . (phase did moved) * '''movephase''' after changegroup have been added (or nothing to pull) . (phase did moved) * hg push * (Do we need something before discovery ?) * '''premovephase''' before we push changegroup (move computed using remote phase data ?) * '''prepushphase''' before we push changegroup (move computed using remote phase data ?) * '''prexnmovephase''' after we pushed changegroup but before we push phases to remote * '''prexnpushphase''' after we pushed changegroup but before push phases to remote * '''pushphase''' after changegroup have been pushed (or nothing to push) and phase pushed * '''movephase''' after changegroup have been added (or nothing to pull) and phase pushed === Solving usecase with Hook === ==== Control usecase ==== repo:Devel set a '''premovephase''' hook that deny phase move for user outside people:Admin. This deny A.1, B.2, D.3, E.3 people:Admin set a '''movephase'' hook that display warning when movephase is called on something else than: ''''' * '''''local operation ''''' * '''''pull from repo:Public ''''' * '''''push to repo:Public ''''' '''''people:Developer set a ''movephase'' hook that display flashy warning when movephase is called on something else than: ''''' * '''''pull from repo:Public ''''' '''''people:Developer set a ''pushphase'' hook that display flashy warning in all case: ||case ||requirement ||behavior || ||A.1 ||MUST be allowed ||works || ||A.2 ||MUST be allowed ||works || ||A.3 ||COULD be allowed ||denied (because D.3 and E.3) '' '' || ||B.1 ||COULD be denied ||flashy warning it's not too late || ||B.2 ||MUST be denied ||denied || ||C.1 ||COULD be denied ||flashy warning it's not too late || ||C.2 ||COULD be denied ||flashy warning it's not too late || ||C.3 ||MUST be denied ||allowed || ||D.1 ||COULD be denied ||flashy warning it's not too late || ||D.2 ||Wrong ||flashy warning it's not too late || ||D.2' ||Wrong ||flashy warning contact the other dev || ||D.3 ||MUST be denied ||denied || ||E.1 ||No control ||No Control || ||E.2 ||DENY phase move ||flashy warning fixable locally || ||E.3 ||MUST be denied ||denied || ''''' === other case === '''''MQ: can use a prexnmovephase hook to deny pull over mq patches. ''''' '''''MQ: can use a movephase hook to qfinish mqpatch made public. ''''' '''''QA: can use movephase hook to trigger build bot. ''''' '''''SYNC: can use movephase hook to trigger build bot. ''''' == Implementation status == === Done === * '''Basic phase mechanism'''. phase summary can be move and are properly save in mercurial transaction. * '''Public and draft phases'''. Changeset are committed as draft. Changeset are set as public on exchange operation. * phase.publish option to control if echanged changeset are set in the public phase. * Exchange phase boundary exchange === Implementation in progress === * secret phase === To be implemented === * display phase data in log + template support * phase command to view and alter changeset phase * revset support * clone support * '''Public changeset are immutable''', rebase and mq refuse to delete them. * Hide secret changeset in hgweb graft. * have ui.status about phase movement. * Hide secret changeset from outgoing. === Still in discussion === * Hooks on Phase movement * phases number: 0,1,2 or 0,10,20 keep room for future addition ? |
Line 241: | Line 542: |
CategoryDeveloper | ''''' CategoryDeveloper ''''' |
States Plan (aka Phases)
This page is intended for developers
For more generic details check MutableHG page
Contents
1. Abstract
The phase concept aims to introduce a clear distinction between the part of the history that can't be rewritten and the part of history that you can safely rewrite.
A basic usage of phases will track changesets on which you have full control because they exist only in your repository, and will prevent you to rewrite the other changesets that have been shared.
A more advanced usage of phases will allow to exchange a part of the history and yet keep it mutable. The actual rewritting of the existing changesets is covered by MutableHG. This will ease the setup of test or review processes.
The phase concept also adds a layer to help people to control what changesets they exchange with others.
2. related discussion
State of changeset when no data are available:
* http://selenic.com/pipermail/mercurial-devel/2012-January/036745.html
Phase name and property name:
* http://selenic.com/pipermail/mercurial-devel/2011-December/036632.html * this wiki page * constraint got property name here: http://selenic.com/pipermail/mercurial-devel/2012-January/036760.html
3. Changeset phases
A 'changeset phases' is an indicator that tells us how a changeset is manipulated and communicated. The details of each phase is described below, here we describe the properties they have in common.
Like bookmarks, phases are not stored in history and thus are not permanent and leave no audit trail.
First, no changeset can be in two phases at once. Phases are ordered, so they can be considered from lowest to highest. The default, lowest phase is 'public' - this is the normal phase of existing changesets. A child changeset can not be in a lower phase than its parents.
The proposed phases are:
public < draft < secret
These phases share a hierarchy of traits:
|
immutable |
shared |
public |
x |
x |
draft |
|
x |
secret |
|
|
These names are subject to change. See the Naming section.
3.1. Usage
The current phase of changesets is displayed in the log with a 'phase:' header. The public phase is not displayed at all.
There are corresponding revset predicates for each phase as well.
Most phase change will be made automatically by standard mercurial command. Read each phase section for details.
States are manipulated via the 'hg phase' command, ie 'hg phase -p x' to mark x as public. Each phase puts its own constraints on how it can be manipulated.
3.2. Implementation
Contexts provide a phase() method that returns the current phase as an integer. Contexts provide a phasestr() method that returns the current phase as a string.
public phase is stored as 0, draft phase is stored 1, secret phase is stored 2
4. Public changesets
Public changesets are changesets that are considered permanently immutable. This matches the history model presented by legacy Mercurial: changesets cannot be changed or removed without using history editing features from extensions. The name reflect the fact that once a changeset is published, it becomes very difficult to remove it from distributed history.
4.1. Usage
Changeset are automatically set in the public phase when they are known to exist outside the repository. This behaviour can be altered using the publish option describe in the draft changeset section.
- changeset pulled are public
- changeset pushed are public
- changeset unbundled are public
Changeset can be manually moved to the public phase via 'hg phase --public X'.
Tagged changeset should probably be public
4.2. implementation
4.3. Legacy clients
N/A
5. draft changesets
draft changesets are changesets that the user is permitted to use history-modifying operations like rebase or mq on. They may not be tagged. They may also be thought of as 'unpublished'.
Various operations such as pushing to other repo will move changesets into the public phase. Changesets cannot be moved from public to draft without a forcing operation.
A phase.publish option will allow exchange of changeset without moving them in the public phase.
- Changeset adding changeset to a phase.publish=True server are set public. This is the default.
- Changeset adding changeset to a phase.publish=False server do not alter their phase. This allows people to collaborate on work in progress before it becomes finalized.
The phase of changesets are communicated between compatible servers and clients.
This should generally be engineered such that users don't have to give any additional thought to draft vs public in their day-to-day usage.
5.1. Usage
Changesets must be in a draft phase when they are created. So new commits will start in a draft phase by default.
5.2. Implementation
The set of draft changesets is stored as a list of draft roots. All descendants of these roots are draft (or in a higher phase). This set is known as the 'draft barrier' and defines the 'draft set'. This barrier is intended only to advance.
The draft barrier is communicated via the pushkey protocol to servers that support it. The client is responsible for advancing the barrier on both the client and server sides. On each operation, the client reduces the phase set on both sides to the intersection of the sets on the client and server. That is, if a changeset is frozen on either side, it becomes frozen on both sides.
By default servers are configured as 'publishing servers'. Legacy servers are publishing servers by default. These are recognized by not having the 'phase' pushkey namespace. When pushing to a publishing server, all pushed changesets are moved into the public phase on the client. Similarly, all changesets pulled from a publishing server are treated as public
See this conversation for details: http://selenic.com/pipermail/mercurial-devel/2011-December/036279.html
5.3. Legacy clients
See above for pushing to legacy clients. Legacy clients are allowed to pull draft changesets, though a round trip will make them public.
6. secret changesets
secret changesets are changesets that are not visible to remote clients. This is useful to mark work private and to avoid inadvertently publishing changesets.
6.1. Usage
secret changesets (and their descendents) can be marked with 'hg phase --secret'.
Some extension as mq may automatically set changeset as secret.
6.2. Implementation
Like draft changesets, secret changesets are implemented via a local barrier set. This set is used to filter changesets from remote clients for push/pull/hgweb.
6.3. Legacy clients
Legacy clients cannot see secret changesets so will not pull them. New clients will not push secret changesets by default
7. More phase
No more phase are planned, in particular no trash phase:
The big issue with using phase mechanism to mark changeset as dead is that phase define set of node which are consistent regarding rules that are problematic for the dead semantic. A golden semantic of phase are """all descendant of a changeset in phase X have the same property that this changeset""". We **can not** alter this:
- (1) We created phase for this exact semantic and we need it for all other usage. (2) The way phase are store and exchanged rely on this [1].
Having a trash phase means that a Y changeset children of X may be seen dead "by mistake" just because X was marked as dead. This is very problematic if X is added **after** the data than X is dead was created.
Small local example bellow:
- $ hg pull Getting changeset "X" You look "X" and decide it's a silly changeset $ hg trash "X" Getting patch-y.diff from your mailbox patch-y.diff is a nice bugfix and you want it by chance, patch-y.diff have been created above "X" $ hg import --exact patch-y.diff As "Y" is a descendant of "X", "Y" is trashed too!
This example highlight that phase are not very well suited to store "dead" information.
We may have dedicated logic that detect new changesets is added on top of trashed one and handle this situation. But phase for dead still have two major issue!
1) It prevents any sane synchronisation of this trash phase with the outside. As phase only store un-versionned data, you have no way to know if the trash phase information were created before or after a changeset was committed. This means that you can't decide if the trash data really applies to changeset when you transfers it.
2) Adding non-trashed changeset on top on trashed changeset means moving the boundary and loosing the information that the parents trashed changeset are "not welcome in the final history".
To conclusion about trash phase:
- You can not safely exchange data about "dead" changeset through phase.
- Loosing most of point of having this data.
- Safely using phase locally mean loosing the "dead" information whenever
- there is a conflict.
8. Naming
Until these names are officially nailed down, please use the names above for discussion of the concepts.
The 'state' name is somewhat problematic because it's rather overloaded already and conflicts with hg stat. Thus 'phase' is maybe a better choice.
The 'frozen/liquid/local/dead' name set is also not ideal. The ideal set of state names will:
- all be on an obvious continuum or theme
- not have conflicting initials (liquid vs local) (why?)
- not conflict with common options (frozen vs force)
- not insist too much on a particular workflow (review)
- imply that changesets states can be moved only in one direction (frozen vs public)
- shouldn't conflict with the attributes of states (mutable, hidden)
- have fairly obvious semantics
Current proposals:
public < draft < secret < ? (public implicates publishing: push or pull)
finished < draft < private (a draft gets finished)
final < liquid < private (a liquid reaches a final state)
Discarded proposals:
public < local < secret < ? (local is a bit misleading, doesn't imply mutability)
public < local < private (initials clash)
published < review or ready < draft < trash (review or ready is a bit too much workflow)
published < mutable < private < discarded (mutable isn't thematic, conflicts with attributes)
public < draft < private < deprecated (name clash)
public < liquid < private < deprecated (liquid strongly implies mutability and unfinishedness) (but name clash)
From a UI perspective, transitions between states are done either implicitly or by setting a state. So there should be no 'freeze' or 'publish' verb in the UI.
9. Controlling and Hooking on phase movement
9.1. Basic use case for control
I expect this use case to be a good mirror of how we would like to use phase at Logilab or for mercurial development. There is three kind of people working with three kind of repo.
9.1.1. People
This scenario includes three kind of people :
:Developer: are part of a team that write code and create changesets. They are not allowed to validate their own changesets.
:Admin: are part of the same team but are allowed to validate changeset.
:External: are not part of the team and not allowed to validate their changesets either.
Note: In this team, only a few people are allowed to validate changesets, but it could be different, for example with everyone but the interns allowed to validate their changesets.
9.1.2. Repositories
This scenario includes three kind of repositories:
:Public: is the repository used as a reference for the project. The phases.publish option is set to True. Only people:admin people are allowed to push to it. For the mercurial project it is the repository available at http://selenic.com/hg
:Devel: is a development repository used by the team. The phases.publish option is set to False. Both people:developer and people:admin can push to it but only people:admin are expected to move the phase boundary to mark changeset as validated and immutable.
:External: repositories created and controlled by people:external where they work on their contributions. The core development team can only pull contribution from these repositories.
9.1.3. Vocabulary
- people:XXX refer to a mercurial user in the group XXX described above.
- repo:XXX refer to a mercurial repositoty in the group XXX described above.
- phase:Public refer to the 0 phase in which a changeset are immutable
- ctx:X refer to a changeset "X"
9.1.4. Expectation
We need a way to make sure no changeset can be set in the phase:Public by something else than being pushed into repo:Public.
Below are some variants:
- We don't expect changeset of repo:Devel to in the public phase if they are not in repo:Public.
- Phase movement on repo:Devel should be only accepted is triggered by
- pull from repo:Public
- push from people:admin
- Changeset in a repo owned by people:devel of people:external should not be
- set in public phase by something else than pulling from repo:Public or repo:Devel
9.1.5. Examples
9.1.5.1. Example A
- (1) A people:Admin push ctx:A to repo:Public (2) A people:Developer pull ctx:A from repo:Public. The phase:Public boundary
- move to ctx:A
- to ctx:A
- A.1 MUST be allowed
- A.2 MUST be allowed
- A.3 COULD be allowed
9.1.5.2. Example B
- (1) A people:Developer make ctx:B public locally. (2) The same people:Developer push ctx:B to Dev. Public phase boundary move
- to B
- B.1 COULD be denied. But we can't ensure it is.
- B.2 MUST be denied
9.1.5.3. Example C
- (1) A people:Developer set ctx:C as phase:Public locally (2) A people:Admin pull from people:Developer which have publish=False.
- phase:Public boundary move to ctx:C
- boundary move to ctx:C
- C.1 COULD be denied. But we can't ensure it is
- C.2 and C.3 are more about people:Admin making a mistake. But if the phase
- movement is silent enough it won't be hard to make such mistake.
9.1.5.4. example D
- (1) A people:Developer set ctx:D as phase:Public locally (2) Another people:Developer pull from the first one. The phase:Public
- boundary move to ctx:D
- move to ctx:D
- D.1 COULD be denied. But we can't ensure it is.
D.2 CAN NOT be denied if we what to stay a Decentralized VCS (make it
- clear by having a case where dev exchange valid public changeset)
- D.3 SHOULD be denied (this conflict with A.1)
9.1.5.5. example E
- (1) A people:External push ctx:E to it's repo:External repo using old
- mercurial version.
- boundary move to ctx:E
- phase:Public boundary move to ctx:E.
- E.1 Is not something we have control on.
- E.2 SHOULD not deny the pull but COULD deny the phase movement (at least warn about it)
- E.3 MUST be denied
9.2. Basic use case for hook
MQ: When mq managed changeset move from secret to ready (or public) mq will want to detect it and either:
- qfinish those patches
- Abort the transaction
QA: When changeset are made public QA bot may trigger.
SYNC: When changeset are made public in a devel repo, automatic sync with public repo may be wished.
9.3. Hook change
The way to controll this should be through hooks. Bellow is proposal
9.3.1. existing hook change
The following existing hooks familly might receive and additional argument about phase:
- changegroup
- commit
- incoming
- outgoing
They would receive a new argument $IN_PHASE. This argument hold a generic information about the maximum phase the changeset may take.
This would affect the following case:
- hg unbundle into publish == True
changegroup IN_PHASE=0
incoming IN_PHASE=0
- hg unbundle into publish == False
changegroup IN_PHASE=1
incoming IN_PHASE=1 (I'm not sur bundle should be seens as public or not…)
- hg pull into publish == True from publish == True
changegroup IN_PHASE=0
incoming IN_PHASE=0
remote outgoing IN_PHASE=0
- hg pull into publish == True from publish == False
changegroup IN_PHASE=0
incoming IN_PHASE=1
remote outgoing IN_PHASE=1
- hg pull into publish == False from publish == True
changegroup IN_PHASE=1
incoming IN_PHASE=0
remote outgoing IN_PHASE=0
- hg pull into publish == False from publish == False
changegroup IN_PHASE=1
incoming IN_PHASE=1
remote outgoing IN_PHASE=1
- hg push from publish == True to publish == True
remote changegroup IN_PHASE=0
outgoing IN_PHASE=0
remote incoming IN_PHASE=0 (how should it know about it ?)
- hg push from publish == True to publish == False
remote changegroup IN_PHASE=1
outgoing IN_PHASE=0
remote incoming IN_PHASE=0 (how should it know about it ?)
- hg push from publish == False to publish == True
remote changegroup IN_PHASE=0
outgoing IN_PHASE=0
remote incoming IN_PHASE=1 (how should it know about it ?)
- hg push from publish == False to publish == False
remote changegroup IN_PHASE=1
outgoing IN_PHASE=1
remote incoming IN_PHASE=1 (how should it know about it ?)
- hg commit
commit IN_PHASE=1
- hg commit --phase=secret
commit IN_PHASE=2
- hg commit --phase=public
commit IN_PHASE=0
9.3.2. Introducing new hook
I can see two new possible famillies of hook:
movephase: triggered when we move phase boundary locally.
- reason (commit, pull, push, remotepush, addchangegroup, unbundle, other (eg manual))
- source
- publishing_source (true or false)
- user ?
- boundaries (how to serialise this for shell…)
pushphase: triggered when we send phase data to remote
- source (aka remote)
- user ?
- publishing_source (true or false)
- boundaries (how to serialise this for shell…)
Stuff that move phase:
- hg commit
- hg pull
- hg push
- hg unbundle
The more details we can get about phase movement is
- hg commit:
- premovephase with precommit
- prexnmovephase with prexncommit
- movephase with commit
- hg pull
- (Do we need something before discovery ?)
premovephase
- before we pull changegroup (move computed using remote phase data) ? (phase did not moved yet)
prexnmovephase after we pull changegroup (move computed using remote phase data).
- (phase did moved)
movephase after changegroup have been added (or nothing to pull)
- (phase did moved)
- hg push
- (Do we need something before discovery ?)
premovephase before we push changegroup (move computed using remote phase data ?)
prepushphase before we push changegroup (move computed using remote phase data ?)
prexnmovephase after we pushed changegroup but before we push phases to remote
prexnpushphase after we pushed changegroup but before push phases to remote
pushphase after changegroup have been pushed (or nothing to push) and phase pushed
movephase after changegroup have been added (or nothing to pull) and phase pushed
9.4. Solving usecase with Hook
9.4.1. Control usecase
repo:Devel set a premovephase hook that deny phase move for user outside people:Admin. This deny A.1, B.2, D.3, E.3
people:Admin set a movephase hook that display warning when movephase is called on something else than:
local operation
pull from repo:Public
push to repo:Public
people:Developer set a movephase hook that display flashy warning when movephase is called on something else than:
pull from repo:Public
people:Developer set a pushphase hook that display flashy warning in all case: case requirement behavior A.1 MUST be allowed works A.2 MUST be allowed works A.3 COULD be allowed denied (because D.3 and E.3) B.1 COULD be denied flashy warning it's not too late B.2 MUST be denied denied C.1 COULD be denied flashy warning it's not too late C.2 COULD be denied flashy warning it's not too late C.3 MUST be denied allowed D.1 COULD be denied flashy warning it's not too late D.2 Wrong flashy warning it's not too late D.2' Wrong flashy warning contact the other dev D.3 MUST be denied denied E.1 No control No Control E.2 DENY phase move flashy warning fixable locally E.3 MUST be denied denied
9.5. other case
MQ: can use a prexnmovephase hook to deny pull over mq patches.
MQ: can use a movephase hook to qfinish mqpatch made public.
QA: can use movephase hook to trigger build bot.
SYNC: can use movephase hook to trigger build bot.
10. Implementation status
10.1. Done
Basic phase mechanism. phase summary can be move and are properly save in mercurial transaction.
Public and draft phases. Changeset are committed as draft. Changeset are set as public on exchange operation.
- phase.publish option to control if echanged changeset are set in the public phase.
- Exchange phase boundary exchange
10.2. Implementation in progress
- secret phase
10.3. To be implemented
- display phase data in log + template support
- phase command to view and alter changeset phase
- revset support
- clone support
Public changeset are immutable, rebase and mq refuse to delete them.
- Hide secret changeset in hgweb graft.
- have ui.status about phase movement.
- Hide secret changeset from outgoing.
10.4. Still in discussion
- Hooks on Phase movement
- phases number: 0,1,2 or 0,10,20 keep room for future addition ?