Differences between revisions 3 and 4
Revision 3 as of 2012-12-10 22:09:22
Size: 3130
Editor: mpm
Comment:
Revision 4 as of 2013-03-25 18:37:28
Size: 3860
Comment: Some more thoughts and about some feedback
Deletions are marked like this. Additions are marked like this.
Line 17: Line 17:
 * (what is the third??)
Line 20: Line 21:
 * Despite usually using backslash, Windows internally will always accept slash instead of backslash  * Despite usually using backslash, Windows internally will always accept slash instead of backslash <<FootNote(Exceptions see [[#Notes|above]])>>
Line 67: Line 68:

Line 70: Line 73:
 * an automatic filter that accepts "\" for "/" on Windows (when we are not interested where \ is used)
Line 71: Line 75:

Questions to Tests:

 * Should we really test with ui.slash=on? Are we not more interested in the normal behavior?
Line 75: Line 83:
Considering how inconsistent Mercurial uses portable vs local slashes: Do Windows users actually rely on Mercurial emitting backslash? What are the usecases?  * Considering how inconsistent Mercurial uses portable vs local slashes:
  * Do Windows users actually rely on Mercurial emitting backslash?
   * One feedback in the forum was that "/" is irritating, but ok, as long as copy and paste works. (Also to other programs.)
  * What are the usecases?
  * Is there a use case for the setting ui.slash?
   * A user in the forum asked shortly how to do this. But he did not tell what for.
 * How much time does Mercurial spend normalizing paths and converting slashes over and over?
Line 77: Line 91:
How much time do Mercurial spend normalizing paths and converting slashes over and over? == Notes ==

 * Windows accepts / as path sep except
  * as start of a server path (//computer/path/to) in explorer (but mercurial accepts it)

Note:

This page is primarily intended for developers of Mercurial.

Path separator handling

How Mercurial handles path separators

1. Overview

This page is intended to collect information about Mercurial's current path separator behavior and help formulate coherent rules for when paths should be converted.

2. Basics

There are three basic path formats:

  • portable (with a forward slash)
  • local (slash on Unix, backslash on Windows)
  • (what is the third??)

OS-specific observations:

  • Despite usually using backslash, Windows internally will always accept slash instead of backslash 1

  • Backslashes are valid non-separator characters on Unix, which we should be able to store (though the repo may not be usable on Windows)
  • On Windows, ui.slash will cause some commands to output slashes

Basic rules:

  • paths stored in changelog and manifest should always be in portable format
  • paths read from user (command line, config files) may be in either portable or local format
  • URLs should always use portable format

APIs:

  • convert from native to portable with util.pconvert()
  • convert from portable to native with util.localpath()
  • util.normpath() converts to portable
  • repo.pathto() and dirstate.pathto() converts to local (honors ui.slash)
  • util.pathto() converts to local (ignores ui.slash)

3. Audit of current usage

As of Mercurial 2.4, we have the following behavior:

command

output

ui.slash

comments

hg add/copy/forget/rename/remove/revert

local

no

hg diff/export

portable

hg locate

portable

unusual for working directory command

hg manifest

portable

hg log

portable

hg resolve

portable

unusual for working directory command

hg status

local

yes

even if not checking working directory

hg clone/push/pull/in/out

local

no

shows normalized source

hg rebase/strip

local

no

reports path of backup bundle

Inferred rules:

  • commands for inspecting history should use portable format
  • commands for working directory should use local format
  • commands that default to the working directory (ie status) should use local format
  • diffs should use portable format... for portability

4. Tests

As of 2.4, the Mercurial test suite uses the '(glob)' filter to hide Windows-specific path changes. This has several downsides:

  • Unix developers usually fail to mark these paths when writing tests
  • doesn't detect accidental regressions
  • tedious to maintain

Some possible improvements:

  • a new filter '(path)' that normalizes local paths but complains about non-local paths
  • an automatic filter that accepts "\" for "/" on Windows (when we are not interested where \ is used)
  • more complete check-code rules to warn Unix devs about non-portable constructs

Questions to Tests:

  • Should we really test with ui.slash=on? Are we not more interested in the normal behavior?

5. Questions

  • Considering how inconsistent Mercurial uses portable vs local slashes:
    • Do Windows users actually rely on Mercurial emitting backslash?
      • One feedback in the forum was that "/" is irritating, but ok, as long as copy and paste works. (Also to other programs.)
    • What are the usecases?
    • Is there a use case for the setting ui.slash?
      • A user in the forum asked shortly how to do this. But he did not tell what for.
  • How much time does Mercurial spend normalizing paths and converting slashes over and over?

6. Notes

  • Windows accepts / as path sep except
    • as start of a server path (//computer/path/to) in explorer (but mercurial accepts it)


  1. Exceptions see above (1)

PathSeparator (last edited 2013-07-15 09:10:42 by SimonHeimberg)