Size: 660
Comment: First minimal version of Scale Mercurial.
|
Size: 2212
Comment: Add section on many commits
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
Mercurial can scale from single-developer Projects up to massive Codebases and huge developer teams. | Mercurial can scale from single-developer projects up to massive codebases and huge developer teams. |
Line 5: | Line 5: |
Scaling is not a problem with a single root cause. Instead, there are various patterns that can lead to separate scaling issues. == Concern: Many commits == For active repositories, the number of commits/changesets over time approaches infinity. This poses some scaling problems. A standard Mercurial install and clone maintain a full copy of the repository and all of its history. This is similar to how other distributed version control systems (like Git) work. === Impact === * Repositories take longer to clone and pull (because they have more data) * Iterating over all commits takes longer (because there are more) === When to expect problems === Scaling due to number of commits alone likely won't be a significant issue by itself. Instead, you'll likely hit issues dealing with manifests or file data size first. Mercurial repositories with a few hundred thousand commits exist. As of April 2014, Mozilla's [[https://hg.mozilla.org/mozilla-central/|mozilla-central]] repository is close to 200,000 commits with no apparent scaling problems due to commit volume alone. There are known to be private repositories at other companies that have over 100,000 additional commits and they don't have scaling problems. === Solutions === * Install the[[https://bitbucket.org/facebook/remotefilelog|remotefilelog]] extension to help reduce clone and pull times. * Install a modern Mercurial version. Scaling issues are always being addressed and running the newest stable release is a good bet to have the best performance. |
Mercurial can scale from single-developer projects up to massive codebases and huge developer teams.
For an example of a large-scale deployment, you can check the recent writeup by Durham Goode from Facebook: Scaling Mercurial at Facebook.
Scaling is not a problem with a single root cause. Instead, there are various patterns that can lead to separate scaling issues.
Concern: Many commits
For active repositories, the number of commits/changesets over time approaches infinity. This poses some scaling problems.
A standard Mercurial install and clone maintain a full copy of the repository and all of its history. This is similar to how other distributed version control systems (like Git) work.
1. Impact
- Repositories take longer to clone and pull (because they have more data)
- Iterating over all commits takes longer (because there are more)
2. When to expect problems
Scaling due to number of commits alone likely won't be a significant issue by itself. Instead, you'll likely hit issues dealing with manifests or file data size first.
Mercurial repositories with a few hundred thousand commits exist. As of April 2014, Mozilla's mozilla-central repository is close to 200,000 commits with no apparent scaling problems due to commit volume alone. There are known to be private repositories at other companies that have over 100,000 additional commits and they don't have scaling problems.
3. Solutions
Install theremotefilelog extension to help reduce clone and pull times.
- Install a modern Mercurial version. Scaling issues are always being addressed and running the newest stable release is a good bet to have the best performance.
Summary: Extensions allow scaling Mercurial with minimal maintenance overhead: hgwatchman makes hg status 5x faster and remotefilelog speeds up pull, push and clone by factor 10 and integrates them with the local memcached server.