Git best practices

Commit early

Make your first commit after you’ve finished the initial installation and before you make your first changes. For a cookie cutter template, for example, proceed as follows:

$ uv run cookiecutter https://github.com/veit/cookiecutter-namespace-template.git
full_name [Veit Schiele]:
email [veit@cusy.io]:
github_username [veit]:
project_name [cusy.example]:

These initial changes can then be checked in with:

$ cd cusy.example
$ git init
$ git add *
$ git add .gitignore
$ git commit -m 'Initial commit'
$ git remote add origin ssh://git@github.com:veit/cusy.example.git
$ git push -u origin main

Exclude undesired files

Temporary files, jupyter checkpoint folders and builds have no business in a git repository. Credentials do not either. The .gitignore file contains a list of paths that git will not add unless you ask for it explicitly.

You can find a template .gitignore file for Python projects in the dotfiles repository. The gitignore.io website contains .gitignore files for other programming languages. The .gitignore file itself should be checked in, too:

$ git add .gitignore
$ git commit -m 'add .gitignore file'

If you have accidentally checked undesired files into your Git repository, you can remove them again with:

$ git rm -r .ipynb_checkpoints/

Write a README

Each repository should also have a README.rst file that describes the deployment and the basic structure of the code.

Commit often

Each completed task and subtask should be immediately followed by a commit. Incomplete work also may be stored on git. As a rule of thumb you should commit at least daily before leaving work. In busy times it is common to commit every 10 minutes.

Frequent commits make it easier for you to:

  • isolate errors

  • understand the code

  • maintain the code in the future

If you have made several changes to a file, you can split them up into several commits later with:

$ git add -p my-changed-file.py

Don’t change the published history

Even if you later find out that a commit that has already been published with git push contains one or more errors, you should never try to undo this commit. Rather, you should fix the error that have occurred through further commits.

Warning

Workflows with git rebase are a reasonable exception to this rule.

Choose a Git workflow

Choose a workflow that fits best to your project. Projects are by no means identical and a workflow that fits one project does not necessarily have to fit in another project. A different workflow can be recommended initially than in the further progress of the project.

Write meaningful commit messages

Git commit messages

xkcd: Git Commit

Merge branch ‘asdfasjkfdlas/alkdjf’ into sdkjfls-final

By creating insightful and descriptive commit messages, you make working in a team a lot easier. They allow others to understand your changes. They are also helpful at a later point in time to understand which goal should be achieved with the code. A diff can tell you exactly what has changed, but the commit message can also tell you why.

Just as the Python Style Guide defines conventions for naming, formatting, etc., a team should also agree on conventions for commit messages. These should at least define style, content and metadata:

Style

markup syntax, grammar, capitalisation and punctuation.

Content

What content should the body of the commit message contain? And what should it not contain?

Metadata

How should issue IDs, pull requests etc. be referenced?

Fortunately, there are already established conventions as to what constitutes a typical Git commit message. So you don’t have to reinvent anything. Just follow these seven rules and you’ll be on the right track:

  1. Separate the subject from the text with a blank line.

  2. Limit the subject line to 50 characters.

  3. Capitalise the subject line.

  4. Do not end the subject line with a full stop.

  5. Use the imperative in the subject line.

  6. Limit the body text to 72 characters per line.

  7. Explain the what and why in the body text, not the how.

See also

Gitmojis

If you use gitmojis in your commit messages, you can easily see the intent of the commit later.

GitLab-specific references

GitLab also interprets certain commit messages as links, for example:

$ git commit -m "Awesome commit message (Fix #21 and close group/otherproject#22)"
  • links to issues: #NUMBER

  • links to issues in other projects: GROUP/PROJECT#NUMBER

  • links to merge requests: !NUMBER

  • links to snippets: $NUMBER

There should be at least one ticket for each commit that should provide more detailed information about the changes.

There should be at least one ticket for each commit, which should contain more detailed information about the changes. Alternatively, you can also write multi-line commit messages containing this information, for example with:

$ git commit -m 'Expand section on meaningful commit messages' -m 'Fix the serious problem'

Or, if you just enter git commit, your editor will open, for example with the following text:

# Please enter the commit message for your changes. Lines starting
# with '#' will be ignored, and an empty message aborts the commit.
#
# On branch main

Git expects you to insert your commit message at the beginning of the file. After you have finished editing the file, Git reads its contents and continues. It cleans up the file by removing lines commented with # and subsequent empty lines. If the message is empty after cleaning up, Git cancels the commit – this is useful if you realise that you have forgotten something. Otherwise, the commit is created with the remaining content. However, GitLab uses # as a prefix for the number of an item. This double meaning of # can lead to confusion if you write a commit message that refers to an item:

Expand section on meaningful commit messages
#21: Add multi-line commit messages
# Please enter the commit message for your changes. Lines starting
# with '#' will be ignored, and an empty message aborts the commit.
#
# On branch main
# Changes to be committed:
#       modified:   productive/git/best-practices.rst
#

Git usually removes the line starting with #21 so that the message looks like this:

Expand section on meaningful commit messages

Avoid this mishap by using an alternative clean-up mode called Scissors. You can activate it globally with:

$ git config --global commit.cleanup scissors

Git then starts each new commit message with the Scissors line:

# ------------------------ >8 ------------------------
# Do not modify or remove the line above.
# Everything below it will be ignored.
#
# On branch main
# ...
#

Specify co-authors

If you are working on a commit with a team member, it’s good to acknowledge their contribution with the co-authored-by trailer. Trailers are additional metadata at the end of the commit message that use a KEY: VALUE syntax and can be repeated to list multiple values:

Expand section on meaningful commit messages
#21: Add multi-line commit messages
co-authored-by: Kristian Rother <kristian.rother@cusy.io>
co-authored-by: Frank Hofmann <frank.hofmann@cusy.io>

GitLab analyses the co-authored-by lines to display all avatars of the commit and also to update the profile statistics of the co-authors, etc..

Maintain your repository regularly

You should perform the following maintenance work regularly:

Validate the repo

The command git fsck checks whether all objects in the internal data structure of git are consistently connected with each other.

Compresses the repo

Save storage space with the command git gc or git gc --aggressive.

Clean up remote tracking branches

Unused branches on a server can be removed with git remote update --prune. It is even better if you change the default setting so that remotely deleted branches are also deleted locally with git fetch and git pull. You can achieve this with:

$ git config --global fetch.prune true

Check forgotten work

Display a list of saved stashes with git stash list. They can be removed with git stash drop.

Check your repositories for unwanted files

With Gitleaks you can regularly check your repositories for unintentionally saved access data.

You can use Gitleaks with the pre-commit framework by entering the following in the .pre-commit-config.yaml file:

repos:
  - repo: https://github.com/gitleaks/gitleaks
    rev: v8.21.1
    hooks:
      - id: gitleaks

Note

To deactivate the Gitleaks hook, you can prefix it with SKIP=Gitleaks so that Gitleaks is not executed:

$ SKIP=gitleaks git commit -m "Add secret"
Detect hardcoded secrets................................................Skipped

Alternatively, you can also append the gitleaks:allow comment to a line, for example:

class MyClass:
    client_secret = "Srtub6pZcTSET9V4vUpUg7xPi64sh3NV"  #gitleaks:allow

With git-filter-repo you can remove unwanted files from your Git history.