Bisection is a well-known method for locating the commit that caused a regression in a code repository, and git-bisect is a widely used tool for this in git repositories. However, the failure is often probabilistic in nature: either we lack a reliable reproducer, so failing to reproduce the problem on a particular commit does not mean the problem is absent there, or the measurement is inherently variable, as with performance regressions. Bisecting such failures is problematic because a single false result is enough to send the bisection into an unrelated part of the code history. In these cases we usually have to greatly extend the runtime of the reproducer, repeat test runs, or repeat whole bisection runs to minimize the chance of error.
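
To get a feel for how quickly the risk grows, here is a rough back-of-the-envelope sketch in Python. It assumes, as a simplification, that every test gives a wrong answer independently with the same probability and that plain bisection over n commits needs ceil(log2(n)) tests. The function name is made up for illustration.

```python
def plain_bisect_success_probability(n_commits: int, error_rate: float) -> float:
    """Chance that plain bisection lands on the right commit.

    Assumptions (illustrative only): each test result is wrong independently
    with probability `error_rate`, and a single wrong result is enough to
    derail the whole bisection.
    """
    if n_commits < 1:
        raise ValueError("need at least one commit")
    if not 0.0 <= error_rate <= 1.0:
        raise ValueError("error_rate must be a probability")
    # Number of halving steps needed to isolate one commit: ceil(log2(n)).
    steps = (n_commits - 1).bit_length()
    # Every step has to give the correct answer.
    return (1.0 - error_rate) ** steps


if __name__ == "__main__":
    for rate in (0.01, 0.05, 0.1, 0.3):
        p = plain_bisect_success_probability(1024, rate)
        print(f"error rate {rate:.2f}: success probability {p:.3f}")
```

Under these assumptions, a 10% error rate over 1024 commits (10 tests) leaves only about a 35% chance that an unmodified bisection finishes on the right commit.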

The aim of this project is to implement stochastic bisection for git, i.e., a method that accounts for the fact that test results at each point in the code history have some error rate, and that suggests which points in the history to test so as to find, in the smallest possible number of tests, the commit that with high probability introduced the regression. We can then use this method to bisect performance problems in our performance testing grid, Marvin.
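
One possible shape for such a method is a Bayesian search, sketched below in Python. This is only an illustration under stated assumptions, not the project's actual design: commits are numbered 0..n-1, exactly one of them introduced the regression, and the two error rates (`p_bad_given_regressed`, `p_bad_given_clean`) are known in advance. All function names and default rates are hypothetical. The "test the posterior median" rule is a common heuristic and is not claimed to be optimal.

```python
import random


def update(posterior, tested, observed_bad,
           p_bad_given_regressed=0.7, p_bad_given_clean=0.05):
    """Bayes update of P(first bad commit == c) after testing commit `tested`."""
    weights = []
    for first_bad, prior in enumerate(posterior):
        # Under this hypothesis, does the tested commit contain the regression?
        regressed = first_bad <= tested
        p_bad = p_bad_given_regressed if regressed else p_bad_given_clean
        likelihood = p_bad if observed_bad else 1.0 - p_bad
        weights.append(prior * likelihood)
    total = sum(weights)
    return [w / total for w in weights]


def next_commit(posterior):
    """Heuristic: test the commit whose cumulative posterior is closest to 1/2."""
    best, best_gap, cumulative = 0, float("inf"), 0.0
    for commit, weight in enumerate(posterior):
        cumulative += weight
        gap = abs(cumulative - 0.5)
        if gap < best_gap:
            best, best_gap = commit, gap
    return best


def stochastic_bisect(n_commits, run_test, confidence=0.99, max_tests=1000, **rates):
    """Return (most likely culprit, its posterior probability, tests used).

    `run_test(commit)` must return True when the test reports a failure.
    """
    posterior = [1.0 / n_commits] * n_commits  # uniform prior over culprits
    culprit, tests = 0, 0
    while tests < max_tests:
        commit = next_commit(posterior)
        posterior = update(posterior, commit, run_test(commit), **rates)
        tests += 1
        culprit = max(range(n_commits), key=posterior.__getitem__)
        if posterior[culprit] >= confidence:
            break
    return culprit, posterior[culprit], tests


if __name__ == "__main__":
    # Simulated flaky reproducer: regression introduced in commit 37 of 100.
    rng = random.Random(0)

    def flaky_test(commit):
        return rng.random() < (0.7 if commit >= 37 else 0.05)

    print(stochastic_bisect(100, flaky_test))
```

The point of the sketch is that no single test result is ever treated as final: each result only shifts probability mass between candidate commits, and testing stops once one candidate holds enough of it. In practice the error rates would have to be estimated rather than assumed.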

Goals for this Hackweek:

  • research the state of the art in stochastic problem finding (a method used in various fields of engineering)
  • design an algorithm that computes the next point in history to test, given previous test results and their confidence
  • research how git-bisect works internally
  • integrate the algorithm with git-bisect

Looking for hackers with the skills:

git, bisection, performance, statistics

This project is part of:

Hack Week 20

Activity

  • about 3 years ago: jankara started this project.
  • about 3 years ago: jankara added keyword "statistics" to this project.
  • about 3 years ago: jankara added keyword "git" to this project.
  • about 3 years ago: jankara added keyword "bisection" to this project.
  • about 3 years ago: jankara added keyword "performance" to this project.
  • about 3 years ago: jankara originated this project.
