CI Insights Setup – Buildkite

Enable Mergify CI Insights and configure flaky test detection using Buildkite.

This page explains how to enable CI Insights for a repository using Buildkite and how to configure repeated test executions for flaky test detection.

Enabling CI Insights for Buildkite

Enable CI Insights on your repositories by visiting the GitHub Integration page (docs).
Create an application key with ci scope as described in the API Usage documentation. This key will be used both for the webhook authentication and for uploading test reports.

Configure a webhook in Buildkite to send build and job events to Mergify. Go to your Buildkite Organization Settings → Integrations → Notification Services and add a new Webhook notification with the following settings:

Webhook URL: set it to:
```
https://api.mergify.com/v1/ci/<org>/repositories/<repo>/buildkite/webhooks
```
Replace <org> with your Mergify organization name and <repo> with your repository name.
Token: paste your Mergify application key and select Send the token as X-Buildkite-Token.

Events: enable the following events:

Event	Description
`ping`	Webhook notification settings have changed
`build.scheduled`	A build has been scheduled
`build.running`	A build has started running
`build.failing`	A build has started failing
`build.finished`	A build has finished
`build.skipped`	A build has been skipped
`job.scheduled`	A job has been scheduled
`job.started`	A command step job has started running on an agent
`job.finished`	A job has finished
`job.activated`	A block step job has been unblocked

Buildkite Webhook Notification configuration

Set the application key as MERGIFY_TOKEN in your Buildkite pipeline environment variables. This will be used by the Buildkite plugin to upload test reports.
Click on CI Insights in the Mergify dashboard navigation. You should start seeing your Buildkite job runs appear:

Uploading Test Results

Use the mergifyio/mergify-ci Buildkite plugin to upload JUnit XML test reports:

steps:
  - label: "Run tests"
    command: pytest --junitxml=reports/junit.xml
    plugins:
      - mergifyio/mergify-ci#v2:
          action: junit-process
          report_path: "reports/*.xml"
          token: "${MERGIFY_TOKEN}"

The plugin runs in the post-command hook, so your test command executes first, then the plugin uploads the results. Even if your tests fail, the results are still uploaded.

For frameworks with native Mergify integration (e.g., pytest-mergify), you only need to set the MERGIFY_TOKEN environment variable. No plugin is required.

Setting Up Flaky Test Detection

To detect flaky tests, you need to run the same tests multiple times on the same code. This section explains how to set up your Buildkite pipeline for repeated test execution.

Prerequisites

Before setting up flaky test detection, ensure you have:

Enabled CI Insights for your repository
Created an application key with ci scope and set it as MERGIFY_TOKEN in your Buildkite pipeline environment variables
Configured test integration for your test framework

Buildkite Setup

Use a scheduled build that runs your test suite multiple times on the default branch. Here’s an example pipeline that runs tests with 5 repeated executions:

steps:
  - label: "Flaky test detection"
    command: |
      set +e
      failed=0
      for i in $(seq $${RUN_COUNT:-5}); do
        echo "--- Run $$i of $${RUN_COUNT:-5}"
        pytest --junitxml=reports/junit-$$i.xml tests/
        if [ $$? -ne 0 ]; then
          failed=1
        fi
      done
      exit $$failed
    plugins:
      - mergifyio/mergify-ci#v2:
          action: junit-process
          report_path: "reports/*.xml"
          token: "${MERGIFY_TOKEN}"

Configure a schedule in your Buildkite pipeline settings to run this every 12 hours on weekdays.

Configuration Tips

Frequency: Running twice daily (every 12 hours) provides a good balance between detection accuracy and resource usage
Default Branch Only: Focus on the default branch where flaky tests have the most impact
Weekday Schedule: Run Monday to Friday to avoid running when no changes are made on the code
Job name: Ensure the step label matches the one running tests on your pull request, allowing CI Insights to identify them. It can be overridden with the job_name plugin property if needed.

Test Framework Configuration

See the test framework guides for framework-specific instructions on generating JUnit XML reports and uploading them with both GitHub Actions and Buildkite.

Was this page helpful?

Edit this page on GitHub

Scopes

CI Setup

Test Frameworks Setup

Compare Tools

Actions

Reference