# github-webhook-server > FastAPI webhook server for automating GitHub repository settings, pull request workflows, checks, releases, and log analysis. --- Source: introduction.md # Introduction `github-webhook-server` is a self-hosted FastAPI service that receives GitHub webhooks and turns them into repository and pull request automation. If you maintain several repositories and want one place to manage reviewer assignment, labels, checks, merge rules, cherry-picks, and release behavior, this is what the server is built for. You configure it once, connect repositories to it, and it applies the same workflow consistently across your GitHub organization. It is not just a passive webhook receiver. On startup, it reads a central `config.yaml`, applies repository settings and labels, updates protected branch rules, resets stale in-progress checks, and creates or updates webhooks for every configured repository. After that, each incoming event is routed to the right handler for PRs, reviews, comments, checks, status updates, and tag pushes. > **Note:** The webhook endpoint returns `200 OK` as soon as the payload is validated, then processes the event in the background. That keeps GitHub deliveries from timing out while the server clones repositories, runs checks, builds containers, or performs cherry-picks. ## What This Server Is For This project is a good fit for: - Teams maintaining multiple GitHub repositories and wanting one place to define automation. - Platform, release, or DevOps engineers who want consistent labels, branch protection, and PR policy across repos. - Projects that use `OWNERS` files and want reviewer and approver rules enforced automatically. - Maintainers who want user-facing PR commands such as `/retest`, `/approve`, `/cherry-pick`, and `/build-and-push-container`. ## What It Automates ### Across repositories At the repository level, the server can: - Create or update GitHub webhooks for the events you configure per repository. 
- Apply repository defaults such as delete-on-merge and auto-merge support. - Create standard labels and colors, including review labels, merge-state labels, size labels, and cherry-pick labels. - Configure protected branches and required status checks from your central configuration. - Support optional release behavior such as package publishing, container builds, and Slack notifications. ### On pull requests For pull requests, the server acts like a shared workflow layer. It can: - Post a welcome comment when a PR opens or becomes ready for review. - Create a tracking issue for a new PR and close it automatically when the PR is closed or merged. - Assign reviewers from `OWNERS` files, including path-specific `OWNERS` files inside the repository. - Add labels for PR size, target branch, merge conflicts, rebase-needed state, verification, hold/WIP state, review status, auto-merge, and cherry-pick requests. - Queue and run built-in checks such as `tox`, `pre-commit`, `build-container`, `python-module-install`, and `conventional-title`. - Run user-defined `custom-check-runs`, with optional checks that do not have to block merges. - Calculate a `can-be-merged` check from approvals, status checks, blocker labels, mergeability, unresolved review conversations, and any extra required labels you configured. - Auto-merge the PR when the `automerge` label is present and the `can-be-merged` check succeeds. 
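The `can-be-merged` calculation described above combines several independent signals. As a rough, hedged sketch (hypothetical names and label strings, not the server's actual implementation), the decision might look like this:

```python
from dataclasses import dataclass, field


@dataclass
class PRState:
    """Snapshot of the signals the merge check weighs (illustrative only)."""

    approvals_ok: bool             # required OWNERS approvals collected
    required_checks_passed: bool   # protected-branch status checks are green
    labels: set[str] = field(default_factory=set)
    mergeable: bool = True         # no merge conflicts reported by GitHub
    unresolved_threads: int = 0    # unresolved review conversations


# Hypothetical blocker-label names; the real server derives these from config.
BLOCKER_LABELS = {"hold", "wip", "do-not-merge"}


def can_be_merged(state: PRState, extra_required_labels: set[str] = frozenset()) -> bool:
    """Recompute merge eligibility from current state; nothing is cached."""
    if state.labels & BLOCKER_LABELS:
        return False
    # Any extra required labels you configured must all be present.
    if not extra_required_labels <= state.labels:
        return False
    return (
        state.approvals_ok
        and state.required_checks_passed
        and state.mergeable
        and state.unresolved_threads == 0
    )
```

The important property, which the real handler shares, is that the result is derived from the PR's current state each time a relevant event arrives, rather than being carried forward from an earlier event.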
The core PR setup is explicit in the handler: ```779:857:webhook_server/libs/handlers/pull_request_handler.py async def process_opened_or_synchronize_pull_request(self, pull_request: PullRequest) -> None: # Stage 1: Initial setup and check queue tasks setup_tasks: list[Coroutine[Any, Any, Any]] = [] setup_tasks.append(self.owners_file_handler.assign_reviewers(pull_request=pull_request)) setup_tasks.append( self.labels_handler._add_label( pull_request=pull_request, label=f"{BRANCH_LABEL_PREFIX}{pull_request.base.ref}", ) ) setup_tasks.append(self.label_pull_request_by_merge_state(pull_request=pull_request)) setup_tasks.append(self.check_run_handler.set_check_queued(name=CAN_BE_MERGED_STR)) # ... queue tox / pre-commit / python-module-install / build-container / verified / size ... ci_tasks.append(self.runner_handler.run_tox(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_pre_commit(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_install_python_module(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_build_container(pull_request=pull_request)) ``` In this repository's end-to-end tests, a normal PR is expected to end up with successful `build-container`, `pre-commit`, `python-module-install`, and `tox` checks; a queued `verified`; a failing `can-be-merged` until approval and policy requirements are satisfied; and labels such as `size/M` and `branch-main`. ### From comments and reviews Contributors and maintainers can control automation directly from PR comments. 
In addition to label-driven commands such as `/wip`, `/hold`, `/verified`, `/lgtm`, `/approve`, and `/automerge`, the comment handler supports a set of built-in workflow commands: ```154:202:webhook_server/libs/handlers/issue_comment_handler.py available_commands: list[str] = [ COMMAND_RETEST_STR, COMMAND_REPROCESS_STR, COMMAND_CHERRY_PICK_STR, COMMAND_ASSIGN_REVIEWERS_STR, COMMAND_CHECK_CAN_MERGE_STR, BUILD_AND_PUSH_CONTAINER_STR, COMMAND_ASSIGN_REVIEWER_STR, COMMAND_ADD_ALLOWED_USER_STR, COMMAND_REGENERATE_WELCOME_STR, COMMAND_TEST_ORACLE_STR, ] # ... if _command not in available_commands + list(USER_LABELS_DICT.keys()): self.logger.debug(f"{self.log_prefix} Command {command} is not supported.") return ``` In practice, that means users can do things like: - `/assign-reviewers` or `/assign-reviewer @username` - `/retest tox`, `/retest pre-commit`, or `/retest all` - `/reprocess` to rebuild the whole PR workflow - `/check-can-merge` to force a mergeability recalculation - `/build-and-push-container` to publish a PR image on demand - `/cherry-pick ` to queue or perform backports - `/test-oracle` to request AI-generated test recommendations when configured - `/regenerate-welcome` to refresh the onboarding comment Reviews matter too. The server tracks review state with labels such as `approved-*`, `lgtm-*`, `changes-requested-*`, and `commented-*`, and it also understands `/approve` when it appears inside a review body. > **Note:** In this project, `/approve` and `/lgtm` are part of the merge logic, not just convenient comments. The server converts them into labels and uses those labels when deciding whether `can-be-merged` should pass. ### On tags, releases, and backports The automation is not limited to PRs. 
On tag pushes, the server can: - Build a Python distribution with `uv build` - Validate and upload it to PyPI with `twine` - Build and push release container images when `container.release: true` is set - Send Slack notifications for successful publish or push operations On merged PRs, it can also: - Detect `cherry-pick-` labels - Create cherry-pick branches and PRs automatically - Optionally use AI to resolve cherry-pick conflicts - Mark AI-resolved cherry-picks for manual verification instead of auto-verifying them ### Optional AI-assisted features The server also includes optional AI integrations: - `test-oracle` connects to an external service that analyzes a PR and recommends which tests to run. - `ai-features` can suggest or auto-fix PR titles to match your `conventional-title` rules. - The same `ai-features` block can enable AI-assisted cherry-pick conflict resolution. ## Configuration Model The configuration model is layered so you can set organization-wide defaults without losing per-repository flexibility. Settings are resolved in this order: ```132:153:webhook_server/libs/config.py def get_value(self, value: str, return_on_none: Any = None, extra_dict: dict[str, Any] | None = None) -> Any: """ Get value from config Supports dot notation for nested values (e.g., "docker.username", "pypi.token") Order of getting value: 1. Local repository file (.github-webhook-server.yaml) 2. Repository level global config file (config.yaml) 3. 
Root level global config file (config.yaml) """ if extra_dict: result = self._get_nested_value(value, extra_dict) if result is not None: return result for scope in (self.repository_data, self.root_data): result = self._get_nested_value(value, scope) if result is not None: return result ``` That gives you three useful layers: - Root-level defaults in the central `config.yaml` - Per-repository overrides inside the `repositories` map in that same file - Repository-local overrides in `.github-webhook-server.yaml` > **Tip:** Keep shared policy in the central `config.yaml`, then use `.github-webhook-server.yaml` only for repositories that truly need exceptions. A real example from `examples/config.yaml` shows the kind of repository-level behavior you can enable: ```139:183:examples/config.yaml repositories: my-repository: name: my-org/my-repository log-level: DEBUG # Override global log-level for repository log-file: my-repository.log # Override global log-file for repository slack-webhook-url: # Send notification to slack on several operations verified-job: true events: # To listen to all events do not send events - push - pull_request - pull_request_review - pull_request_review_thread - issue_comment - check_run - status tox: main: all # Run all tests in tox.ini when pull request parent branch is main dev: testenv1,testenv2 # Run testenv1 and testenv2 tests in tox.ini when pull request parent branch is dev pre-commit: true # Run pre-commit check protected-branches: dev: [] main: # set [] in order to set all defaults run included include-runs: - "pre-commit.ci - pr" - "WIP" exclude-runs: - "SonarCloud Code Analysis" container: username: password: repository: tag: release: true # Push image to registry on new release with release as the tag ``` At the top level, the example configuration also includes sections such as `labels`, `pr-size-thresholds`, `branch-protection`, `test-oracle`, and `ai-features`, so one server can apply different automation profiles to different 
repositories without duplicating everything. A few especially important settings to know early: - `webhook-ip` must be a full URL, including the `/webhook_server` path. - `webhook-secret` enables GitHub signature verification. - `allow-commands-on-draft-prs` controls whether slash commands are blocked or allowed on draft PRs. - `conventional-title` validates PR titles against a Conventional Commits-style pattern. - `set-auto-merge-prs` and `auto-verified-and-merged-users` control automatic merge behavior. - `custom-check-runs` lets you add your own shell commands as first-class check runs. ## OWNERS-Driven Reviews Reviewer and approver logic is path-aware. The server reads `OWNERS` files from the cloned repository, matches them against the files changed in the PR, and requests the right reviewers automatically. The root `OWNERS` file in this repository uses the expected YAML shape: ```1:6:OWNERS approvers: - myakove - rnetser reviewers: - myakove - rnetser ``` Subdirectories can have their own `OWNERS` files too. When a PR touches files under those paths, the server uses those path-specific approvers and reviewers. If a path-level `OWNERS` file sets `root-approvers: false`, root approvers are not automatically required for that area. ## Operational Notes The server also writes structured webhook logs and can expose an optional internal log viewer and log APIs for troubleshooting PR flow, status checks, and failures. > **Warning:** If you enable the optional log viewer, keep it on a trusted network. The project treats those endpoints as internal operational tooling, not a public-facing dashboard. Taken together, `github-webhook-server` is best understood as a shared automation layer for GitHub: contributors interact with simple PR comments and labels, while maintainers get consistent policy, repeatable release automation, and one place to operate everything. 
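Before moving on to the architecture internals, here is a concrete sketch of the path-specific `OWNERS` behavior described above. A subdirectory `OWNERS` file that supplies its own reviewers and opts out of root approvers might look like this (the directory and usernames are hypothetical):

```yaml
# docs/OWNERS (hypothetical path and users)
root-approvers: false   # do not require root approvers for files under docs/
approvers:
  - docs-maintainer
reviewers:
  - docs-maintainer
  - tech-writer
```

With a file like this in place, a PR that only touches files under `docs/` would be routed to these reviewers and approvers instead of requiring sign-off from the root `OWNERS` approvers.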
--- Source: architecture-and-event-flow.md # Architecture and Event Flow `github-webhook-server` is built around one simple idea: accept GitHub webhooks quickly, then do the real work asynchronously. As a user, that means GitHub gets a fast response, while pull request automation, release work, and logging continue in the background. ## At a Glance - The HTTP endpoint validates the request and returns `200 OK` immediately. - A background task creates a `GithubWebhook` object and routes the event to specialized handlers. - PR automation is split across focused components such as `PullRequestHandler`, `IssueCommentHandler`, `PullRequestReviewHandler`, `CheckRunHandler`, `OwnersFileHandler`, `LabelsHandler`, and `RunnerHandler`. - Each webhook gets one temporary base clone; checks and release actions run in isolated Git worktrees created from that clone. - Every webhook produces both normal text logs and structured JSON records, which can be searched later or viewed through the optional log viewer. ## Before the First Event Startup does more than launch the HTTP server. `entrypoint.py` first runs repository bootstrap logic, then starts Uvicorn with the configured worker count. ```23:45:webhook_server/utils/github_repository_and_webhook_settings.py async def repository_and_webhook_settings(webhook_secret: str | None = None) -> None: config = Config(logger=LOGGER) apis_dict: dict[str, dict[str, Any]] = {} ... await set_repositories_settings(config=config, apis_dict=apis_dict) set_all_in_progress_check_runs_to_queued(repo_config=config, apis_dict=apis_dict) create_webhook(config=config, apis_dict=apis_dict, secret=webhook_secret) ``` That startup pass does three important jobs: - It applies repository-side settings such as labels, branch protection, and related GitHub configuration. - It resets built-in check runs that were left in `in_progress` during a previous shutdown back to `queued`. 
- It creates or updates the GitHub webhook on each configured repository so GitHub actually sends the events listed in your config. `entrypoint.py` then starts the app with `workers=int(_max_workers)`, so worker-level parallelism is controlled by the root `max-workers` setting. > **Note:** The `events` list under each repository is operational, not just descriptive. Startup uses it to create or update the real GitHub webhook subscription. ## Webhook Intake Pipeline When GitHub calls `POST /webhook_server`, the server does only the minimum synchronous work required to prove the request is valid: read the body, verify the signature if configured, parse JSON, and check that the repository and event metadata are present. Once that passes, it returns `200 OK` and hands everything else to a background task. ```418:529:webhook_server/app.py # Return 200 immediately - all validation passed, we can process this webhook LOGGER.info(f"{log_context} Webhook validation passed, queuing for background processing") async def process_with_error_handling( _hook_data: dict[Any, Any], _headers: Headers, _delivery_id: str, _event_type: str ) -> None: # Create structured logging context at the VERY START repository_name = _hook_data.get("repository", {}).get("name", "unknown") repository_full_name = _hook_data.get("repository", {}).get("full_name", "unknown") ctx = create_context( hook_id=_delivery_id, event_type=_event_type, repository=repository_name, repository_full_name=repository_full_name, action=_hook_data.get("action"), sender=_hook_data.get("sender", {}).get("login"), ) ... try: _api: GithubWebhook = GithubWebhook(hook_data=_hook_data, headers=_headers, logger=_logger) try: await _api.process() finally: await _api.cleanup() ... 
finally: if ctx: ctx.completed_at = datetime.now(UTC) log_webhook_summary(ctx, _logger, _log_context) try: write_webhook_log(ctx) except Exception: _logger.exception(f"{_log_context} Failed to write webhook log") finally: clear_context() task = asyncio.create_task( process_with_error_handling( _hook_data=hook_data, _headers=request.headers, _delivery_id=delivery_id, _event_type=event_type, ) ) _background_tasks.add(task) task.add_done_callback(_background_tasks.discard) return JSONResponse( status_code=status.HTTP_200_OK, content={ "status": status.HTTP_200_OK, "message": "Webhook queued for processing", "delivery_id": delivery_id, "event_type": event_type, }, ) ``` In practice, the intake flow looks like this: 1. GitHub sends the event to `POST /webhook_server`. 2. The server optionally checks the source IP, verifies `x-hub-signature-256` when `webhook-secret` is set, parses the payload, and validates required fields. 3. The server returns a small JSON response containing `delivery_id` and `event_type`. 4. A background task creates the structured context, instantiates `GithubWebhook`, runs processing, performs cleanup, and always writes the final summary log. > **Note:** A `200 OK` means "accepted and queued", not "automation finished successfully". The `delivery_id` is the key you use to trace a specific webhook through the logs. For production deployments, the important security settings live near the top of the global config: `webhook-secret`, `verify-github-ips`, and `verify-cloudflare-ips`. ## Background Processing Model The background model is intentionally simple: - Uvicorn provides process-level concurrency. - Inside each worker, webhook processing is queued with `asyncio.create_task`. - Active tasks are tracked in memory and given up to 30 seconds to finish during shutdown before they are cancelled. - Local work such as Git, `tox`, `pre-commit`, `podman`, `gh`, and `twine` runs as subprocesses through `run_command()`. 
- PyGithub itself is synchronous, so the code regularly wraps blocking API calls and many property reads in `asyncio.to_thread()` to keep the event loop responsive. This project does not use Celery, Redis, or an external broker. The “queue” is the application process itself. > **Note:** Because the queue is in-process, recovery is operational rather than broker-based. If the server dies after GitHub already received `200 OK`, you recover with logs, GitHub redelivery, or the `/reprocess` command, not by checking a separate job system. The official container image is designed around that model. It includes the toolchain the server expects to run locally, including `pre-commit`, `tox`, `gh`, `podman`, `regctl`, and the supported AI CLIs. ## Handler Architecture `GithubWebhook.process()` is the router for the whole system. It resolves the event into either a tag flow or a pull-request-backed flow, enriches the structured context, and then dispatches to specialized handlers. At a high level, the routes are: - `pull_request`: initialize the PR, assign reviewers, queue and run checks, post the welcome message, create an issue if configured, and maintain merge-related labels. - `pull_request_review`: translate review state into labels and optionally treat `/approve` in a review body as an approval command. - `issue_comment`: parse slash commands such as `/retest`, `/assign-reviewers`, `/check-can-merge`, `/build-and-push-container`, `/cherry-pick`, `/reprocess`, and `/test-oracle`. - `check_run`: ignore non-terminal runs, react to completed checks, and optionally auto-merge when `can-be-merged` succeeds and the PR has `automerge`. - `status` and `pull_request_review_thread`: re-evaluate merge eligibility when a status reaches a terminal state or a review thread is resolved or unresolved. - `push`: handle tag releases; ordinary branch pushes are intentionally skipped. For a new or updated PR, the main handler is organized into two phases: setup first, then local CI/CD work. 
```779:864:webhook_server/libs/handlers/pull_request_handler.py async def process_opened_or_synchronize_pull_request(self, pull_request: PullRequest) -> None: if self.ctx: self.ctx.start_step("pr_workflow_setup") # Stage 1: Initial setup and check queue tasks setup_tasks: list[Coroutine[Any, Any, Any]] = [] setup_tasks.append(self.owners_file_handler.assign_reviewers(pull_request=pull_request)) setup_tasks.append( self.labels_handler._add_label( pull_request=pull_request, label=f"{BRANCH_LABEL_PREFIX}{pull_request.base.ref}", ) ) setup_tasks.append(self.label_pull_request_by_merge_state(pull_request=pull_request)) setup_tasks.append(self.check_run_handler.set_check_queued(name=CAN_BE_MERGED_STR)) ... self.logger.info(f"{self.log_prefix} Executing setup tasks") setup_results = await asyncio.gather(*setup_tasks, return_exceptions=True) ... if self.ctx: self.ctx.complete_step("pr_workflow_setup") # Stage 2: CI/CD execution tasks if self.ctx: self.ctx.start_step("pr_cicd_execution") ci_tasks: list[Coroutine[Any, Any, Any]] = [] ci_tasks.append(self.runner_handler.run_tox(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_pre_commit(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_install_python_module(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_build_container(pull_request=pull_request)) ... self.logger.info(f"{self.log_prefix} Executing CI/CD tasks") ci_results = await asyncio.gather(*ci_tasks, return_exceptions=True) ... if self.ctx: self.ctx.complete_step("pr_cicd_execution") ``` A few architectural choices are worth knowing: - PR automation is OWNERS-driven. `OwnersFileHandler` determines reviewers, approvers, and command permissions from repository files and the changed paths in the PR. - Merge eligibility is re-computed from current GitHub state rather than blindly trusting one earlier event. 
That is why `check_run`, `status`, and `pull_request_review_thread` all feed back into `check_if_can_be_merged()`. - Optional features such as custom check runs, conventional-title validation, AI suggestions, and test-oracle calls plug into the same handler flow rather than creating a separate architecture. On a typical new PR, the end-to-end suite expects the user-visible check state to look like this: - `build-container`, `pre-commit`, `python-module-install`, and `tox` complete successfully when those features are configured. - `verified` starts in `queued`. - `can-be-merged` is expected to fail until approval, labels, status checks, and conversation rules are satisfied. ## Repository Cloning and Worktrees The repository strategy is one of the most important architectural choices in this project. Instead of recloning the repository for every operation, each webhook gets one temporary base clone. That clone is reused for local file inspection, and separate Git worktrees are created on demand for isolated execution. The base clone is prepared once per webhook: ```262:393:webhook_server/libs/github_api.py async def _clone_repository( self, pull_request: PullRequest | None = None, checkout_ref: str | None = None, ) -> None: ... rc, _, err = await run_command( command=f"git clone {clone_url_with_token} {self.clone_repo_dir}", log_prefix=self.log_prefix, redact_secrets=[github_token], mask_sensitive=self.mask_sensitive, ) ... if pull_request: # Fetch the base branch first (needed for checkout) base_ref = await asyncio.to_thread(lambda: pull_request.base.ref) rc, _, err = await run_command( command=f"{git_cmd} fetch origin {base_ref}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) ... 
# Fetch only this specific PR's ref pr_number = await asyncio.to_thread(lambda: pull_request.number) rc, _, err = await run_command( command=f"{git_cmd} fetch origin +refs/pull/{pr_number}/head:refs/remotes/origin/pr/{pr_number}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) else: # For push events (tags only - branch pushes skip cloning) tag_name = checkout_ref.replace("refs/tags/", "") # type: ignore[union-attr] fetch_refspec = f"refs/tags/{tag_name}:refs/tags/{tag_name}" rc, _, _ = await run_command( command=f"{git_cmd} fetch origin {fetch_refspec}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) ... rc, _, err = await run_command( command=f"{git_cmd} checkout {checkout_target}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) self._repo_cloned = True self.logger.info(f"{self.log_prefix} Repository cloned to {self.clone_repo_dir} (ref: {checkout_target})") ``` That base clone is then used for repository-aware logic such as OWNERS parsing and changed-file detection. `OwnersFileHandler` even uses local `git diff` instead of the GitHub API for changed paths, which keeps rate-limit usage down. When the server needs an isolated execution checkout, it creates a worktree from the shared clone: ```71:164:webhook_server/libs/handlers/runner_handler.py @contextlib.asynccontextmanager async def _checkout_worktree( self, pull_request: PullRequest | None = None, is_merged: bool = False, checkout: str = "", tag_name: str = "", ) -> AsyncGenerator[tuple[bool, str, str, str]]: ... if checkout: checkout_target = checkout elif tag_name: checkout_target = tag_name elif is_merged and pull_request and base_ref is not None: checkout_target = base_ref elif pull_request and pr_number is not None: checkout_target = f"origin/pr/{pr_number}" ... rc, current_branch, _ = await run_command( command=f"git -C {repo_dir} rev-parse --abbrev-ref HEAD", log_prefix=self.log_prefix, mask_sensitive=self.github_webhook.mask_sensitive, ) ... 
async with helpers_module.git_worktree_checkout( repo_dir=repo_dir, checkout=checkout_target, log_prefix=self.log_prefix, mask_sensitive=self.github_webhook.mask_sensitive, ) as (success, worktree_path, out, err): result: tuple[bool, str, str, str] = (success, worktree_path, out, err) # Merge base branch if needed (for PR testing) if success and pull_request and not is_merged and not tag_name: git_cmd = f"git -C {worktree_path}" rc, out, err = await run_command( command=f"{git_cmd} merge origin/{merge_ref} -m 'Merge {merge_ref}'", log_prefix=self.log_prefix, mask_sensitive=self.github_webhook.mask_sensitive, ) if not rc: result = (False, worktree_path, out, err) yield result ``` This design gives the server a few advantages: - The expensive `git clone` happens once per webhook, not once per check. - The base clone stays on a stable checkout that is good for reading `OWNERS` files and computing diffs. - Each execution path gets its own isolated workspace, which prevents one command from polluting another. - PR checks are run against a worktree that merges the current base branch into the PR checkout, so validation is closer to what GitHub would merge. - Tag-based release work can run against a tag worktree without disturbing PR-related state. Cloning is also deliberately avoided when it is not useful: - Branch pushes skip cloning entirely. - Tag pushes clone because release actions need a real checkout. - `check_run` events are ignored unless the action is `completed`. - A failed `can-be-merged` check run does not trigger another clone-and-recheck cycle. > **Tip:** This shared-clone-plus-worktree model is what lets the server run `tox`, `pre-commit`, Python packaging, container builds, `gh` commands, and AI-assisted flows locally without paying the cost of repeated full clones. ## Structured Logging Flow Every webhook carries a structured execution context from the moment background processing starts to the moment the final summary is written. 
The flow looks like this: 1. `create_context()` stores a `WebhookContext` in a `ContextVar`. 2. Handlers call `start_step()`, `complete_step()`, and `fail_step()` for major workflow stages such as `repo_clone`, `pr_workflow_setup`, `pr_cicd_execution`, `check_merge_eligibility`, and `push_handler`. 3. Normal log messages are still written, but `JsonLogHandler` also serializes them as JSON `log_entry` records and enriches them with webhook metadata from the current context. 4. At the end of processing, `write_webhook_log()` writes one `webhook_summary` record with timing, PR metadata, token usage, workflow steps, and overall success or failure. The summary writer stores those records as one JSON object per line in daily files: ```93:152:webhook_server/utils/structured_logger.py def write_log(self, context: WebhookContext) -> None: """Write webhook context as JSONL entry to date-based log file.""" completed_at = context.completed_at if context.completed_at else datetime.now(UTC) # Get context dict and update timing locally (without mutating context) context_dict = context.to_dict() context_dict["type"] = "webhook_summary" if "timing" in context_dict: context_dict["timing"]["completed_at"] = completed_at.isoformat() if context.started_at: duration_ms = int((completed_at - context.started_at).total_seconds() * 1000) context_dict["timing"]["duration_ms"] = duration_ms # Get log file path log_file = self._get_log_file_path(completed_at) # Serialize context to JSON (compact JSONL format - single line, no indentation) log_entry = json.dumps(context_dict, ensure_ascii=False) ... # Write JSON entry with single newline (JSONL format) os.write(temp_fd, f"{log_entry}\n".encode()) ... with open(log_file, "a") as log_fd: ... log_fd.write(data.decode("utf-8")) ``` For operators, the important outputs are: - Text logs for day-to-day reading. - `log_entry` JSON records for individual log messages. - `webhook_summary` JSON records for the complete end-to-end outcome of one delivery. 
- Daily files named `webhooks_YYYY-MM-DD.json` under `{data_dir}/logs`. If you enable `ENABLE_LOG_SERVER=true`, the application also exposes a log viewer and related APIs that read these same structured files for filtering, export, workflow-step drill-down, and live streaming. > **Warning:** Treat the log viewer as an internal operations surface. It is only mounted when `ENABLE_LOG_SERVER=true`, and it should be exposed only on a trusted network boundary. ## Configuration That Changes the Flow These root settings shape intake, logging, and bootstrap behavior: ```3:17:examples/config.yaml log-level: INFO # Set global log level, change take effect immediately without server restart log-file: webhook-server.log # Set global log file, change take effect immediately without server restart mcp-log-file: mcp_server.log # Set global MCP log file, change take effect immediately without server restart logs-server-log-file: logs_server.log # Set global Logs Server log file, change take effect immediately without server restart mask-sensitive-data: true # Mask sensitive data in logs (default: true). Set to false for debugging (NOT recommended in production) # Server configuration disable-ssl-warnings: true # Disable SSL warnings (useful in production to reduce log noise from SSL certificate issues) # ... 
webhook-ip: # Full URL with path (e.g., https://your-domain.com/webhook_server or https://smee.io/your-channel) ``` These repository settings determine which events are registered and what a PR or tag push actually does when it arrives: ```139:182:examples/config.yaml repositories: my-repository: name: my-org/my-repository log-level: DEBUG # Override global log-level for repository log-file: my-repository.log # Override global log-file for repository mask-sensitive-data: false # Override global setting - disable masking for debugging this specific repo (NOT recommended in production) slack-webhook-url: # Send notification to slack on several operations verified-job: true pypi: token: events: # To listen to all events do not send events - push - pull_request - pull_request_review - pull_request_review_thread - issue_comment - check_run - status tox: main: all # Run all tests in tox.ini when pull request parent branch is main dev: testenv1,testenv2 # Run testenv1 and testenv2 tests in tox.ini when pull request parent branch is dev pre-commit: true # Run pre-commit check protected-branches: dev: [] main: # set [] in order to set all defaults run included include-runs: - "pre-commit.ci - pr" - "WIP" exclude-runs: - "SonarCloud Code Analysis" container: username: password: repository: tag: release: true # Push image to registry on new release with release as the tag build-args: # build args to send to podman build command - my-build-arg1=1 - my-build-arg2=2 args: # args to send to podman build command - --format docker ``` A few configuration rules are especially important when you are reasoning about the event flow: - `repositories..events` controls what GitHub sends to the server after startup sync. - `tox`, `pre-commit`, `pypi`, `container`, `conventional-title`, and custom check-run settings decide which checks are queued and which local commands actually run. - `protected-branches` shapes the status-check list that `can-be-merged` evaluates against. 
---

# Architecture and Event Flow

`github-webhook-server` is built around one simple idea: accept GitHub webhooks quickly, then do the real work asynchronously. As a user, that means GitHub gets a fast response, while pull request automation, release work, and logging continue in the background.

## At a Glance

- The HTTP endpoint validates the request and returns `200 OK` immediately.
- A background task creates a `GithubWebhook` object and routes the event to specialized handlers.
- PR automation is split across focused components such as `PullRequestHandler`, `IssueCommentHandler`, `PullRequestReviewHandler`, `CheckRunHandler`, `OwnersFileHandler`, `LabelsHandler`, and `RunnerHandler`.
- Each webhook gets one temporary base clone; checks and release actions run in isolated Git worktrees created from that clone.
- Every webhook produces both normal text logs and structured JSON records, which can be searched later or viewed through the optional log viewer.

## Before the First Event

Startup does more than launch the HTTP server.
`entrypoint.py` first runs repository bootstrap logic, then starts Uvicorn with the configured worker count. ```23:45:webhook_server/utils/github_repository_and_webhook_settings.py async def repository_and_webhook_settings(webhook_secret: str | None = None) -> None: config = Config(logger=LOGGER) apis_dict: dict[str, dict[str, Any]] = {} ... await set_repositories_settings(config=config, apis_dict=apis_dict) set_all_in_progress_check_runs_to_queued(repo_config=config, apis_dict=apis_dict) create_webhook(config=config, apis_dict=apis_dict, secret=webhook_secret) ``` That startup pass does three important jobs: - It applies repository-side settings such as labels, branch protection, and related GitHub configuration. - It resets built-in check runs that were left in `in_progress` during a previous shutdown back to `queued`. - It creates or updates the GitHub webhook on each configured repository so GitHub actually sends the events listed in your config. `entrypoint.py` then starts the app with `workers=int(_max_workers)`, so worker-level parallelism is controlled by the root `max-workers` setting. > **Note:** The `events` list under each repository is operational, not just descriptive. Startup uses it to create or update the real GitHub webhook subscription. ## Webhook Intake Pipeline When GitHub calls `POST /webhook_server`, the server does only the minimum synchronous work required to prove the request is valid: read the body, verify the signature if configured, parse JSON, and check that the repository and event metadata are present. Once that passes, it returns `200 OK` and hands everything else to a background task. 
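The signature step in that synchronous validation can be sketched in a few lines. This is an illustrative helper, not the server's actual implementation; it follows GitHub's documented `x-hub-signature-256` scheme (`sha256=<hexdigest>` of the raw request body, keyed with the shared `webhook-secret`):

```python
import hashlib
import hmac


def verify_signature(secret: str, body: bytes, signature_header: str) -> bool:
    """Validate GitHub's x-hub-signature-256 header against the raw request body."""
    expected = "sha256=" + hmac.new(secret.encode(), body, hashlib.sha256).hexdigest()
    # compare_digest avoids leaking timing information during the comparison
    return hmac.compare_digest(expected, signature_header)
```

The comparison must run against the raw bytes before JSON parsing; any re-serialization of the payload would change the digest.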
```418:529:webhook_server/app.py # Return 200 immediately - all validation passed, we can process this webhook LOGGER.info(f"{log_context} Webhook validation passed, queuing for background processing") async def process_with_error_handling( _hook_data: dict[Any, Any], _headers: Headers, _delivery_id: str, _event_type: str ) -> None: # Create structured logging context at the VERY START repository_name = _hook_data.get("repository", {}).get("name", "unknown") repository_full_name = _hook_data.get("repository", {}).get("full_name", "unknown") ctx = create_context( hook_id=_delivery_id, event_type=_event_type, repository=repository_name, repository_full_name=repository_full_name, action=_hook_data.get("action"), sender=_hook_data.get("sender", {}).get("login"), ) ... try: _api: GithubWebhook = GithubWebhook(hook_data=_hook_data, headers=_headers, logger=_logger) try: await _api.process() finally: await _api.cleanup() ... finally: if ctx: ctx.completed_at = datetime.now(UTC) log_webhook_summary(ctx, _logger, _log_context) try: write_webhook_log(ctx) except Exception: _logger.exception(f"{_log_context} Failed to write webhook log") finally: clear_context() task = asyncio.create_task( process_with_error_handling( _hook_data=hook_data, _headers=request.headers, _delivery_id=delivery_id, _event_type=event_type, ) ) _background_tasks.add(task) task.add_done_callback(_background_tasks.discard) return JSONResponse( status_code=status.HTTP_200_OK, content={ "status": status.HTTP_200_OK, "message": "Webhook queued for processing", "delivery_id": delivery_id, "event_type": event_type, }, ) ``` In practice, the intake flow looks like this: 1. GitHub sends the event to `POST /webhook_server`. 2. The server optionally checks the source IP, verifies `x-hub-signature-256` when `webhook-secret` is set, parses the payload, and validates required fields. 3. The server returns a small JSON response containing `delivery_id` and `event_type`. 4. 
A background task creates the structured context, instantiates `GithubWebhook`, runs processing, performs cleanup, and always writes the final summary log. > **Note:** A `200 OK` means "accepted and queued", not "automation finished successfully". The `delivery_id` is the key you use to trace a specific webhook through the logs. For production deployments, the important security settings live near the top of the global config: `webhook-secret`, `verify-github-ips`, and `verify-cloudflare-ips`. ## Background Processing Model The background model is intentionally simple: - Uvicorn provides process-level concurrency. - Inside each worker, webhook processing is queued with `asyncio.create_task`. - Active tasks are tracked in memory and given up to 30 seconds to finish during shutdown before they are cancelled. - Local work such as Git, `tox`, `pre-commit`, `podman`, `gh`, and `twine` runs as subprocesses through `run_command()`. - PyGithub itself is synchronous, so the code regularly wraps blocking API calls and many property reads in `asyncio.to_thread()` to keep the event loop responsive. This project does not use Celery, Redis, or an external broker. The “queue” is the application process itself. > **Note:** Because the queue is in-process, recovery is operational rather than broker-based. If the server dies after GitHub already received `200 OK`, you recover with logs, GitHub redelivery, or the `/reprocess` command, not by checking a separate job system. The official container image is designed around that model. It includes the toolchain the server expects to run locally, including `pre-commit`, `tox`, `gh`, `podman`, `regctl`, and the supported AI CLIs. ## Handler Architecture `GithubWebhook.process()` is the router for the whole system. It resolves the event into either a tag flow or a pull-request-backed flow, enriches the structured context, and then dispatches to specialized handlers. 
At a high level, the routes are: - `pull_request`: initialize the PR, assign reviewers, queue and run checks, post the welcome message, create an issue if configured, and maintain merge-related labels. - `pull_request_review`: translate review state into labels and optionally treat `/approve` in a review body as an approval command. - `issue_comment`: parse slash commands such as `/retest`, `/assign-reviewers`, `/check-can-merge`, `/build-and-push-container`, `/cherry-pick`, `/reprocess`, and `/test-oracle`. - `check_run`: ignore non-terminal runs, react to completed checks, and optionally auto-merge when `can-be-merged` succeeds and the PR has `automerge`. - `status` and `pull_request_review_thread`: re-evaluate merge eligibility when a status reaches a terminal state or a review thread is resolved or unresolved. - `push`: handle tag releases; ordinary branch pushes are intentionally skipped. For a new or updated PR, the main handler is organized into two phases: setup first, then local CI/CD work. ```779:864:webhook_server/libs/handlers/pull_request_handler.py async def process_opened_or_synchronize_pull_request(self, pull_request: PullRequest) -> None: if self.ctx: self.ctx.start_step("pr_workflow_setup") # Stage 1: Initial setup and check queue tasks setup_tasks: list[Coroutine[Any, Any, Any]] = [] setup_tasks.append(self.owners_file_handler.assign_reviewers(pull_request=pull_request)) setup_tasks.append( self.labels_handler._add_label( pull_request=pull_request, label=f"{BRANCH_LABEL_PREFIX}{pull_request.base.ref}", ) ) setup_tasks.append(self.label_pull_request_by_merge_state(pull_request=pull_request)) setup_tasks.append(self.check_run_handler.set_check_queued(name=CAN_BE_MERGED_STR)) ... self.logger.info(f"{self.log_prefix} Executing setup tasks") setup_results = await asyncio.gather(*setup_tasks, return_exceptions=True) ... 
if self.ctx: self.ctx.complete_step("pr_workflow_setup") # Stage 2: CI/CD execution tasks if self.ctx: self.ctx.start_step("pr_cicd_execution") ci_tasks: list[Coroutine[Any, Any, Any]] = [] ci_tasks.append(self.runner_handler.run_tox(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_pre_commit(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_install_python_module(pull_request=pull_request)) ci_tasks.append(self.runner_handler.run_build_container(pull_request=pull_request)) ... self.logger.info(f"{self.log_prefix} Executing CI/CD tasks") ci_results = await asyncio.gather(*ci_tasks, return_exceptions=True) ... if self.ctx: self.ctx.complete_step("pr_cicd_execution") ``` A few architectural choices are worth knowing: - PR automation is OWNERS-driven. `OwnersFileHandler` determines reviewers, approvers, and command permissions from repository files and the changed paths in the PR. - Merge eligibility is re-computed from current GitHub state rather than blindly trusting one earlier event. That is why `check_run`, `status`, and `pull_request_review_thread` all feed back into `check_if_can_be_merged()`. - Optional features such as custom check runs, conventional-title validation, AI suggestions, and test-oracle calls plug into the same handler flow rather than creating a separate architecture. On a typical new PR, the end-to-end suite expects the user-visible check state to look like this: - `build-container`, `pre-commit`, `python-module-install`, and `tox` complete successfully when those features are configured. - `verified` starts in `queued`. - `can-be-merged` is expected to fail until approval, labels, status checks, and conversation rules are satisfied. ## Repository Cloning and Worktrees The repository strategy is one of the most important architectural choices in this project. Instead of recloning the repository for every operation, each webhook gets one temporary base clone. 
That clone is reused for local file inspection, and separate Git worktrees are created on demand for isolated execution. The base clone is prepared once per webhook: ```262:393:webhook_server/libs/github_api.py async def _clone_repository( self, pull_request: PullRequest | None = None, checkout_ref: str | None = None, ) -> None: ... rc, _, err = await run_command( command=f"git clone {clone_url_with_token} {self.clone_repo_dir}", log_prefix=self.log_prefix, redact_secrets=[github_token], mask_sensitive=self.mask_sensitive, ) ... if pull_request: # Fetch the base branch first (needed for checkout) base_ref = await asyncio.to_thread(lambda: pull_request.base.ref) rc, _, err = await run_command( command=f"{git_cmd} fetch origin {base_ref}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) ... # Fetch only this specific PR's ref pr_number = await asyncio.to_thread(lambda: pull_request.number) rc, _, err = await run_command( command=f"{git_cmd} fetch origin +refs/pull/{pr_number}/head:refs/remotes/origin/pr/{pr_number}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) else: # For push events (tags only - branch pushes skip cloning) tag_name = checkout_ref.replace("refs/tags/", "") # type: ignore[union-attr] fetch_refspec = f"refs/tags/{tag_name}:refs/tags/{tag_name}" rc, _, _ = await run_command( command=f"{git_cmd} fetch origin {fetch_refspec}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) ... rc, _, err = await run_command( command=f"{git_cmd} checkout {checkout_target}", log_prefix=self.log_prefix, mask_sensitive=self.mask_sensitive, ) self._repo_cloned = True self.logger.info(f"{self.log_prefix} Repository cloned to {self.clone_repo_dir} (ref: {checkout_target})") ``` That base clone is then used for repository-aware logic such as OWNERS parsing and changed-file detection. `OwnersFileHandler` even uses local `git diff` instead of the GitHub API for changed paths, which keeps rate-limit usage down. 
When the server needs an isolated execution checkout, it creates a worktree from the shared clone: ```71:164:webhook_server/libs/handlers/runner_handler.py @contextlib.asynccontextmanager async def _checkout_worktree( self, pull_request: PullRequest | None = None, is_merged: bool = False, checkout: str = "", tag_name: str = "", ) -> AsyncGenerator[tuple[bool, str, str, str]]: ... if checkout: checkout_target = checkout elif tag_name: checkout_target = tag_name elif is_merged and pull_request and base_ref is not None: checkout_target = base_ref elif pull_request and pr_number is not None: checkout_target = f"origin/pr/{pr_number}" ... rc, current_branch, _ = await run_command( command=f"git -C {repo_dir} rev-parse --abbrev-ref HEAD", log_prefix=self.log_prefix, mask_sensitive=self.github_webhook.mask_sensitive, ) ... async with helpers_module.git_worktree_checkout( repo_dir=repo_dir, checkout=checkout_target, log_prefix=self.log_prefix, mask_sensitive=self.github_webhook.mask_sensitive, ) as (success, worktree_path, out, err): result: tuple[bool, str, str, str] = (success, worktree_path, out, err) # Merge base branch if needed (for PR testing) if success and pull_request and not is_merged and not tag_name: git_cmd = f"git -C {worktree_path}" rc, out, err = await run_command( command=f"{git_cmd} merge origin/{merge_ref} -m 'Merge {merge_ref}'", log_prefix=self.log_prefix, mask_sensitive=self.github_webhook.mask_sensitive, ) if not rc: result = (False, worktree_path, out, err) yield result ``` This design gives the server a few advantages: - The expensive `git clone` happens once per webhook, not once per check. - The base clone stays on a stable checkout that is good for reading `OWNERS` files and computing diffs. - Each execution path gets its own isolated workspace, which prevents one command from polluting another. - PR checks are run against a worktree that merges the current base branch into the PR checkout, so validation is closer to what GitHub would merge. 
- Tag-based release work can run against a tag worktree without disturbing PR-related state. Cloning is also deliberately avoided when it is not useful: - Branch pushes skip cloning entirely. - Tag pushes clone because release actions need a real checkout. - `check_run` events are ignored unless the action is `completed`. - A failed `can-be-merged` check run does not trigger another clone-and-recheck cycle. > **Tip:** This shared-clone-plus-worktree model is what lets the server run `tox`, `pre-commit`, Python packaging, container builds, `gh` commands, and AI-assisted flows locally without paying the cost of repeated full clones. ## Structured Logging Flow Every webhook carries a structured execution context from the moment background processing starts to the moment the final summary is written. The flow looks like this: 1. `create_context()` stores a `WebhookContext` in a `ContextVar`. 2. Handlers call `start_step()`, `complete_step()`, and `fail_step()` for major workflow stages such as `repo_clone`, `pr_workflow_setup`, `pr_cicd_execution`, `check_merge_eligibility`, and `push_handler`. 3. Normal log messages are still written, but `JsonLogHandler` also serializes them as JSON `log_entry` records and enriches them with webhook metadata from the current context. 4. At the end of processing, `write_webhook_log()` writes one `webhook_summary` record with timing, PR metadata, token usage, workflow steps, and overall success or failure. 
The summary writer stores those records as one JSON object per line in daily files: ```93:152:webhook_server/utils/structured_logger.py def write_log(self, context: WebhookContext) -> None: """Write webhook context as JSONL entry to date-based log file.""" completed_at = context.completed_at if context.completed_at else datetime.now(UTC) # Get context dict and update timing locally (without mutating context) context_dict = context.to_dict() context_dict["type"] = "webhook_summary" if "timing" in context_dict: context_dict["timing"]["completed_at"] = completed_at.isoformat() if context.started_at: duration_ms = int((completed_at - context.started_at).total_seconds() * 1000) context_dict["timing"]["duration_ms"] = duration_ms # Get log file path log_file = self._get_log_file_path(completed_at) # Serialize context to JSON (compact JSONL format - single line, no indentation) log_entry = json.dumps(context_dict, ensure_ascii=False) ... # Write JSON entry with single newline (JSONL format) os.write(temp_fd, f"{log_entry}\n".encode()) ... with open(log_file, "a") as log_fd: ... log_fd.write(data.decode("utf-8")) ``` For operators, the important outputs are: - Text logs for day-to-day reading. - `log_entry` JSON records for individual log messages. - `webhook_summary` JSON records for the complete end-to-end outcome of one delivery. - Daily files named `webhooks_YYYY-MM-DD.json` under `{data_dir}/logs`. If you enable `ENABLE_LOG_SERVER=true`, the application also exposes a log viewer and related APIs that read these same structured files for filtering, export, workflow-step drill-down, and live streaming. > **Warning:** Treat the log viewer as an internal operations surface. It is only mounted when `ENABLE_LOG_SERVER=true`, and it should be exposed only on a trusted network boundary. 
## Configuration That Changes the Flow

These root settings shape intake, logging, and bootstrap behavior:

```3:17:examples/config.yaml
log-level: INFO  # Set global log level; changes take effect immediately without a server restart
log-file: webhook-server.log  # Set global log file; changes take effect immediately without a server restart
mcp-log-file: mcp_server.log  # Set global MCP log file; changes take effect immediately without a server restart
logs-server-log-file: logs_server.log  # Set global Logs Server log file; changes take effect immediately without a server restart
mask-sensitive-data: true  # Mask sensitive data in logs (default: true). Set to false for debugging (NOT recommended in production)

# Server configuration
disable-ssl-warnings: true  # Disable SSL warnings (useful in production to reduce log noise from SSL certificate issues)
# ...
webhook-ip:  # Full URL with path (e.g., https://your-domain.com/webhook_server or https://smee.io/your-channel)
```

These repository settings determine which events are registered and what a PR or tag push actually does when it arrives:

```139:182:examples/config.yaml
repositories:
  my-repository:
    name: my-org/my-repository
    log-level: DEBUG  # Override global log-level for repository
    log-file: my-repository.log  # Override global log-file for repository
    mask-sensitive-data: false  # Override global setting - disable masking for debugging this specific repo (NOT recommended in production)
    slack-webhook-url:  # Send notification to slack on several operations
    verified-job: true
    pypi:
      token:
    events:  # To listen to all events, do not set `events`
      - push
      - pull_request
      - pull_request_review
      - pull_request_review_thread
      - issue_comment
      - check_run
      - status
    tox:
      main: all  # Run all tests in tox.ini when pull request parent branch is main
      dev: testenv1,testenv2  # Run testenv1 and testenv2 tests in tox.ini when pull request parent branch is dev
    pre-commit: true  # Run pre-commit check
    protected-branches:
      dev: []
      main:  # set [] in order to set all defaults
        include-runs:
          - "pre-commit.ci - pr"
          - "WIP"
        exclude-runs:
          - "SonarCloud Code Analysis"
    container:
      username:
      password:
      repository:
      tag:
      release: true  # Push image to registry on new release with release as the tag
      build-args:  # build args to send to podman build command
        - my-build-arg1=1
        - my-build-arg2=2
      args:  # args to send to podman build command
        - --format docker
```

A few configuration rules are especially important when you are reasoning about the event flow:

- `repositories.<repository>.events` controls what GitHub sends to the server after startup sync.
- `tox`, `pre-commit`, `pypi`, `container`, `conventional-title`, and custom check-run settings decide which checks are queued and which local commands actually run.
- `protected-branches` shapes the status-check list that `can-be-merged` evaluates against.
- `mask-sensitive-data` controls whether secrets are scrubbed from text logs.
- `slack-webhook-url`, `test-oracle`, and AI features add side effects around the main PR pipeline, but they still fit into the same handler model.

> **Note:** Repository-local `.github-webhook-server.yaml` overrides matching values from the global `config.yaml`. That lets one server instance manage repositories with different PR rules, labels, checks, and release behavior without changing the intake architecture.

Put together, the architecture is straightforward: validate fast, process in the background, route by event type, work from one shared clone, isolate side effects in worktrees, and leave a structured trail behind for every delivery. That is what makes `github-webhook-server` feel responsive to GitHub while still doing substantial repository automation under the hood.

---

Source: installation.md

# Installation

`github-webhook-server` is configured around a small server data directory plus GitHub credentials.
A working install needs a Python `3.13.x` interpreter, `uv`, `git`, a reachable webhook URL, and GitHub credentials that can manage the repositories you configure. ## Runtime requirements The project pins Python exactly: ```45:45:pyproject.toml requires-python = "==3.13.*" ``` Install these tools for a normal source install: - `uv` - `git` Install these only if you use the matching features: - `podman` for `docker:` login and repository `container:` build/push automation - `gh` for automated cherry-pick PR creation - `claude`, `gemini`, or `cursor` CLI if you enable `ai-features` or `test-oracle` - Node.js and `npm` if you want to install the Gemini CLI locally > **Note:** The built-in tox, pre-commit, and twine flows are launched through `uv` and `uvx`, so you do not need to install those tools globally. ## Python and `uv` setup Once Python `3.13.x` and `uv` are available, install the project from the repository root: ```bash uv sync ``` Start the server with a data directory of your choice: ```bash WEBHOOK_SERVER_DATA_DIR=/path/to/data uv run entrypoint.py ``` The bind address, port, worker count, and webhook secret are read from `config.yaml`: ```13:16:entrypoint.py _ip_bind = _root_config.get("ip-bind", "0.0.0.0") _port = _root_config.get("port", 5000) _max_workers = _root_config.get("max-workers", 10) _webhook_secret = _root_config.get("webhook-secret") ``` > **Tip:** Put listener settings such as `ip-bind`, `port`, `max-workers`, and `webhook-secret` in `config.yaml`. The important environment variable for startup is `WEBHOOK_SERVER_DATA_DIR`. ## Prepare the data directory and config The server always looks for `config.yaml` inside the data directory. 
If `WEBHOOK_SERVER_DATA_DIR` is not set, it defaults to `/home/podman/data`: ```20:33:webhook_server/libs/config.py self.data_dir: str = os.environ.get("WEBHOOK_SERVER_DATA_DIR", "/home/podman/data") self.config_path: str = os.path.join(self.data_dir, "config.yaml") self.repository = repository self.exists() self.repositories_exists() ... if not os.path.isfile(self.config_path): raise FileNotFoundError(f"Config file {self.config_path} not found") ... if not self.root_data.get("repositories"): raise ValueError(f"Config {self.config_path} does not have `repositories`") ``` The GitHub App private key is also expected in the same directory, with this exact filename: ```413:418:webhook_server/utils/github_repository_settings.py with open(os.path.join(config_.data_dir, "webhook-server.private-key.pem")) as fd: private_key = fd.read() github_app_id: int = config_.root_data["github-app-id"] auth: AppAuth = Auth.AppAuth(app_id=github_app_id, private_key=private_key) ``` Create a directory like this before first start: ```text /path/to/data/ config.yaml webhook-server.private-key.pem logs/ ``` You only need to create `config.yaml` and `webhook-server.private-key.pem` yourself. The server creates the log directory and structured log files automatically: ```74:91:webhook_server/utils/structured_logger.py self.log_dir = Path(self.config.data_dir) / "logs" # Create log directory if it doesn't exist self.log_dir.mkdir(parents=True, exist_ok=True) ... 
date_str = date.strftime("%Y-%m-%d") return self.log_dir / f"webhooks_{date_str}.json" ``` Relative log filenames are stored under `/logs`: ```141:147:webhook_server/utils/helpers.py if log_file_name and not log_file_name.startswith("/"): log_file_path = os.path.join(config.data_dir, "logs") if not os.path.isdir(log_file_path): os.makedirs(log_file_path, exist_ok=True) return os.path.join(log_file_path, log_file_name) ``` Typical generated contents are: - `logs/webhook-server.log` - `logs/webhooks_YYYY-MM-DD.json` - `logs/mcp_server.log` if MCP is enabled - `logs/logs_server.log` if the log viewer is enabled - `log-colors.json` in the data directory root when repository colors are first assigned If you run the container image, mount your host data directory to `/home/podman/data`: ```5:6:examples/docker-compose.yaml volumes: - "./webhook_server_data_dir:/home/podman/data:Z" # Should include config.yaml and webhook-server.private-key.pem ``` ### GitHub credentials A working install needs both of these: - `github-app-id` in `config.yaml`, plus the matching private key in `webhook-server.private-key.pem` - one or more GitHub tokens in `github-tokens` From the shipped example config: ```12:17:examples/config.yaml github-app-id: 123456 # GitHub app id github-tokens: - - webhook-ip: # Full URL with path (e.g., https://your-domain.com/webhook_server or https://smee.io/your-channel) ``` Replace those placeholder values with your real credentials. 
The `repositories` section uses the short repository name as the map key, and the full `owner/repo` string inside `name`: ```139:142:examples/config.yaml repositories: my-repository: name: my-org/my-repository log-level: DEBUG # Override global log-level for repository ``` That means: - the map key (`my-repository`) should match GitHub’s `repository.name` - the `name` field must be the full `owner/repo` - at least one repository entry is required The server builds a client for every configured token and selects the one with the highest remaining rate limit: ```455:518:webhook_server/utils/helpers.py apis_and_tokens: list[tuple[github.Github, str]] = [] tokens = config.get_value(value="github-tokens") or [] for _token in tokens: apis_and_tokens.append((github.Github(auth=github.Auth.Token(_token)), _token)) # ... choose the token with the highest remaining rate limit ... if not _api_user or not api or not token: raise NoApiTokenError("Failed to get API with highest rate limit") ``` > **Warning:** A GitHub token alone is not enough. The server also reads `github-app-id` and `webhook-server.private-key.pem`, then requests the repository installation from GitHub. Make sure the GitHub App is installed on every repository listed in `config.yaml`. > **Note:** `webhook-secret` is optional in code, but strongly recommended in any real deployment. If you set it, the server verifies GitHub’s webhook signature before queueing work. > **Tip:** `webhook-ip` must be the full external URL GitHub can reach, including the `/webhook_server` path. For local testing, the example config explicitly allows a relay URL such as `https://smee.io/your-channel`. Startup is active, not passive. 
Before serving requests, the application syncs repository settings and creates or updates webhooks for every configured repository: ```43:45:webhook_server/utils/github_repository_and_webhook_settings.py await set_repositories_settings(config=config, apis_dict=apis_dict) set_all_in_progress_check_runs_to_queued(repo_config=config, apis_dict=apis_dict) create_webhook(config=config, apis_dict=apis_dict, secret=webhook_secret) ``` > **Warning:** Use credentials with enough permission to manage repository settings, branch protection, labels, hooks, and pull-request workflows. Read-only credentials are not enough for this server. ## Start and verify Before first start, validate the config file: ```bash uv run webhook_server/tests/test_schema_validator.py /path/to/data/config.yaml ``` Then start the server: ```bash WEBHOOK_SERVER_DATA_DIR=/path/to/data uv run entrypoint.py ``` Verify that the health endpoint responds: ```bash curl http://127.0.0.1:5000/webhook_server/healthcheck ``` A healthy server responds on `/webhook_server/healthcheck`, and if your credentials and `webhook-ip` are correct, startup will also sync repository settings and webhook configuration. > **Warning:** If you enable `ENABLE_LOG_SERVER=true`, treat `/logs` as a trusted-network-only interface. It is intended for internal use, not public internet exposure. --- Source: quick-start.md # Quick Start This guide gets `github-webhook-server` running with one repository. You will create a data directory, add a minimal `config.yaml`, place the GitHub App private key where the server expects it, start the app, and verify that it is alive. ## Before You Start You need: - Python `3.13` - `uv` - A GitHub App ID - The matching GitHub App private key in PEM format - At least one GitHub token the server can use for API calls - A repository where that GitHub App is installed > **Warning:** The server uses both `github-tokens` and GitHub App auth. 
The token pool is used for regular GitHub API calls, and `github-app-id` plus `webhook-server.private-key.pem` are used to authenticate as the app installation. ## 1. Create a Data Directory The server loads `config.yaml` from `WEBHOOK_SERVER_DATA_DIR`. If you do not set that variable, it defaults to `/home/podman/data`. ```bash export WEBHOOK_SERVER_DATA_DIR=/path/to/data mkdir -p "$WEBHOOK_SERVER_DATA_DIR" ``` Your directory should look like this: ```text /path/to/data/ ├── config.yaml └── webhook-server.private-key.pem ``` ## 2. Create a Minimal `config.yaml` A minimal working config needs: - `github-app-id` - `github-tokens` - `webhook-ip` - At least one repository under `repositories` ```yaml # yaml-language-server: $schema=https://raw.githubusercontent.com/myk-org/github-webhook-server/refs/heads/main/webhook_server/config/schema.yaml github-app-id: 123456 github-tokens: - token1 webhook-ip: https://your-domain.com/webhook_server repositories: test-repo: name: org/test-repo ``` Replace `123456`, `token1`, `https://your-domain.com/webhook_server`, and `org/test-repo` with your real values. What each part means: - `github-app-id` is your GitHub App ID. - `github-tokens` is the token pool the server will choose from at startup. - `webhook-ip` is the public URL GitHub should call. - `repositories` is the list of repositories the server should manage. - `test-repo` is the short repository name. - `name` is the full `owner/repo` name. > **Warning:** The key under `repositories` should be the short repository name, such as `test-repo`, not the full `owner/repo`. The full name belongs in the nested `name` field. > **Warning:** `webhook-ip` should be the full webhook URL. In a normal deployment that means including `/webhook_server`, for example `https://your-domain.com/webhook_server`. > **Warning:** `localhost` is fine for the health check, but GitHub cannot deliver webhooks to `localhost`. Use a real public URL or a `smee.io` channel URL for webhook delivery. 
> **Note:** If you omit `events`, the server creates the webhook with `*`, which subscribes it to all events. > **Note:** You can list more than one token in `github-tokens`. The server checks them and selects the one with the highest remaining rate limit. If you want GitHub to sign webhook deliveries, add a shared secret: ```yaml webhook-secret: test-webhook-secret ``` > **Tip:** You do not need a repo-local `.github-webhook-server.yaml` file for a minimal setup. The global `config.yaml` is enough to get started. ## 3. Add the GitHub App Private Key Save the GitHub App private key as: `$WEBHOOK_SERVER_DATA_DIR/webhook-server.private-key.pem` The filename matters. The server loads that exact file from the data directory when it creates the GitHub App installation client. > **Warning:** The private key is not a replacement for `github-tokens`. You need both. > **Warning:** The matching GitHub App must be installed on every repository you add, or the server will not be able to fetch the repository installation. ## 4. Install Dependencies and Start the Server Install the project dependencies: ```bash uv sync ``` Start the server: ```bash WEBHOOK_SERVER_DATA_DIR=/path/to/data uv run entrypoint.py ``` By default, the server starts on `0.0.0.0:5000` with `10` workers. You can override that in `config.yaml` with: - `ip-bind` - `port` - `max-workers` > **Note:** On startup, the server applies repository settings, resets in-progress check runs to queued, and creates or updates GitHub webhooks for every repository in `config.yaml`. > **Tip:** Validate the file before starting the server with `uv run webhook_server/tests/test_schema_validator.py "$WEBHOOK_SERVER_DATA_DIR/config.yaml"`. ## 5. Verify the Health Endpoint Once the server is running, check the health endpoint: ```bash curl http://127.0.0.1:5000/webhook_server/healthcheck ``` You should get: ```json {"status":200,"message":"Alive"} ``` If you changed `port` in `config.yaml`, use that port instead of `5000`. 
This is the same endpoint the container health check uses. > **Note:** A healthy response means the web server is up. It does not confirm that GitHub can reach your public `webhook-ip` yet. At this point, the process is running and listening for webhook traffic on `/webhook_server`. If GitHub can reach the URL you set in `webhook-ip`, the server is ready to receive events. --- Source: docker-deployment.md # Docker and Container Deployment `github-webhook-server` ships with a container image that is built around Podman-in-container. That matters for deployment: this is not a thin FastAPI-only image. It is designed to run the webhook server itself and, when repository configuration enables it, run nested Podman commands for repository automation such as building and pushing images. ## The container image The top-level `Dockerfile` makes the intent clear: ```dockerfile FROM quay.io/podman/stable:v5 EXPOSE 5000 ENV USERNAME="podman" ENV HOME_DIR="/home/$USERNAME" ENV BIN_DIR="$HOME_DIR/.local/bin" ENV PATH="$PATH:$BIN_DIR:$HOME_DIR/.npm-global/bin" \ DATA_DIR="$HOME_DIR/data" \ APP_DIR="$HOME_DIR/github-webhook-server" ``` ```dockerfile USER $USERNAME WORKDIR $HOME_DIR ENV UV_PYTHON=python3.13 \ UV_COMPILE_BYTECODE=1 \ UV_NO_SYNC=1 \ UV_CACHE_DIR=${APP_DIR}/.cache \ PYTHONUNBUFFERED=1 HEALTHCHECK CMD curl --fail http://127.0.0.1:5000/webhook_server/healthcheck || exit 1 ENTRYPOINT ["tini", "--", "uv", "run", "entrypoint.py"] ``` The same `Dockerfile` also installs Podman tooling, `git`, `gh`, Node/NPM, `uv`, `tini`, and several other CLIs. In other words, the image is intentionally heavier than a typical Python web image because it needs to do more than serve HTTP. A few practical consequences: - The server listens on port `5000`. - It runs as the `podman` user inside the container. - It uses `tini`, which helps with signal handling and process cleanup. - The built-in health check calls `http://127.0.0.1:5000/webhook_server/healthcheck`. 
## Persistent data and volume mounts By default, the application reads its persistent state from `/home/podman/data`. That comes directly from the runtime configuration code: ```python self.data_dir: str = os.environ.get("WEBHOOK_SERVER_DATA_DIR", "/home/podman/data") self.config_path: str = os.path.join(self.data_dir, "config.yaml") ``` The GitHub App private key is also read from that same directory: ```python with open(os.path.join(config_.data_dir, "webhook-server.private-key.pem")) as fd: private_key = fd.read() ``` That means your persistent data mount needs to contain at least: - `config.yaml` - `webhook-server.private-key.pem` - `logs/` (created automatically if it does not exist) A good mental model is: | Container path | Purpose | Persist it? | | --- | --- | --- | | `/home/podman/data` | Main app data: config, GitHub App key, text logs, structured webhook logs | Yes | | `/tmp/storage-run-1000` | Nested Podman runtime/storage used by in-container Podman operations | Use a dedicated disposable mount | The structured webhook logs are written under `logs/` as daily files such as `webhooks_2026-03-18.json`. Text logs also live under `logs/`, using names from `config.yaml` such as `webhook-server.log`, `mcp_server.log`, and `logs_server.log`. > **Tip:** If you keep the default in-container path `/home/podman/data`, you do not need to set `WEBHOOK_SERVER_DATA_DIR`. Only set that environment variable if you intentionally mount the data directory somewhere else inside the container. > **Tip:** Keep the `:Z` suffix on the persistent bind mount on SELinux-enabled hosts. The checked-in example uses it so the container can read `config.yaml`, the private key, and log files correctly. 
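The path resolution quoted above, together with the daily structured-log naming, can be sketched briefly. `expected_paths` is a hypothetical helper written for illustration, not part of the server:

```python
import datetime
import os

# Sketch of how the persistent paths described above resolve.
# `expected_paths` is a hypothetical helper, not server code.
def expected_paths(env: dict[str, str]) -> dict[str, str]:
    # Same default as the runtime configuration code quoted above.
    data_dir = env.get("WEBHOOK_SERVER_DATA_DIR", "/home/podman/data")
    today = datetime.date.today().isoformat()  # e.g. "2026-03-18"
    return {
        "config": os.path.join(data_dir, "config.yaml"),
        "private_key": os.path.join(data_dir, "webhook-server.private-key.pem"),
        # Structured webhook logs are written as daily JSON files under logs/.
        "structured_log": os.path.join(data_dir, "logs", f"webhooks_{today}.json"),
    }

paths = expected_paths({})  # no override -> container default
print(paths["config"])  # /home/podman/data/config.yaml
```

If you mount the host directory at the default `/home/podman/data`, the empty-environment case above is exactly what the container uses.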
## The example Compose deployment The repository includes this example in `examples/docker-compose.yaml`: ```yaml services: github-webhook-server: container_name: github-webhook-server build: ghcr.io/myk-org/github-webhook-server:latest volumes: - "./webhook_server_data_dir:/home/podman/data:Z" # Should include config.yaml and webhook-server.private-key.pem # Mount temporary directories to prevent boot ID mismatch issues - "/tmp/podman-storage-${USER:-1000}:/tmp/storage-run-1000" environment: - PUID=1000 - PGID=1000 - TZ=Asia/Jerusalem - MAX_WORKERS=50 # Defaults to 10 if not set - WEBHOOK_SERVER_IP_BIND=0.0.0.0 # IP to listen - WEBHOOK_SERVER_PORT=5000 # Port to listen - WEBHOOK_SECRET= # If set verify hook is a valid hook from Github - VERIFY_GITHUB_IPS=1 # Verify hook request is from GitHub IPs - VERIFY_CLOUDFLARE_IPS=1 # Verify hook request is from Cloudflare IPs - ENABLE_LOG_SERVER=true # Enable log viewer endpoints (default: false) - ENABLE_MCP_SERVER=false # Enable MCP server for AI agent integration (default: false) ports: - "5000:5000" privileged: true restart: unless-stopped ``` What this example does: - It mounts a persistent host directory into `/home/podman/data`. - It mounts a second host directory into `/tmp/storage-run-1000` for nested Podman runtime state. - It publishes container port `5000`. - It runs the container in `privileged` mode. - It uses `restart: unless-stopped` for long-running deployments. > **Note:** The checked-in example puts the registry reference `ghcr.io/myk-org/github-webhook-server:latest` under the `build:` key. In standard Docker Compose semantics, a registry reference belongs under `image:`; use `build:` only when pointing at a local build context such as `.`. The important deployment details in the example are the volume mounts, port mapping, and `privileged: true`.
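Given the note about `build:` versus `image:`, a minimal corrected fragment would pull the published image instead. This is an illustrative adaptation of the checked-in example, not a file from the repository:

```yaml
services:
  github-webhook-server:
    # Registry reference goes under image:, not build:
    image: ghcr.io/myk-org/github-webhook-server:latest
    volumes:
      - "./webhook_server_data_dir:/home/podman/data:Z"
      - "/tmp/podman-storage-${USER:-1000}:/tmp/storage-run-1000"
    ports:
      - "5000:5000"
    privileged: true
    restart: unless-stopped
```

Start it with `docker compose up -d` (or `podman-compose up -d`) from the directory containing the file, after placing `config.yaml` and `webhook-server.private-key.pem` in `./webhook_server_data_dir`.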
## Health checks The application exposes a dedicated health endpoint: ```python @FASTAPI_APP.get(f"{APP_URL_ROOT_PATH}/healthcheck", operation_id="healthcheck") def healthcheck() -> dict[str, Any]: return {"status": requests.codes.ok, "message": "Alive"} ``` The image wires that into the container health check: ```dockerfile HEALTHCHECK CMD curl --fail http://127.0.0.1:5000/webhook_server/healthcheck || exit 1 ``` A healthy container means the web process is up and answering on port `5000`. It does not mean every webhook has been processed successfully. > **Note:** Webhook delivery handling is asynchronous. The main webhook endpoint returns `200 OK` after validation and queueing, so successful HTTP responses do not automatically mean that all downstream GitHub operations succeeded. For real troubleshooting, check the logs in the mounted `logs/` directory. ## What belongs in `config.yaml` Most deployment settings are read from the mounted `config.yaml`, not from environment variables. The checked-in example config shows the expected style: ```yaml log-level: INFO # Set global log level, change take effect immediately without server restart log-file: webhook-server.log # Set global log file, change take effect immediately without server restart mcp-log-file: mcp_server.log # Set global MCP log file, change take effect immediately without server restart logs-server-log-file: logs_server.log # Set global Logs Server log file, change take effect immediately without server restart mask-sensitive-data: true github-app-id: 123456 webhook-ip: # Full URL with path ``` If you use the server's container-build automation, the per-repository container settings also live in `config.yaml`: ```yaml repositories: my-repository: name: my-org/my-repository container: username: password: repository: tag: release: true build-args: - my-build-arg1=1 - my-build-arg2=2 args: - --format docker ``` For containerized deployments, put these runtime settings in `config.yaml`: - `webhook-ip` - 
`ip-bind` - `port` - `max-workers` - `webhook-secret` - `verify-github-ips` - `verify-cloudflare-ips` > **Warning:** The checked-in Compose example shows `MAX_WORKERS`, `WEBHOOK_SERVER_IP_BIND`, `WEBHOOK_SERVER_PORT`, `WEBHOOK_SECRET`, `VERIFY_GITHUB_IPS`, and `VERIFY_CLOUDFLARE_IPS` as environment variables, but the application code reads those values from `config.yaml` keys (`max-workers`, `ip-bind`, `port`, `webhook-secret`, `verify-github-ips`, and `verify-cloudflare-ips`). The environment variables consumed directly at runtime are `WEBHOOK_SERVER_DATA_DIR`, `ENABLE_LOG_SERVER`, and `ENABLE_MCP_SERVER`. The Podman cleanup script also reads `PUID`. `PGID` appears in the example, but the application code does not read it. > **Note:** `ENABLE_LOG_SERVER` and `ENABLE_MCP_SERVER` are enabled only when they are set to the literal string `true`. > **Note:** `webhook-ip` must be the external URL GitHub should call, and it must include the `/webhook_server` path. If you change `webhook-ip` or `webhook-secret`, restart the container so the startup webhook reconciliation can update GitHub with the new values. ## Startup behavior and operational caveats Container startup does more than launch Uvicorn. The entrypoint runs Podman cleanup and repository/webhook setup first: ```python if __name__ == "__main__": # Run Podman cleanup before starting the application run_podman_cleanup() result = asyncio.run(repository_and_webhook_settings(webhook_secret=_webhook_secret)) uvicorn.run( "webhook_server.app:FASTAPI_APP", host=_ip_bind, port=int(_port), workers=int(_max_workers), reload=False, ) ``` That leads to a few operational caveats that are worth planning for: - Startup depends on valid mounted configuration. If `config.yaml` or `webhook-server.private-key.pem` is missing, the container will not start cleanly. - Startup also depends on GitHub access. 
Before the server begins listening, it reconciles repository settings and creates or updates GitHub webhooks using the configured `webhook-ip`. - If `verify-github-ips` or `verify-cloudflare-ips` is enabled, the app fetches allowlists at startup. If verification is enabled but no valid networks can be loaded, startup fails closed for security. - The second volume mount is intentionally disposable. The cleanup script removes stale runtime directories under `/tmp/storage-run-${PUID}` and then prunes stopped containers, dangling images, unused volumes, and unused networks from the nested Podman environment. - Use a dedicated host path for that nested Podman mount. Do not point it at shared or important host storage. - The checked-in build path for repository image automation uses Podman inside the container and builds with `--network=host`. That is one reason the example deployment keeps `privileged: true`. > **Warning:** `ENABLE_LOG_SERVER=true` exposes `/logs`, `/logs/api/*`, and `/logs/ws` without authentication. `ENABLE_MCP_SERVER=true` exposes `/mcp` without authentication. Treat both as internal-only endpoints and place them behind a trusted network or an authenticated reverse proxy. > **Note:** The webhook receiver and health check live under `/webhook_server`, but the optional log viewer lives under `/logs` and the optional MCP endpoint lives under `/mcp`. If you deploy behind a reverse proxy or ingress, route those paths explicitly. > **Tip:** Plan for log retention. The structured webhook logs are written as daily `webhooks_YYYY-MM-DD.json` files, and the code documents them as unbounded in size. Text logs are safer to rotate, but the JSON webhook summaries still need external cleanup or retention policies on long-running deployments. --- Source: configuration-model.md # Configuration Model `github-webhook-server` has three potential configuration layers: 1. The root of the server's `config.yaml` 2. The matching `repositories.` entry inside `config.yaml` 3. 
An optional `.github-webhook-server.yaml` in the repository itself Not every setting participates in all three layers, but when a repository-scoped setting does, the server resolves it from most specific to least specific: repository-local file first, then the repo entry in `config.yaml`, then the root of `config.yaml`. ```132:153:webhook_server/libs/config.py def get_value(self, value: str, return_on_none: Any = None, extra_dict: dict[str, Any] | None = None) -> Any: """ Get value from config Supports dot notation for nested values (e.g., "docker.username", "pypi.token") Order of getting value: 1. Local repository file (.github-webhook-server.yaml) 2. Repository level global config file (config.yaml) 3. Root level global config file (config.yaml) """ if extra_dict: result = self._get_nested_value(value, extra_dict) if result is not None: return result for scope in (self.repository_data, self.root_data): result = self._get_nested_value(value, scope) if result is not None: return result return return_on_none ``` Think of the model like this: root `config.yaml` provides shared defaults, `repositories.` provides server-side exceptions for one repository, and `.github-webhook-server.yaml` lets a repository carry some of its own runtime behavior in version control. ## Where `config.yaml` Lives By default, the server reads `config.yaml` from `/home/podman/data/config.yaml`. Set `WEBHOOK_SERVER_DATA_DIR` if you want a different base directory. The Docker example mounts `./webhook_server_data_dir` into `/home/podman/data`, which is why that path is the default. > **Warning:** `config.yaml` is required, and `repositories:` must exist and be non-empty. Missing file or missing `repositories:` is a hard error. ## Server-Managed `config.yaml` Both the global defaults and the per-repository overrides live in the same file. Root keys apply to every repository unless a repo-specific entry overrides them. 
```3:190:examples/config.yaml log-level: INFO # Set global log level, change take effect immediately without server restart log-file: webhook-server.log # Set global log file, change take effect immediately without server restart github-app-id: 123456 # GitHub app id github-tokens: - - webhook-ip: # ... repositories: my-repository: name: my-org/my-repository log-level: DEBUG # Override global log-level for repository log-file: my-repository.log # Override global log-file for repository events: - push - pull_request - pull_request_review - pull_request_review_thread - issue_comment - check_run - status # ... github-tokens: # override GitHub tokens per repository - - ``` Use the root of `config.yaml` for shared or server-level values such as `github-app-id`, global `github-tokens`, `webhook-ip`, global `labels`, and other defaults you want every repository to inherit. Use `repositories.` for repo-specific settings that the server must know before it starts processing that repository. Common examples are `name`, `events`, and repo-specific `github-tokens`. > **Note:** The key under `repositories:` is the short repository name, while `name:` stores the full `owner/repo`. In the example above, `my-repository` is the lookup key and `my-org/my-repository` is the actual GitHub repository. Because lookup is by short name, avoid configuring two different repos that share the same short name. ## Repository-Managed `.github-webhook-server.yaml` Use `.github-webhook-server.yaml` when you want repository-owned behavior to live with the code and be reviewed in pull requests. This is a good fit for runtime settings such as `tox`, `pypi`, `container`, `pre-commit`, `conventional-title`, `ai-features`, `minimum-lgtm`, `create-issue-for-new-pr`, and label-related behavior. 
```118:162:examples/.github-webhook-server.yaml conventional-title: "feat,fix,build,chore,ci,docs,style,refactor,perf,test,revert" minimum-lgtm: 2 create-issue-for-new-pr: true # Create tracking issues for new PRs cherry-pick-assign-to-pr-author: true # Assign cherry-pick PRs to the original PR author # ... ai-features: ai-provider: "claude" # claude | gemini | cursor ai-model: "claude-opus-4-6[1m]" conventional-title: enabled: true mode: suggest timeout-minutes: 10 resolve-cherry-pick-conflicts-with-ai: enabled: true timeout-minutes: 10 ``` If the file is missing, the server simply falls back to `config.yaml`. If the file exists but contains invalid YAML, loading it fails instead of being silently ignored. The local file is not applied first thing at startup. The webhook runtime loads base config, selects the API token, and only then fetches `.github-webhook-server.yaml` and reapplies the supported repository settings. ```114:151:webhook_server/libs/github_api.py # Get config without .github-webhook-server.yaml data self._repo_data_from_config(repository_config={}) github_api, self.token, self.api_user = get_api_with_highest_rate_limit( config=self.config, repository_name=self.repository_name ) # ... # Once we have a repository, we can get the config from .github-webhook-server.yaml local_repository_config = self.config.repository_local_data( github_api=github_api, repository_full_name=self.repository_full_name ) # Call _repo_data_from_config() again to update self args from .github-webhook-server.yaml self._repo_data_from_config(repository_config=local_repository_config) ``` > **Warning:** `.github-webhook-server.yaml` is best thought of as a runtime-behavior layer, not a full replacement for `config.yaml`. Keep administrative settings such as `events`, repo tokens, logging, branch protection, draft-command rules, `pr-size-thresholds`, and `test-oracle` in `config.yaml`. 
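The layered lookup that `get_value` implements (local repository file, then the repo entry in `config.yaml`, then the root) can be modeled with a short sketch. This is a simplified illustration of the documented precedence, not the server's code; `lookup` and `get_nested` are hypothetical helpers:

```python
from typing import Any

def get_nested(dotted_key: str, scope: dict) -> Any:
    """Resolve dot notation such as 'container.tag' inside one scope."""
    value: Any = scope
    for part in dotted_key.split("."):
        if not isinstance(value, dict) or part not in value:
            return None
        value = value[part]
    return value

def lookup(dotted_key: str, local: dict, repo: dict, root: dict, default: Any = None) -> Any:
    """Most specific scope wins; YAML null means 'not set here, keep falling back'."""
    for scope in (local, repo, root):
        result = get_nested(dotted_key, scope)
        if result is not None:
            return result
    return default

root = {"minimum-lgtm": 1, "container": {"tag": "latest"}}
repo = {"minimum-lgtm": None}             # null -> inherit from root
local = {"container": {"tag": "v1.2.3"}}  # whole object replaces; no deep merge

print(lookup("minimum-lgtm", local, repo, root))   # → 1
print(lookup("container.tag", local, repo, root))  # → v1.2.3
```

Note how `container` at the local level fully replaces the root object: that matches the documented behavior for most nested settings, where only `labels` gets special merge treatment.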
> **Note:** The repository-local file is fetched through GitHub's contents API without an explicit `ref`, so the default-branch version is the one the server sees. A config change in a pull request does not become active until that file reaches the default branch. ## Merge Rules The precedence chain is key-by-key, not file-by-file. In practice, that means: - If a key is missing at the repository-local level, lookup continues to the repo entry in `config.yaml`, then to the root. - If a higher-precedence key is present but set to YAML `null`, the server treats it as not set and keeps falling back. - For most nested objects, the higher-precedence object replaces the lower-precedence object instead of being recursively merged. - `labels` is the main special case: the server merges the top-level `labels` object, and then merges `labels.colors` again so you can override a few colors without redefining every color. A concrete example is in `examples/config.yaml`: the root `labels.colors.hold` is `red`, while the repo-specific `labels.colors.hold` is `purple`. For that repository, the effective `hold` color becomes `purple`, but the other global label colors still apply. The same merge behavior is used when `labels` comes from `.github-webhook-server.yaml`. > **Tip:** To inherit a lower-precedence value, omit the key entirely or set it to `null`. > **Tip:** When you override structured settings such as `container`, `branch-protection`, or `test-oracle`, restate every field you still need. Do not assume a deep merge unless that setting is explicitly documented as merged. ## Recommended Placement - Put server-wide defaults and startup-time settings in the root of `config.yaml`. - Put repo-specific server settings in `repositories.` inside `config.yaml`. - Put repository-owned runtime behavior in `.github-webhook-server.yaml` when you want config changes reviewed and versioned alongside the repository. 
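The `labels` special case from the merge rules above can be sketched as a two-level merge. This is an illustrative model of the documented behavior, not the server's implementation; `merge_labels` is a hypothetical helper:

```python
# Model of the `labels` merge described above: the top-level labels object
# is merged, then labels.colors is merged again, so a repo can override a
# few colors without redefining every color.
def merge_labels(lower: dict, higher: dict) -> dict:
    merged = {**lower, **higher}  # shallow merge of the labels object
    merged["colors"] = {
        **lower.get("colors", {}),   # start from the lower-precedence colors
        **higher.get("colors", {}),  # override only the redefined keys
    }
    return merged

root_labels = {
    "enabled-labels": ["hold", "verified"],
    "colors": {"hold": "red", "verified": "green"},
}
repo_labels = {"colors": {"hold": "purple"}}  # the hold-color example from examples/config.yaml

effective = merge_labels(root_labels, repo_labels)
print(effective["colors"])  # → {'hold': 'purple', 'verified': 'green'}
```

The repo override changes only `hold`; `verified` and `enabled-labels` still come from the root, matching the `examples/config.yaml` scenario described above.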
> **Tip:** For webhook-time repository behavior, changes are picked up on later webhook deliveries because the server re-reads `config.yaml` and re-fetches `.github-webhook-server.yaml` instead of keeping one permanently merged config in memory. --- Source: configuration-reference.md # Configuration Reference `github-webhook-server` reads its main configuration from `config.yaml` in the server data directory. In code, that directory defaults to `/home/podman/data`, so the default config path is `/home/podman/data/config.yaml`. Relative log file names are resolved under `/logs/`. The checked-in example file shows the top-level shape: ```3:21:examples/config.yaml log-level: INFO # Set global log level, change take effect immediately without server restart log-file: webhook-server.log # Set global log file, change take effect immediately without server restart mcp-log-file: mcp_server.log # Set global MCP log file, change take effect immediately without server restart logs-server-log-file: logs_server.log # Set global Logs Server log file, change take effect immediately without server restart mask-sensitive-data: true # Mask sensitive data in logs (default: true). 
Set to false for debugging (NOT recommended in production) # Server configuration disable-ssl-warnings: true # Disable SSL warnings (useful in production to reduce log noise from SSL certificate issues) github-app-id: 123456 # GitHub app id github-tokens: - - webhook-ip: # Full URL with path (e.g., https://your-domain.com/webhook_server or https://smee.io/your-channel) docker: # Used to pull images from docker.io username: password: ``` Repository-specific settings live under `repositories`: ```139:183:examples/config.yaml repositories: my-repository: name: my-org/my-repository log-level: DEBUG # Override global log-level for repository log-file: my-repository.log # Override global log-file for repository mask-sensitive-data: false # Override global setting - disable masking for debugging this specific repo (NOT recommended in production) slack-webhook-url: # Send notification to slack on several operations verified-job: true pypi: token: events: # To listen to all events do not send events - push - pull_request - pull_request_review - pull_request_review_thread - issue_comment - check_run - status tox: main: all # Run all tests in tox.ini when pull request parent branch is main dev: testenv1,testenv2 # Run testenv1 and testenv2 tests in tox.ini when pull request parent branch is dev pre-commit: true # Run pre-commit check protected-branches: dev: [] main: # set [] in order to set all defaults run included include-runs: - "pre-commit.ci - pr" - "WIP" exclude-runs: - "SonarCloud Code Analysis" container: username: password: repository: tag: release: true # Push image to registry on new release with release as the tag build-args: # build args to send to podman build command - my-build-arg1=1 - my-build-arg2=2 args: # args to send to podman build command - --format docker ``` > **Note:** In `repositories`, the map key is the short GitHub repository name, while `name` inside the block is the full `owner/repo`. > **Note:** This page lists keys in `config.yaml` form. 
The sample `.github-webhook-server.yaml` uses the same repository-level shape without the surrounding `repositories.` wrapper. > **Note:** Most repository settings replace the global value entirely. Two important exceptions are `branch-protection`, which is merged with global defaults, and `labels.colors`, where repository colors override only the keys you redefine. > **Warning:** Use exact branch names for `tox` and `protected-branches`, and use string values such as `all` or `testenv1,testenv2` for `tox`. The current runner/setup code looks up branches by exact key and builds the tox command from a string value. ## Global settings ### Logging and diagnostics - `log-level`: Global application log level. Allowed values are `INFO` and `DEBUG`. - `log-file`: Main webhook server log file. Relative names are written under `/logs/`; absolute paths are used as-is. - `mcp-log-file`: Separate log file for the optional MCP server. Default is `mcp_server.log`. - `logs-server-log-file`: Separate log file for the optional log viewer / logs server. Default is `logs_server.log`. - `mask-sensitive-data`: Enables log redaction. Default is `true`. When enabled, the logger masks secrets such as tokens, passwords, webhook secrets, Slack webhook URLs, and similar values. > **Warning:** `labels.colors` and `pr-size-thresholds.*.color` expect CSS3 color names such as `green`, `orange`, `royalblue`, and `darkred`. The label code converts those names to hex internally; hex strings are not the documented input format. ### Server, webhook, and security - `webhook-ip`: The public webhook URL that GitHub should call. Include the full path, for example `https://example.com/webhook_server`. - `ip-bind`: The bind address for the FastAPI / uvicorn server. If omitted, startup defaults to `0.0.0.0`. - `port`: The listening port. If omitted, startup defaults to `5000`. - `max-workers`: Uvicorn worker count. If omitted, startup defaults to `10`. 
- `webhook-secret`: Optional shared secret for GitHub webhook signature verification. When set, the server validates the incoming `x-hub-signature-256` header and uses the same secret when it creates GitHub webhooks. - `verify-github-ips`: If `true`, only accept webhook requests from GitHub’s published webhook IP ranges. - `verify-cloudflare-ips`: If `true`, also trust Cloudflare’s published IP ranges. This is useful when traffic reaches the server through Cloudflare. - `disable-ssl-warnings`: If `true`, suppress `urllib3` SSL warnings during runtime. > **Warning:** IP allowlist verification is fail-closed. If `verify-github-ips` and/or `verify-cloudflare-ips` are enabled but the allowlists cannot be loaded, the server aborts startup instead of accepting requests insecurely. ### GitHub authentication and shared defaults - `github-app-id`: GitHub App ID used for app-scoped repository management. In practice this goes with a `webhook-server.private-key.pem` file in the data directory and an installed GitHub App. - `github-tokens`: List of GitHub tokens used for normal API calls. The server checks all configured tokens and picks the one with the highest remaining rate limit. - `docker.username`: Docker Hub username used for the startup `podman login` step. - `docker.password`: Docker Hub password used for the startup `podman login` step. - `default-status-checks`: Extra check or status context names that should always be part of the generated branch-protection rules. Use exact GitHub context names. - `auto-verified-and-merged-users`: Global default list of users or bots whose PRs can be auto-verified and auto-merged when the other merge rules are satisfied. - `auto-verify-cherry-picked-prs`: Global default for automatic verification of cherry-picked PRs. Default is `true`. - `create-issue-for-new-pr`: Global default for creating a tracking issue when a new PR opens. Default is `true`. 
- `cherry-pick-assign-to-pr-author`: Global default for assigning cherry-pick PRs to the original PR author. Default is `true`. - `allow-commands-on-draft-prs`: Global default for user commands on draft PRs. Omit it to block commands on draft PRs. Set it to `[]` to allow all commands. Set it to a list such as `["build-and-push-container", "retest"]` to allow only those command names. > **Tip:** Repository-level `github-tokens` replace the global token list for that repository. During webhook processing, the server also adds the GitHub users behind the active API tokens to the auto-verified user list. ### Labels and PR size The sample config includes label and size settings like this: ```47:102:examples/config.yaml labels: # Optional: List of label categories to enable # If not set, all labels are enabled. If set, only listed categories are enabled. # Note: reviewed-by labels (approved-*, lgtm-*, etc.) are always enabled and cannot be disabled enabled-labels: - verified - hold - wip - needs-rebase - has-conflicts - can-be-merged - size - branch - cherry-pick - automerge # Optional: Custom colors for labels (CSS3 color names) colors: hold: red verified: green wip: orange needs-rebase: darkred has-conflicts: red can-be-merged: limegreen automerge: green # Dynamic label prefixes approved-: green lgtm-: yellowgreen changes-requested-: orange commented-: gold cherry-pick-: coral branch-: royalblue # Global PR size label configuration (optional) # Define custom categories based on total lines changed (additions + deletions) # threshold: positive integer or 'inf' for unbounded largest category # color: CSS3 color name (e.g., red, green, blue, lightgray, darkorange) # Infinity behavior: 'inf' ensures all PRs beyond largest finite threshold are captured # Always sorted last, regardless of definition order pr-size-thresholds: Tiny: threshold: 10 # PRs with 0-9 lines changed color: lightgray Small: threshold: 50 # PRs with 10-49 lines changed color: green Medium: threshold: 
150 # PRs with 50-149 lines changed color: orange Large: threshold: 300 # PRs with 150-299 lines changed color: red Massive: threshold: inf # PRs with 300+ lines changed (unbounded largest category) color: darkred # 'inf' means no upper limit - catches all PRs above 300 lines ``` - `labels.enabled-labels`: List of label categories to allow. Valid categories are `verified`, `hold`, `wip`, `needs-rebase`, `has-conflicts`, `can-be-merged`, `size`, `branch`, `cherry-pick`, and `automerge`. If omitted, all configurable categories are enabled. If set to `[]`, all configurable categories are disabled. Review-state labels such as `approved-*`, `lgtm-*`, `changes-requested-*`, and `commented-*` are always enabled. - `labels.colors`: Map of label names or dynamic label prefixes to CSS3 color names. Exact keys such as `hold` or `verified` affect one label. Prefix keys such as `approved-` or `branch-` affect any label that starts with that prefix. - `pr-size-thresholds.