Query Tee (service)

The query-tee is a standalone service which can be used for testing purposes to compare the query performances of 2+ backend systems (ie. Cortex clusters) ingesting the same exact series.

This service exposes Prometheus-compatible read API endpoints and, for each received request, performs the request against all backends tracking the response time of each backend and then sends back to the client one of the received responses.

How to run it

You can run query-tee in two ways:

Build it from sources
```
go run ./cmd/query-tee -help
```

Run it via the provided Docker image

docker run quay.io/cortexproject/query-tee -help

The service requires at least 1 backend endpoint (but 2 are required in order to compare performances) configured as comma-separated HTTP(S) URLs via the CLI flag -backend.endpoints. For each incoming request, query-tee will clone the request and send it to each backend, tracking performance metrics for each backend before sending back the response to the client.

How it works

API endpoints

The following Prometheus API endpoints are supported by query-tee:

/api/v1/query (GET)
/api/v1/query_range (GET)
/api/v1/labels (GET)
/api/v1/label/{name}/values (GET)
/api/v1/series (GET)
/api/v1/metadata (GET)
/api/v1/alerts (GET)
/api/v1/rules (GET)

Pass-through requests

query-tee supports acting as a transparent proxy for requests to routes not matching any of the documented API endpoints above. When enabled, those requests are passed on to just the configured preferred backend. To activate this feature it requires setting -proxy.passthrough-non-registered-routes=true flag and configuring a preferred backend.

Authentication

query-tee supports HTTP basic authentication. It allows either to configure username and password in the backend URL, to forward the request auth to the backend or merge the two.

The request sent from the query-tee to the backend includes HTTP basic authentication when one of the following conditions are met:

If the endpoint URL has username and password, query-tee uses it.
If the endpoint URL has username only, query-tee keeps the username and inject the password received in the incoming request (if any).
If the endpoint URL has no username and no password, query-tee forwards the incoming request basic authentication (if any).

Backend response selection

query-tee allows to configure a preferred backend from which picking the response to send back to the client. The preferred backend can be configured via the CLI flag -backend.preferred=<hostname>, setting it to the hostname of the preferred backend.

When a preferred backend is set, query-tee sends back to the client:

The preferred backend response if the status code is 2xx or 4xx
Otherwise, the first received 2xx or 4xx response if at least a backend succeeded
Otherwise, the first received response

When a preferred backend is not set, query-tee sends back to the client:

The first received 2xx or 4xx response if at least a backend succeeded
Otherwise, the first received response

Note: from the query-tee perspective, a backend request is considered successful even if the status code is 4xx because it generally means the error is due to an invalid request and not to a backend issue.

Backend results comparison

query-tee allows to optionally enable the query results comparison between two backends. The results comparison can be enabled via the CLI flag -proxy.compare-responses=true and requires exactly two configured backends with a preferred one.

When the comparison is enabled, the query-tee compares the response received from the two configured backends and logs a message for each query whose results don’t match, as well as keeps track of the number of successful and failed comparison through the metric cortex_querytee_responses_compared_total.

Floating point sample values are compared with a small tolerance that can be configured via -proxy.value-comparison-tolerance. This prevents false positives due to differences in floating point values rounding introduced by the non deterministic series ordering within the Prometheus PromQL engine.

Slow backends

query-tee sends back to the client the first viable response as soon as available, without waiting to receive a response from all backends.

Exported metrics

query-tee exposes the following Prometheus metrics on the port configured via the CLI flag -server.metrics-port:

# HELP cortex_querytee_request_duration_seconds Time (in seconds) spent serving HTTP requests.
# TYPE cortex_querytee_request_duration_seconds histogram
cortex_querytee_request_duration_seconds_bucket{backend="<hostname>",method="<method>",route="<route>",status_code="<status>",le="<bucket>"}
cortex_querytee_request_duration_seconds_sum{backend="<hostname>",method="<method>",route="<route>",status_code="<status>"}
cortex_querytee_request_duration_seconds_count{backend="<hostname>",method="<method>",route="<route>",status_code="<status>"}

# HELP cortex_querytee_responses_total Total number of responses sent back to the client by the selected backend.
# TYPE cortex_querytee_responses_total counter
cortex_querytee_responses_total{backend="<hostname>",method="<method>",route="<route>"}

# HELP cortex_querytee_responses_compared_total Total number of responses compared per route name by result.
# TYPE cortex_querytee_responses_compared_total counter
cortex_querytee_responses_compared_total{route="<route>",result="<success|fail>"}