Health service
Transcription:Real-Time Deployments:ContainerThe container is able to expose an HTTP health service, which offers startup, liveness and readiness probes. This is accessible from Port 8001, and has 3 endpoints, started
, live
and ready
. These can be used to see whether all services in the container are running or active respectively. This may be especially helpful if you are deploying the container into a Kubernetes cluster. If you are using Kubernetes, we recommend that you also refer to the Kubernetes documentation around
liveness and readiness probes.
The health service is enabled by default and should run as a subprocess of the main entrypoint to the container.
Endpoints
The health service offers three endpoints:
/started
This endpoint provides a startup probe. It can be queried using an HTTP GET request. You must include the relevant port, 8001, in the request.
This probe indicates whether all services in the container have successfully started. Once it returns a successful response code, it should never return an unsuccessful response code later.
Possible responses:
200
if all of the services in the container have successfully started.503
otherwise.
A JSON object is also returned in the body of the response, indicating the status.
Example:
$ curl -i address.of.container:8001/started
HTTP/1.0 200 OK
Server: BaseHTTP/0.6 Python/3.8.5
Date: Mon, 08 Feb 2021 12:46:21 GMT
Content-Type: application/json
{
"started": true
}
/live
This endpoint provides a liveness probe. It can be queried using an HTTP GET request. You must include the relevant port, 8001, in the request.
This probe indicates whether all services in the container are active. The services in the container send regular updates to the health service, if they don't send an update for more than 10 seconds then they will be marked as 'dead' and this endpoint will return an unsuccessful response code. For example, if the WebSocket server in the container were to crash, this endpoint should indicate that.
Possible responses:
200
if all of the services in the container have successfully started, and have recently sent an update to the health service.503
otherwise.
A JSON object is also returned in the body of the response, indicating the status.
Example:
$ curl -i address.of.container:8001/live
HTTP/1.0 200 OK
Server: BaseHTTP/0.6 Python/3.8.5
Date: Mon, 08 Feb 2021 12:46:45 GMT
Content-Type: application/json
{
"alive": true
}
/ready
This endpoint provides a readiness probe. It can be queried using an HTTP GET request.
The container has been designed to process one audio stream at a time. This probe indicates whether the container is currently transcribing something. If the server is already transcribing an audio stream, it is considered not ready. This probe can be used as a scaling mechanism.
Note: The readiness check is accurate within a 2-second resolution. If you do use this probe for load balancing, be aware that bursts of traffic within that 2-second window could all be allocated to a single container since it's readiness state will not change.
Possible responses:
200
if the container is not currently transcribing audio.503
otherwise.
In the body of the response there is also a JSON object with the current status.
Example:
$ curl -i address.of.container:8001/ready
HTTP/1.0 200 OK
Server: BaseHTTP/0.6 Python/3.8.5
Date: Mon, 08 Feb 2021 12:47:05 GMT
Content-Type: application/json
{
"ready": true
}