Chapter 7: Integration Test Job Script

In the previous chapter, Stateless Queries, we learned how to check if ClickHouse can answer simple questions like 1 + 1. We ran these tests against a single, isolated Docker Server Image.

But ClickHouse is rarely used alone. It is designed to work in clusters, replicate data between servers, and talk to external systems like Kafka or S3.

How do we test what happens if a server crashes? Or if the network fails? A simple SQL query cannot simulate a crashing server.

This brings us to the Integration Test Job Script.

The Problem: Simulating the Real World

Imagine you are testing a new car.

Stateless Test: You check if the radio turns on. (Easy, done in the garage).
Integration Test: You drive the car on a highway, in the rain, while towing a trailer.

The Challenge: To run an "Integration Test" for a database, you need a complex environment. You might need:

Three ClickHouse servers (to test copying data).
One ZooKeeper server (to coordinate them).
A simulated network that can "break."

Setting this up manually for every test would take hours. We need a script that can create this entire "virtual world," run the test, and then destroy it—all in a few minutes.

Central Use Case: We want a script that can:

Take a list of Python test files (e.g., test_replication.py).
Automatically spin up the required Docker containers for each test.
Run the test logic.
Clean everything up.

Key Concepts

To solve this, we use a Python script named integration_test_job.py. It uses the following tools:

1. Pytest

Pytest is a standard framework for testing in Python. While our Stateless queries were just SQL files, Integration tests are actual Python programs. This allows us to write logic like: "Start Server A, insert data, kill Server A, check if data is on Server B."

2. The Runner Script

The script integration_test_job.py is the manager. It doesn't write the tests; it organizes them. It prepares the Docker environment and tells Pytest which files to execute.

3. Test Batching (Sharding)

We have hundreds of integration tests. Running them one by one would take too long (10+ hours). Batching means we split the tests into groups (e.g., 5 groups).

Runner 1 runs Group 1.
Runner 2 runs Group 2.

This allows us to run everything in parallel.

How to Use the Script

The CI system (Praktika) invokes this script. It tells the script which "slice" of tests to run.

Defining the Arguments

The script accepts arguments to know which batch of tests it is responsible for.

# integration_test_job.py (Simplified)
import argparse

def parse_args():
    parser = argparse.ArgumentParser()
    # Which "slice" of tests to run? (e.g., 1/5)
    parser.add_argument("--shard", type=int, default=1)
    parser.add_argument("--shards", type=int, default=1)
    
    # Where is the ClickHouse binary?
    parser.add_argument("--binary", required=True)
    
    return parser.parse_args()

Explanation:

--shard 1 and --shards 5 mean: "I am worker #1 out of 5 total workers." The script uses these two numbers to pick roughly one fifth (20%) of the tests to run.
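The selection math itself is not shown in the simplified script. One way to do it, assuming a plain round-robin split over the sorted test list, is sketched below. `select_shard` is a hypothetical helper, not a function confirmed to exist in integration_test_job.py, and the real script may divide the work differently.

```python
def select_shard(tests, shard, shards):
    """Return the tests assigned to the 1-based `shard` out of `shards` workers.

    Assumption: a simple round-robin split; the real job script may differ.
    """
    if shards < 1 or not 1 <= shard <= shards:
        raise ValueError(f"invalid shard {shard}/{shards}")
    # Worker k takes items k-1, k-1+shards, k-1+2*shards, ... of the sorted list.
    return sorted(tests)[shard - 1::shards]


# Ten made-up test files split across five workers: two files per worker.
example_tests = [f"test_{i:02d}.py" for i in range(10)]
print(select_shard(example_tests, 1, 5))  # → ['test_00.py', 'test_05.py']
```

Round-robin keeps the shards within one test of each other in size, but it does not balance by run time; a slow test can still make one shard finish late.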

Finding the Tests

The script scans the directory tests/integration/ to find all available test files.

import os

def get_all_tests(test_dir):
    # Find all files starting with "test_" and ending with ".py"
    all_files = os.listdir(test_dir)
    tests = [f for f in all_files if f.startswith("test_") and f.endswith(".py")]
    
    # Sort them to ensure every runner sees the same list
    return sorted(tests)

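Explanation: We look for files like test_replicated_merge_tree.py. Sorting is crucial because os.listdir returns entries in arbitrary order: if Runner 1 and Runner 2 see the files in a different order, their slices no longer line up, and a test might be skipped or run twice.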
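To make the risk concrete, here is a small sketch with two runners that each take every second entry of their own directory listing. The file names and the slicing rule are made up for illustration.

```python
# Hypothetical listings as two machines might return them.
listing_runner_1 = ["test_b.py", "test_a.py", "test_c.py", "test_d.py"]
listing_runner_2 = ["test_a.py", "test_b.py", "test_c.py", "test_d.py"]

# Without sorting: each runner slices its own, differently ordered list.
unsorted_1 = listing_runner_1[0::2]  # runner 1 takes even positions
unsorted_2 = listing_runner_2[1::2]  # runner 2 takes odd positions
covered = unsorted_1 + unsorted_2
missing = sorted(set(listing_runner_2) - set(covered))
duplicated = sorted(t for t in set(covered) if covered.count(t) > 1)
print("skipped:", missing)        # → skipped: ['test_a.py']
print("run twice:", duplicated)   # → run twice: ['test_b.py']

# With sorting: both runners slice the same list, so nothing is lost.
sorted_1 = sorted(listing_runner_1)[0::2]
sorted_2 = sorted(listing_runner_2)[1::2]
print(sorted(sorted_1 + sorted_2) == sorted(listing_runner_2))  # → True
```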

Under the Hood: The Execution Flow

When this script runs, it acts as a bridge between the Test Code and the Docker Environment.

Setup: The script prepares the Docker images (using the image created in Chapter 5).
Selection: It calculates which tests belong to this runner.
Execution: It loops through the tests and calls pytest.
Teardown: It kills any leftover containers to save memory.
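The four phases can be sketched as a single function. This is only an outline under stated assumptions: `run_job` and its four callable parameters are hypothetical names, and the real script is not known to be structured this way. The point it illustrates is that putting teardown in a `finally` block makes cleanup run even when a test raises.

```python
def run_job(prepare, select, run_one, cleanup):
    """Run the four phases in order; each phase is passed in as a callable.

    All four parameters are hypothetical hooks used only for this sketch.
    Returns a list of (test_name, passed) tuples.
    """
    prepare()  # Setup: make the images and binary available
    results = []
    try:
        for test in select():  # Selection: the tests for this runner
            results.append((test, run_one(test)))  # Execution
    finally:
        cleanup()  # Teardown: runs even if a test raises
    return results


# Stand-in hooks that only record the order in which they are called.
events = []
results = run_job(
    prepare=lambda: events.append("setup"),
    select=lambda: ["test_a.py", "test_b.py"],
    run_one=lambda test: events.append(test) or True,
    cleanup=lambda: events.append("teardown"),
)
print(events)  # → ['setup', 'test_a.py', 'test_b.py', 'teardown']
```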

Here is the flow:

sequenceDiagram
    participant CI as CI Runner
    participant Script as Job Script
    participant Pytest as Pytest
    participant Docker as Docker Engine
    CI->>Script: Run Batch 1/5
    Script->>Script: Calculate list of tests
    loop For each Test File
        Script->>Pytest: Run test_example.py
        Pytest->>Docker: Start Containers (Cluster)
        Docker-->>Pytest: Containers Ready
        Pytest->>Pytest: Run Python Logic
        Pytest->>Docker: Stop Containers
    end
    Script-->>CI: Report Results

Implementation Details: Invoking Pytest

The core of the script is simply building a command (a list of arguments) to launch pytest.

import subprocess

def run_test(test_name):
    # Construct the command
    cmd = [
        "pytest",
        "-v",               # Verbose output
        f"tests/integration/{test_name}"
    ]
    
    print(f"Running {test_name}...")
    # Execute the command and wait for it to finish
    subprocess.check_call(cmd)

Explanation:

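We use subprocess.check_call to run Pytest as if we typed it in the terminal.
If Pytest fails (exits with a non-zero code), check_call raises subprocess.CalledProcessError. The simplified function does not catch it, so the exception propagates, the script exits with an error, and the CI job is marked as failed.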
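One consequence of check_call is that the first failing test raises subprocess.CalledProcessError, and unless the caller handles it, the remaining tests in the batch never run. A possible alternative, assuming we want a result for every test, is to use subprocess.run and inspect the return code. `run_commands` is a hypothetical helper, and the stand-in commands below replace pytest so the sketch runs without pytest or Docker installed.

```python
import subprocess
import sys


def run_commands(commands):
    """Run every command, then return the labels of those that failed.

    `commands` maps a label (e.g. a test file name) to an argv list.
    """
    failed = []
    for name, cmd in commands.items():
        # run() does not raise on a non-zero exit code unless check=True.
        completed = subprocess.run(cmd)
        if completed.returncode != 0:
            failed.append(name)
    return failed


# In the job script each value would be ["pytest", "-v", "<path to test>"].
demo = {
    "test_ok.py": [sys.executable, "-c", "raise SystemExit(0)"],
    "test_bad.py": [sys.executable, "-c", "raise SystemExit(1)"],
}
failed = run_commands(demo)
print("failed:", failed)  # → failed: ['test_bad.py']
# A real job would finish with sys.exit(1 if failed else 0).
```

The trade-off is that a broken environment now produces a long list of failures instead of one early stop, so a real runner might also cap the number of failures it tolerates.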

Passing the Image

How does the test know which version of ClickHouse to use? The Job Script passes this information via Environment Variables.

import os

def set_environment(clickhouse_binary_path):
    # Tell the integration tests where the binary is
    os.environ['CLICKHOUSE_BINARY'] = clickhouse_binary_path
    
    # We can also pass the Docker Image tag here
    os.environ['CLICKHOUSE_IMAGE'] = "clickhouse/server:latest"

Explanation: When the Python test runs, it reads os.environ['CLICKHOUSE_BINARY']. This ensures that the test uses the exact binary we just built in Chapter 4, not some old version installed on the machine.
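On the receiving side, a test or a shared fixture would read the variable back. The sketch below reuses the variable name CLICKHOUSE_BINARY from the snippet above; `build_env`, `binary_from_env` and the example path are hypothetical and exist only for illustration. It also passes a copy of the environment to the child process instead of modifying os.environ globally, which is a design choice and not necessarily what the job script does.

```python
import os
import subprocess
import sys


def build_env(binary_path):
    """Return a copy of the current environment with the binary path added."""
    env = dict(os.environ)
    env["CLICKHOUSE_BINARY"] = binary_path
    return env


def binary_from_env(environ=os.environ):
    """Read the binary path; fail loudly if the job script did not set it."""
    try:
        return environ["CLICKHOUSE_BINARY"]
    except KeyError:
        raise RuntimeError("CLICKHOUSE_BINARY is not set") from None


# The child process stands in for pytest: it only echoes what it received.
child = [sys.executable, "-c",
         "import os; print(os.environ['CLICKHOUSE_BINARY'])"]
out = subprocess.run(child, env=build_env("/tmp/build/clickhouse"),
                     capture_output=True, text=True, check=True)
received = out.stdout.strip()
print(received)  # → /tmp/build/clickhouse
```

Failing with a clear error when the variable is missing avoids silently falling back to a ClickHouse binary that happens to be installed on the machine.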

Why This Matters

The Integration Test Job Script is the "Site Manager" for our construction work.

Automation: It handles the complexity of Docker without the developer needing to worry about it.
Parallelism: By using "sharding" (batches) across five runners, we can run 500 slow tests in roughly the time it takes one runner to run 100.
Consistency: It ensures every test gets a fresh, clean environment.

Summary

In this chapter, we learned about the Integration Test Job Script.

It is a Python script that orchestrates the testing process.
It uses Pytest to run the actual test logic.
It splits work into Batches so multiple runners can work at the same time.
It sets up the environment so tests use the correct Docker Image and Binary.

Now that we have the "Manager" script ready to run our tests, we need to look at the tests themselves. What does an actual Integration Test look like? How do we write Python code to make a database crash on purpose?

In the next chapter, we will write our first real Integration Test.

Next Chapter: Integration Tests
