`tfw_logger`: gRPC extension for machine learning #2421

krizhanovsky · 2025-05-07T11:54:05Z

Motivation

At the moment tfw_logger is only used to write access logs and security logs in #2399 , but we also need to feed the data to the Tempesta Escudo classification daemons. The daemons, using NN, are likely to be run on a separate cluster, or at least must have such ability. From the other side, simple classifications can be done on the local host. Thus, we need both the gRPC and local interfaces, like zero-copy #77. Use flatbuffers to minimize serialization overhead on the proxy nodes.

However, this issue adds plenty of HTTP headers to send and copying all of them could hurt Tempesta FW performance. Ideally, if tfw_logger could use the zero-copy mappings #77.

Scope

Tempesta Managed from Escudo already used gRPC/flatbuffers on the client (CLI) and server (manager) sides, so let's reuse it. To do so we need to move the gRPC code to tempesta/utils and fetch tempesta as a git submodule for Tempesta Escudo (@consuelo2210 FYI).

Need to add new logging facility grpc plus to current mmap:

{
   "log": "/var/log/tempesta_access.log",
    "access_log": {
        "host": "localhost",
        "table": "access_log",
        "mmap_log_buffer_size": 4096,
        "extra-headers": [ "sec-fetch-site" ],
    },
    "dos": {
        "host": "localhost",
        "table": "security_dos",
        "mmap_log_buffer_size": 4096,
        "extra-headers": [ "sec-fetch-site" ],
    },    
    "suspicious": {
        "host": "localhost",
        "table": "suspicious",
        "mmap_log_buffer_size": 4096,
        "extra-headers": [ "sec-fetch-site" ],
    },
    "grpc": {
        "host": "localhost:4433",
        "extra-headers": [ "sec-fetch-site", "sec-fetch-mode" ]
    }
}

We also need to be able to add following headers to the list of extra-headers to be logged in mmap and grpc modes (let's leave dmesg as is to not to overload already overloaded kernel logging):

Accept-Language
Accept-Encoding
Accept
Content-Language
Cookie
Upgrade-Insecure-Requests
Sec-Fetch (Sec-Fetch-Mode, Sec-Fetch-Site, Sec-Fetch-Dest)
Cache-Control
Connection
Keep-Alive

This list is expected to grow and expected to be specified in the configuration, so need to make the headers special. Also, since this is a pretty a volume of data, need to apply some simple compression. We can start with simple flags, e.g. for Connection just use: 0 - no header, 1 - keep-alive, 2 - close, 3 - anything other. The same flag technique can be applied for sec-fetch headers. Ideally, compression should be done on the tfw_logger side, but if we have the data already compressed (e.g. with Huffman) we probably can just pass it as is to the user space.

Testing

Just borrow the parts of the current ML logic and reuse them in the tests for gRPC/flatbuffers receving.

Documentation

Add this feature to https://tempesta-tech.com/knowledge-base/Handling-clients/ in case if any open source users also benefit from getting logs via gRPC.

The text was updated successfully, but these errors were encountered:

krizhanovsky added this to the 0.9 - LA milestone May 7, 2025

krizhanovsky added enhancement crucial enterprise labels May 7, 2025

krizhanovsky mentioned this issue May 7, 2025

Configuration parsing error: mmap_log_buffer_size #2313

Open

WitalyAnisimov assigned WitalyAnisimov and unassigned WitalyAnisimov May 7, 2025

krizhanovsky mentioned this issue May 6, 2025

Security events observability #2399

Open

krizhanovsky modified the milestones: 0.9 - LA, 1.0 - GA May 8, 2025

krizhanovsky mentioned this issue May 13, 2025

Sec-Fetch-User HTTP Request Header #1414

Open

krizhanovsky mentioned this issue May 23, 2025

Morgana future/fix/configuration parsing error mmap log buffer size #2428

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`tfw_logger`: gRPC extension for machine learning #2421

`tfw_logger`: gRPC extension for machine learning #2421

krizhanovsky commented May 7, 2025 •

edited

Loading

tfw_logger: gRPC extension for machine learning #2421

tfw_logger: gRPC extension for machine learning #2421

Comments

krizhanovsky commented May 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Scope

Testing

Documentation

`tfw_logger`: gRPC extension for machine learning #2421

`tfw_logger`: gRPC extension for machine learning #2421

krizhanovsky commented May 7, 2025 •

edited

Loading