Observability in Juturna¶

Logging¶

Nodes have a built-in logger that can be used straight out of the box. A node logger is created as a sublogger of its containing pipeline, which in turn is a sublogger of the juturna process. Nothing special is required to use the juturna logger within a node, and it can be used anywhere in its class code:

from juturna.components import Node
from juturna.components import Message
from juturna.payloads import BasePayload


class MyAwesomeNode(Node[BasePayload, BasePayload]):
    def __init__(self, *args, **kwargs):
        super().__init__(**kwargs)

        # initialisation code

        self.logger.info('awesome node created!')

    def update(self, message: Message[BasePayload]):
        # run the node update logic

        self.logger.info('node did something awesome!')

Logging messages will carry useful pieces of informatio so that, once printed, it will be easy to understand which entry is printed by which node. As an example, consider the script below, loading a pipe called my_pipe:

import logging

import juturna as jt

logging.basicConfig(level=logging.DEBUG)

pipe = jt.components.Pipeline.from_json('config.json')
pipe.warmup()

This will generate logging entries split by node:

2026-02-12 15:05:20 | INFO     | jt.my_pipe.node_0       | init sources: []
2026-02-12 15:03:54 | INFO     | jt.my_pipe.node_0       | trx created, model id 123197287926000
2026-02-12 15:03:55 | INFO     | jt.my_pipe.node_1       | audio loaded
2026-02-12 15:03:55 | INFO     | jt.my_pipe.node_1       | duration: 149.942875
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_0
2026-02-12 15:03:55 | ERROR    | jt.my_pipe.node_1       | model not found, defaulting
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_1
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_2
2026-02-12 15:03:55 | INFO     | jt.my_pipe.node_3       | endpoint reached
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_3
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | pipe warmed up!

Here we can see that each entry tells us the node that generated it, the pipe it belongs to, and lastly the logging message.

Thanks to their hierarchical structure, loggers can be selectivel enabled or disable only through the names of their corresponding components. Looking again at the pipe above, we can tell the loggers to raise the level to ERROR only for node_1:

import logging

import juturna as jt

logging.basicConfig(level=logging.DEBUG)
jt.log.jt_logger('my_pipe.node_1').setLevel(logging.ERROR)

pipe = jt.components.Pipeline.from_json('config.json')
pipe.warmup()

This time, the logs will look like:

2026-02-12 15:05:20 | INFO     | jt.my_pipe.node_0       | init sources: []
2026-02-12 15:03:54 | INFO     | jt.my_pipe.node_0       | trx created, model id 123197287926000
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_0
2026-02-12 15:03:55 | ERROR    | jt.my_pipe.node_1       | model not found, defaulting
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_1
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_2
2026-02-12 15:03:55 | INFO     | jt.my_pipe.node_3       | endpoint reached
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | warmed up node node_3
2026-02-12 15:03:55 | INFO     | jt.my_pipe              | pipe warmed up!

Formatters and handlers¶

Logs in juturna can be formatted using built-in formatters: simple, colored, full (default), compact, development, minimal and json. To change it, run:

import logging

import juturna as jt


logging.basicConfig(level=logging.DEBUG)
jt.log.formatter('minimal')

pipe = jt.components.Pipeline.from_json('config.json')
pipe.warmup()

INFO: init sources: []
INFO: trx created, model id 123197287926000
INFO: audio loaded
INFO: duration: 149.942875
INFO: warmed up node node_0
ERROR: model not found, defaulting
INFO: warmed up node node_1
INFO: warmed up node node_2
INFO: endpoint reached
INFO: warmed up node node_3
INFO: pipe warmed up!

New handlers can also be added to the juturna logger. As an example, a file handler can be added, to store the logs into a file:

import logging

import juturna as jt

logging.basicConfig(level=logging.DEBUG)
handler = logging.FileHandler('juturna.log')

jt.log.add_handler(handler, formatter='development')

Telemetry¶

In juturna, it is possible to collect low-level data about basic events such as receptions and transmissions. Such data can be used to compute useful metrics about nodes latency, bandwidth and throughput.

Telemetry can be activated on a pipeline level. To do so, add the telemetry entry in the pipe configuration file, pointing at the file where telemetry data will be stored.

{
  "version": "1.0.2",
  "plugins": ["./plugins"],
  "pipeline": {
    "name": "my_pipe",
    "id": "123456",
    "folder": "./run",
    "telemetry": "tele.csv",
    "nodes": [],
    "links": []
  }
}

Once the pipe is started and running, the file ./run/my_pipe/tele.csv will be incrementally filled with telemetry entries. Each node records a telemetry entry:

every time it receives a message,
every time it transmits a message.

Each entry is formatted as follows:

time, event, node, origin, msg_id, src_id, size

Generally speaking then, a telemetry record is a collection of basic information related to a particular event.

time is the timestamp at which the event was recorded (this is taken when the telemetry record is built within the node)
event is the event being metered (rx for a reception event, tx for a transmission event)
node is the name of the node recording the event
origin is the name of the node that produced the message related to the recorded event (for a reception event, this is the node that generated the received message; for a transmission message, this will be the same as node)
msg_id is the id of the message that triggered the event
src_id is the id of the previous message that triggered the last recorded event
size is the size in bytes of the message that triggered the event

Imagine a pipeline with a source node A, a processing node B, and a sink node C. In its implementation, node A has a function that writes messages on its own internal queue, triggering the update function. When the first message is received by the node (so, generated and read by the worker thread), the first telemetry event will be recorder, and it will look something like this:

t_0,rx,A,A,0,,s_0

Here, node A both generated and received the message, so node and origin are the same. Also, being this the first message processed by the node, there is no id stored of previous events, so src_id is none.

When this message is transmitted, another telemetry entry will be recorded:

t_1,tx,A,A,0,0,s_0

At this point, node B receives the message, and produces its own telemetry read:

t_2,rx,B,A,0,,s_0

This entry tells us that at time t_2 node B received message with id 0 generated by node A, and no previous message passed through it. B chruns the message content and sends a new message to C:

t_3,tx,B,B,1,0,s_1

Interact with the filesystem¶

A pipeline can be configured with its own folder on the filesystem. Here, each node will be assigned a directory where it can store data, artifacts, or anything else the node developer might need to. The following pipe:

{
  "version": "0.1.0",
  "plugins": ["./plugins"],
  "pipeline": {
    "id": "quality",
    "name": "quality",
    "folder": "./pipeline",
    "nodes": [],
    "links": []
}

will create the ./pipeline folder, local to where the juturna process was launched, and as many subfolders are there are nodes in the pipe.

The node class offers useful tools to access the filesystem. Assuming we want to save some text to file during an update call, we can simply do:

def update(self, message: Message):
    # do update stuff

    with open(pathlib.Path(self.pipe_path, 'file_dump.txt'), 'a') as f:
        f.write('saving another entry')

So, self.pipe_path points to the folder on the filesystem assigned to this node.

Similarly, calling dump_json(message, f_name) will write a json version of a message in a node folder. This can be triggered at any time in the node execution, however, every transmitted message by the node can be automatically dumped to disk by adding the "auto_dump": true flag in the node configuration.

{
  "name": "node_name",
  "type": "<NODE_TYPE>"
  "mark": "<NODE_MARK>"
  "auto_dump": true
  "configuration": {}
}