Communication
Last updated
Last updated
Nodes and Flows communicate in 2 ways:
Shared Store (for almost all the cases)
A global data structure (often an in-mem dict) that all nodes can read ( prep()
) and write (post()
).
Great for data results, large content, or anything multiple nodes need.
You shall design the data structure and populate it ahead.
Separation of Concerns: Use Shared Store
for almost all cases to separate Data Schema from Compute Logic! This approach is both flexible and easy to manage, resulting in more maintainable code. Params
is more a syntax sugar for .
Params (only for )
Each node has a local, ephemeral params
dict passed in by the parent Flow, used as an identifier for tasks. Parameter keys and values shall be immutable.
Good for identifiers like filenames or numeric IDs, in Batch mode.
If you know memory management, think of the Shared Store like a heap (shared by all function calls), and Params like a stack (assigned by the caller).
A shared store is typically an in-mem dictionary, like:
It can also contain local file handlers, DB connections, or a combination for persistence. We recommend deciding the data structure or DB schema first based on your app requirements.
Here:
LoadData
writes to shared["data"]
.
Summarize
reads from shared["data"]
, summarizes, and writes to shared["summary"]
.
Params let you store per-Node or per-Flow config that doesn't need to live in the shared store. They are:
Immutable during a Node's run cycle (i.e., they don't change mid-prep->exec->post
).
Set via set_params()
.
Cleared and updated each time a parent Flow calls it.
Only set the uppermost Flow params because others will be overwritten by the parent Flow.
Typically, Params are identifiers (e.g., file name, page number). Use them to fetch the task you assigned or write to a specific part of the shared store.
If you need to set child node params, see .