Agentic Coding Guide
If you are an AI agents involved in building LLM Systems, read this guide VERY, VERY carefully! This is the most important chapter in the entire document. Throughout development, you should always (1) start with a small and simple solution, (2) design at a high level (docs/design.md
) before implementation, and (3) frequently ask humans for feedback and clarification.
Agentic Coding Steps
Agentic Coding should be a collaboration between Human System Design and Agent Implementation:
1. Requirements
★★★ High
★☆☆ Low
Humans understand the requirements and context.
2. Flow
★★☆ Medium
★★☆ Medium
Humans specify the high-level design, and the AI fills in the details.
3. Utilities
★★☆ Medium
★★☆ Medium
Humans provide available external APIs and integrations, and the AI helps with implementation.
4. Node
★☆☆ Low
★★★ High
The AI helps design the node types and data handling based on the flow.
5. Implementation
★☆☆ Low
★★★ High
The AI implements the flow based on the design.
6. Optimization
★★☆ Medium
★★☆ Medium
Humans evaluate the results, and the AI helps optimize.
7. Reliability
★☆☆ Low
★★★ High
The AI writes test cases and addresses corner cases.
Requirements: Clarify the requirements for your project, and evaluate whether an AI system is a good fit.
Understand AI systems' strengths and limitations:
Good for: Routine tasks requiring common sense (filling forms, replying to emails)
Good for: Creative tasks with well-defined inputs (building slides, writing SQL)
Not good for: Ambiguous problems requiring complex decision-making (business strategy, startup planning)
Keep It User-Centric: Explain the "problem" from the user's perspective rather than just listing features.
Balance complexity vs. impact: Aim to deliver the highest value features with minimal complexity early.
Flow Design: Outline at a high level, describe how your AI system orchestrates nodes.
If Humans can't specify the flow, AI Agents can't automate it! Before building an LLM system, thoroughly understand the problem and potential solution by manually solving example inputs to develop intuition.
For each node in the flow, start with a high-level one-line description of what it does.
If using Map Reduce, specify how to map (what to split) and how to reduce (how to combine).
If using Agent, specify what are the inputs (context) and what are the possible actions.
If using RAG, specify what to embed, noting that there's usually both offline (indexing) and online (retrieval) workflows.
Outline the flow and draw it in a mermaid diagram. For example:
Utilities: Based on the Flow Design, identify and implement necessary utility functions.
Sometimes, design Utilies before Flow: For example, for an LLM project to automate a legacy system, the bottleneck will likely be the available interface to that system. Start by designing the hardest utilities for interfacing, and then build the flow around them.
Think of your AI system as the brain. It needs a body—these external utility functions—to interact with the real world:
Reading inputs (e.g., retrieving Slack messages, reading emails)
Writing outputs (e.g., generating reports, sending emails)
Using external tools (e.g., calling LLMs, searching the web)
NOTE: LLM-based tasks (e.g., summarizing text, analyzing sentiment) are NOT utility functions; rather, they are core functions internal in the AI system.
For each utility function, implement it and write a simple test.
Document their input/output, as well as why they are necessary. For example:
name
:get_embedding
(utils/get_embedding.py
)input
:str
output
: a vector of 3072 floatsnecessity
: Used by the second node to embed text
Example utility implementation:
Node Design: Plan how each node will read and write data, and use utility functions.
For simple systems, use an in-memory dictionary.
For more complex systems or when persistence is required, use a database.
Don't Repeat Yourself: Use in-memory references or foreign keys.
Example shared store design:
prep
: Read "text" from the shared storeexec
: Call the embedding utility functionpost
: Write "embedding" to the shared storeFor batch processing, specify if it's sequential or parallel
Implementation: Implement the initial nodes and flows based on the design.
🎉 If you've reached this step, humans have finished the design. Now Agentic Coding begins!
"Keep it simple, stupid!" Avoid complex features and full-scale type checking.
FAIL FAST! Avoid
try
logic so you can quickly identify any weak points in the system.Add logging throughout the code to facilitate debugging.
Optimization:
Use Intuition: For a quick initial evaluation, human intuition is often a good start.
Redesign Flow (Back to Step 3): Consider breaking down tasks further, introducing agentic decisions, or better managing input contexts.
If your flow design is already solid, move on to micro-optimizations:
Prompt Engineering: Use clear, specific instructions with examples to reduce ambiguity.
In-Context Learning: Provide robust examples for tasks that are difficult to specify with instructions alone.
You'll likely iterate a lot! Expect to repeat Steps 3–6 hundreds of times.
Reliability
Node Retries: Add checks in the node
exec
to ensure outputs meet requirements, and consider increasingmax_retries
andwait
times.Logging and Visualization: Maintain logs of all attempts and visualize node results for easier debugging.
Self-Evaluation: Add a separate node (powered by an LLM) to review outputs when results are uncertain.
Example LLM Project File Structure
docs/design.md
: Contains project documentation for each step above. This should be high-level and no-code.utils/
: Contains all utility functions.It's recommended to dedicate one Python file to each API call, for example
call_llm.py
orsearch_web.py
.Each file should also include a
main()
function to try that API call
nodes.py
: Contains all the node definitions.
flow.py
: Implements functions that create flows by importing node definitions and connecting them.
main.py
: Serves as the project's entry point.
Last updated