I'm trying to follow along with this tutorial, but the formatting is off and Python is picky about formatting. I've read your comment that the "full code can be found in my repo", but it's not obvious where in the repo this code is. Care to provide a link to the .py file for this tutorial, please?
Sure, here is the link to the specific tutorial: https://github.com/NirDiamant/GenAI_Agents/blob/main/all_agents_tutorials/langgraph-tutorial.ipynb
Copy the whole tutorial, send it to DeepSeek, and ask for the code all together :)
Yes, same problem. It looks like the formatting was off.
what is the problem?
The State class that uses a TypedDict at the beginning. The formatting is all off. It had me questioning if there was a novel Python syntax I had not heard of.
class State(TypedDict): text:
str classification:
str entities: List[str] summary: str
On a positive note, it's good motivation to read up on Python's TypedDict feature from the typing package.
Here is the corrected syntax:
class State(TypedDict):
text: str
classification: str
entities: List[str]
summary: str
The syntax variable: type is required for each field of a TypedDict.
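For anyone new to TypedDict: at runtime it is just a plain dict, and the field annotations only guide static type checkers, which is why LangGraph can pass it around as ordinary state. A minimal, self-contained sketch:
from typing import TypedDict, List

class State(TypedDict):
    text: str
    classification: str
    entities: List[str]
    summary: str

# A TypedDict instance is created and accessed like any other dict
state: State = {"text": "some input", "classification": "", "entities": [], "summary": ""}
print(type(state))    # <class 'dict'>
print(state["text"])  # regular key access, no attribute syntax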
The current version worked perfectly for me, but you are welcome to open a PR to make it more stable.
I see that in the repository
https://github.com/NirDiamant/GenAI_Agents/blob/main/all_agents_tutorials/langgraph-tutorial.ipynb
the syntax is good. The minor issue is on the website, where the body of class State has newlines in the wrong places.
My understanding of an agent is that it should be able to handle some complexity that may arise at run time. How is the system designed above different from a simple program that uses an LLM to answer some questions? In the overall flow, the so-called agent is a linear program flow. Could you elaborate on that part?
Thanks for your comment! Please refer to the other comments on this blog post, as I answered this question there :)
May I recommend reading Gary Marcus' Substack for a different point of view.
As easy as it can be.
Next time will try to make it even easier :)
Thanks for sharing this, I managed to successfully run the whole code and get the intended output.
That being said, I am struggling to see how this code (putting aside the usage of the LangGraph framework) is different from the good old hard-coded, predictable series of function calls, where the output of function A serves as the input of function B. What specifically is "agentic" in this code? How can this agent "dynamically adjust its focus based on what it discovers"?
This is not an agent but a workflow.
This is a minimal version of a controllable agent
Personally, I feel like this needs a stronger use case. It's not clear how AI chaining did something that a series of well-reasoned prompts couldn't.
Thanks for your input, Dave.
The use case in this blog post is intentionally simple, as its goal is to provide an introduction and help readers understand what agents are and how to create one.
How does the summarization consider the information from the previous steps (classification and entities) if it only uses state["text"] and not state["classification"] or state["entities"]? It seems that the summarization logic doesn't use the classification or entity extraction output. Please correct me if I'm wrong...
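Indeed, in the posted code the summarization prompt only sees state["text"]. If you wanted the summary to take the earlier steps into account, one option (a sketch, not part of the original tutorial; it reuses the tutorial's State, llm, PromptTemplate, and HumanMessage) is to pass them into the prompt:
def summarization_node(state: State):
    # Sketch: feed the classification and extracted entities into the summary prompt
    prompt = PromptTemplate(
        input_variables=["classification", "entities", "text"],
        template="The following text was classified as {classification} and mentions these entities: {entities}.\n\nSummarize it in one short sentence.\n\nText:{text}\n\nSummary:"
    )
    message = HumanMessage(content=prompt.format(
        classification=state["classification"],
        entities=", ".join(state["entities"]),
        text=state["text"],
    ))
    summary = llm.invoke([message]).content.strip()
    return {"summary": summary}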
Like other readers commented, it is hard to see the power of agents from this example; I understand this was intentionally simplified. It might help to link to a blog post that shows the real power: the autonomous ability to decide based on context. If the agent gets non-English input, could it translate and then summarize?
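On the non-English question: this is exactly what LangGraph's conditional edges are for, since the next node is chosen from something the LLM produced at run time. A rough sketch (the node names and the extra language field are hypothetical, not from the original tutorial):
def detect_language_node(state: State):
    # Assumes State has an extra field, language: str
    message = HumanMessage(content=f"Answer with only the two-letter language code of this text:\n\n{state['text']}")
    return {"language": llm.invoke([message]).content.strip().lower()}

def translation_node(state: State):
    message = HumanMessage(content=f"Translate the following text to English:\n\n{state['text']}")
    return {"text": llm.invoke([message]).content.strip()}

def route_by_language(state: State) -> str:
    # The path through the graph depends on what the model detected
    return "classification_node" if state["language"] == "en" else "translation"

workflow.add_node("detect_language", detect_language_node)
workflow.add_node("translation", translation_node)
# Start at language detection instead of set_entry_point("classification_node")
workflow.set_entry_point("detect_language")
workflow.add_conditional_edges(
    "detect_language", route_by_language,
    {"classification_node": "classification_node", "translation": "translation"}
)
workflow.add_edge("translation", "classification_node")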
You can see many more tutorials on my GitHub, both explanations and code:
https://github.com/NirDiamant/GenAI_Agents
Good insight 😌 Can I translate part of this article into Spanish, with links to you and a description of your newsletter?
Yes, please explicitly credit and link to this blog post at the beginning of your article
Thanks. Of course.
The example code looks like it could be a workflow. How do you decide whether to implement it as an agent or a workflow?
An agent can be a workflow where each step uses an LLM for a specific complicated task. This way, we have much more control over the agent.
I like your tutorial and have subscribed to your newsletter! I'm new to Medium. Can I get your quick feedback on an even simpler agent, https://the-pocket.github.io/PocketFlow/design_pattern/agent.html, built on a 100-line framework?
I made https://substack.com/home/post/p-158349453
Hello Nir, thanks for the excellent blog post. I would like a few inputs from you, especially about the way we use the term AI agents. The example you have shown as part of the blog is more of an AI system, i.e. a workflow; according to the Anthropic blog "Building Effective Agents", an AI agent is one that takes decisions autonomously. It would be helpful if you could write a blog on building a true AI agent and how to distinguish when to use an AI workflow and when to use an AI agent.
Thanks for your input, Ravi.
The use case in this blog post is intentionally simple, as its goal is to provide an introduction and help readers understand what agents are and how to create one.
AttributeError Traceback (most recent call last)
in <cell line: 0>()
5
6 state_input = {"text": sample_text}
----> 7 result = app.invoke(state_input)
8
9 print("Classification:", result["classification"])
6 frames
in classification_node(state)
6 )
7 message = HumanMessage(content=prompt.format(text=state["text"]))
----> 8 classification = llm.invoke([message]).content.strip()
9 return {"classification": classification}
10
AttributeError: 'tuple' object has no attribute 'invoke'
Could anyone help identify why this is happening? Thanks!
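For anyone hitting the same thing: the message means the name llm is bound to a tuple rather than a ChatOpenAI instance, so llm.invoke fails. A common accidental cause (an assumption here, not confirmed from the repo issue) is a stray trailing comma when creating the model:
llm = ChatOpenAI(temperature=0),   # trailing comma makes llm a 1-element tuple
llm = ChatOpenAI(temperature=0)    # correct: llm is the model object and has .invoke
A quick check is print(type(llm)) right after the assignment.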
I answered the open issue on the repo.
I saw that you opened an issue on the repo. I'll take care of it (or someone else will) soon.
Thanks! 🙂
Very good post. Can I translate part of this article into Spanish with links to you and a description of your newsletter?
Hmm okay.
Please put this info at the beginning of your post as a disclaimer, with a link to the original post and the newsletter, as you said.
It seems that the app should be changed to:
import os
from typing import TypedDict, List
from langgraph.graph import StateGraph, END
from langchain.prompts import PromptTemplate
from langchain_openai import ChatOpenAI
from langchain.schema import HumanMessage
class State(TypedDict):
    text: str
    classification: str
    entities: List[str]
    summary: str
llm = ChatOpenAI(model="gpt-4-0613", temperature=0)
def classification_node(state: State):
    prompt = PromptTemplate(
        input_variables=["text"],
        template="Classify the following text into one of the categories: News, Blog, Research, or Other.\n\nText:{text}\n\nCategory:"
    )
    message = HumanMessage(content=prompt.format(text=state["text"]))
    classification = llm.invoke([message]).content.strip()
    # Spread the old state first so the new value is not overwritten by it
    return {**state, "classification": classification}
def entity_extraction_node(state: State):
    prompt = PromptTemplate(
        input_variables=["text"],
        template="Extract all the entities (Person, Organization, Location) from the following text. Provide the result as a comma-separated list.\n\nText:{text}\n\nEntities:"
    )
    message = HumanMessage(content=prompt.format(text=state["text"]))
    entities_str = llm.invoke([message]).content.strip()
    entities = [e.strip() for e in entities_str.split(",")] if entities_str else []
    return {**state, "entities": entities}
def summarization_node(state: State):
    prompt = PromptTemplate(
        input_variables=["text"],
        template="Summarize the following text in one short sentence.\n\nText:{text}\n\nSummary:"
    )
    message = HumanMessage(content=prompt.format(text=state["text"]))
    summary = llm.invoke([message]).content.strip()
    return {**state, "summary": summary}
workflow = StateGraph(State)
# Add nodes to the graph
workflow.add_node("classification_node", classification_node)
workflow.add_node("entity_extraction", entity_extraction_node)
workflow.add_node("summarization", summarization_node)
# Add edges to the graph
workflow.set_entry_point("classification_node")
workflow.add_edge("classification_node", "entity_extraction")
workflow.add_edge("entity_extraction", "summarization")
workflow.add_edge("summarization", END)
# Compile the graph
app = workflow.compile() # This line is crucial
sample_text = """
OpenAI has announced the GPT-4 model, which is a large multimodal model that exhibits human-level performance on various professional benchmarks. It is developed to improve the alignment and safety of AI systems.
additionally, the model is designed to be more efficient and scalable than its predecessor, GPT-3. The GPT-4 model is expected to be released in the coming months and will be available to the public for research and development purposes.
"""
state_input = {"text": sample_text, "classification": "", "entities": [], "summary": ""}
result = app.invoke(state_input)
print("Classification:", result["classification"])
print("\nEntities:", result["entities"])
print("\nSummary:", result["summary"])
But in the end I had no access to gpt-4-0613.
The full code can be found in my repo, and it works (I tested it yesterday). If you choose to work with OpenAI, you'll need to configure a private API key on their website first, but you can use any LLM that you want.
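If your key has no access to gpt-4-0613, only the model name needs to change; any chat model your account can use should work (the name below is just an example):
llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)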
And where is your repo please?
Found the repo, I'll check, thx