12 Comments
User's avatar
Lirio's avatar

Olá, muito bom :). Obrigado.

Hello, Pure Gold. Thanks

Expand full comment
Nir Diamant's avatar

You are welcome :))

Expand full comment
Vadzim's avatar

It's just gold, thanks!

Expand full comment
Nir Diamant's avatar

Thanks for the feedback! You are welcome

Expand full comment
Daniel's avatar

Most “AI agents” are still goldfish. Real autonomy needs hybrid memory: retrieval + compression + scoring, not just sliding windows.

Expand full comment
Justin Norris's avatar

Love the clarity of this article and simplicity of the examples. Makes a relatively technical topic easy to understand.

Expand full comment
Nir Diamant's avatar

Thanks for that feedback! Helps me to know how to write the next ones :)

Expand full comment
Damien Lewke's avatar

Amazing write up! Thank you

Expand full comment
Nir Diamant's avatar

Thanks for the feedback! Happy you liked it :)

Expand full comment
dan mantena's avatar

Love the structure of this article. Looking forwarw to slow reading it later today.

Based on this recent study from anthropic, it doesn't seem like that a more refined RAG system will not really solve the alignment problem or reduce the externalities of unpredictability from autonomous agents down to a manageable level that would make sense for a real world deployment just yet. Curious to hear your thoughts on this.

https://www.anthropic.com/research/project-vend-1

Expand full comment
Norm Katz's avatar

Hi Nir, thanks for this comprehensive article on managing conversational memory. It’s something I’m actually dealing with right now so this was very helpful.

One thing that always bugs me about similar articles I’ve read is when the author doesn’t describe the actual work that needs to go into each suggested method. I’m not talking about coding, I’m talking about categorizing your data. For instance, in a graph database, it doesn’t magically create the relationships for you, and sometimes once you exhaustively listed the classes or subdomains, it can take a lot of work to decide how you want them to be connected/related. As the tools evolve, they will be able to analyze your data and suggest both categories and relationships. But if you’re a medical specialist, or work for one as a data engineer, you usually already have the domain expertise, but building or updating the relationship graph can be tricky, including for things like avoiding redundancy and circular references.

Would you say that most of the techniques you mentioned are now mature enough to do most of this work, or we’re not there yet so developers and data scientists still have to do the work to define what aspects of their data they need to capture, relate, and recall and then test each configuration over time and make adjustments along the way?

Expand full comment
Rahul Pandey's avatar

Hi Nir thanks for the article. I have one question though, we are collating information over time mostly using summarization. This summarized info is from multiple documents. Now it can happen that one of the document is deleted. It is very expensive to re-summarize. Any methods which can help?

Expand full comment