Why LLM Agents?
In a human body brain (specifically
consciousness, excluding memory) in itself
can’t do anything without hands/sense organs
memory!
Consider LLM as brain and Agent using LLM as
complete body
Why (What is) LLM Agents?
Consider a LLM application that is designed to help financial analysts.
Simple question “What was X corporation’s total revenue for FY
2022?”-> RAG pipeline with company's data can answer
Real life question which the analyst would ask. -> “What were the
three takeaways from the Q2 earnings call from FY 23? Focus on the
technological moats that the company is building”
-this information requires more than a simple lookup from an
earnings call. It requires planning, tailored focus, memory, using
different tools, and breaking down a complex question into
simpler sub-parts….
-These concepts assembled together are essentially what we have
come to refer to as an LLM Agent.
LLM’s vs Agentic Response
ReAct is one type of Agent Technique
LLM Agent overview
Agent uses LLM as brain’s cerebrum to perform multiple
decision making
Component One: Planning
-Task Decomposition: A complicated task usually involves many
steps. An agent needs to know what they are and plan ahead.
-Self-Reflection: Allows autonomous agents to improve iteratively by
refining past action decisions and correcting previous mistakes.
Component One: Planning
-Task Decomposition: A complicated task usually involves many
steps. An agent needs to know what they are and plan ahead.
-Self-Reflection: Allows autonomous agents to improve iteratively by
refining past action decisions and correcting previous mistakes.
-one notable technique: ReACT: SYNERGIZING REASONING
AND ACTING IN LANGUAGE MODELS
Component Two: Memory
Categorization of human memory.
Component Two: Memory
We can roughly consider the following mappings, for Human memory to
Agent’s memory,
●Sensory memory-> embedding representations for raw inputs,
including text, image or other modalities;
●Short-term memory -> in-context learning. It is short and finite, as it
is restricted by the finite context window length of Transformer.
●Long-term memory-> external vector store that the agent can
attend to at query time, accessible via fast retrieval
Component Three: Tool Use
Utilizing LLM to know which tool to use when, and how to use that tool
-
TALM(Tool Augmented Language Models; Parisi et al. 2022)
-Toolformer by Meta
-HuggingGPT
The format of API calls in TALM