Levels of AI Agents: from Rules to Large Language Models

yuhuang 874 views 16 slides Aug 07, 2024
Slide 1
Slide 1 of 16
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16

About This Presentation

AI agents are defined as artificial entities to perceive the environment, make decisions and take actions. Inspired by the 6 levels of autonomous driving by SAE (Society of Automotive Engineers), the AI agents are also categorized based on utilities and strongness, as the following levels: L0—no A...


Slide Content

Levels of AI Agents
Yu Huang
Roboraction.AI

Foundation Models and Large Language Models
•Foundation models are pre-trainedon a surrogate task and then
adapted to the downstream task of interest via fine-tuning.
•LLMsare trained on text data, to enable understanding natural
language, via text generation & comprehension.
•The emergent capabilities of LLMs: Prompting, In context learning
(ICL), Chain of thoughts (CoT) and Instruction following.
•Efficient Parameter Fine Tuning (EPFT): LoRA
•Alignment with human preference: RLHF
•LLMs are possible penetration of AI to AGI.

What is an AI Agent?
•Agent is an entity, that is able to perceive its environment and
execute actions.
•The AI Agent is the entity exhibiting intelligent behavior and
possessing capabilities like autonomy, reactivity, pro-activeness, and
social interactions.

Embodied AI
•Embodied AI is designed for enabling agents to display AI, not only in
virtual (such as cyber space) but also physical world, crucial for
realizing AGI.
•Multi-modal Large Models (MLMs) and world models are prominent
features of embodied AI.
•Some basic components:
•Embodied perception (including navigation)
•Embodied interaction (QA and embodied grasping)
•Embodied simulation (world model and adaptation)
•Embodied agent (MLMs, task and action planning, embodied control)
•Visual-Language-Action (VLA) model is a MLM in Embodied AI.

Levels of AGI“Levels of AGI: Operationalizing Progress on the Path to AGI”, arXiv2311.02462, 2023

OpenAI’sAGI Levelshttps://medium.com/@a.sale/chatgpt-5-and-beyond-openais-
five-level-roadmap-to-agi-unveiled-be09db42ca27
July 2024

Capabilities of AI Agents

Andrew Ngo on AI Agent Workflows

Levels of AI Agents

Perception and Action

Reasoning&Decision making

Memory and Reflection

Autonomous Learning and Generalization

Personality and Collaboration

Conclusion
Mo#vated by 5 levels of • autonomous driving by SAE, levels of AI
agents are classified based on intelligence u#li#es and power;
Mostly current AI Agents’ level lies on L2• -L3;
Some AI agents are on research and even developed for L4 or L5, but •
honestly, the system performance is not sa#sfying.
Challenging issues come from AI brain in the agent plaIorm, i.e. LLM, •
like hallucina#on, explainability, performance-cost (computa#on and
memory) tradeoff, safety & security, copyright and privacy etc.
Agent related issues: role playing• , catastrophic forgeQng, misuse,
threats to human race, collec#ve intelligence in agent society etc.