Lectures - Penn Agents / Spring 2025

Introduction to Agent and Inference Time Scaling
tl;dr: We discuss basics about LLM agent; and a brief introduction to DeepSeek-R1.
[codes] [Basics] [Research]

Suggested Readings:

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Model Parameters
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient.

LLM Agents: Why, What, How
tl;dr: We discuss why LLM agents are designed, what their key components are, and how to design LLM agents for specific domains.
[Example] [Basics]

Suggested Readings:

Multi-Agent, Multimodal Frameworks
tl;dr: We discuss design choices for multi-agent, multimodal frameworks.
[Basics]

Suggested Readings:

LLM agents and reasoning benchmarks
tl;dr: We propose a framework over current evaluation benchmarks, and discuss potential research topics.
[Basics]

Suggested Readings:

Safe and Trustworthy Agents
tl;dr: We talk about trust taxonomy, attacks, and defenses.
[Basics]

Suggested Readings: