Posts by Tags

Group Relative Policy Optimization (GRPO)

Reinforcement Learning (RL)

agent

A Brief Summary on Agent Tuning

A brief introduction to agent tuning, covering its motivation, challenges, common practices, and existing datasets.

deepseek

finetuning

A Brief Summary on Agent Tuning

A brief introduction to agent tuning, covering its motivation, challenges, common practices, and existing datasets.

llm

A Brief Summary on Agent Tuning

A brief introduction to agent tuning, covering its motivation, challenges, common practices, and existing datasets.

reasoning

reinforcement learning