type
status
date
Jun 10, 2024 06:38 AM
slug
summary
tags
category
icon
password
Paper Information If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents https://arxiv.org/abs/2401.00812
This surveys discusses multiple benefits of including code in LLM pretraining from three aspects:
- improving programming and reasoning skills
- using external tools with executable steps
- performing self-improvement from feedback
Functionalities of Code Pretraining
- Performance gain: programming, reasoning, structure generation
- Tool Use: generate executable steps during decision-making (Planning, CoT)
- Interaction (Agent): self-improvement

Furthermore, there are several benefits in building intelligent agents:
- enhance perception and planning skills
- direct action primitive grounding and modular memory organization
- providing self-correction and self-improvement
Code improves performance

Advantages of learning from code:
- Directly improve programming skills
- Empower complex reasoning (CoT → PoT)
- Capture structured knowledge
- structural reasoning
- markup language understanding
Code improves tool using

Type of tools
Code improves self-correction

How to learn from feedback?
- Selection based method
- majority voting
- re-ranking
- Prompted-based method
- Fine-tuning
Challenges
- Causality between code pretraining and LLM’s performance improvement
- Enhancing reasoning beyond code
- Improving multi-turn interactions