Paper Reading: Code Pretraining in LLMs | Blog of sicer

Blog of sicer

Personal blog deployed by NotionNext

🔇Paper Reading: Code Pretraining in LLMs

Jun 10, 2024

| Oct 10, 2024

Words 145Read Time≈ 1 min

type

status

date

Jun 10, 2024 06:38 AM

slug

summary

tags

category

icon

password

Paper Information If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents https://arxiv.org/abs/2401.00812

This surveys discusses multiple benefits of including code in LLM pretraining from three aspects:

improving programming and reasoning skills

using external tools with executable steps

performing self-improvement from feedback

Functionalities of Code Pretraining

Performance gain: programming, reasoning, structure generation

Tool Use: generate executable steps during decision-making (Planning, CoT)

Interaction (Agent): self-improvement

notion image

Furthermore, there are several benefits in building intelligent agents:

enhance perception and planning skills

direct action primitive grounding and modular memory organization

providing self-correction and self-improvement

Code improves performance

notion image

Advantages of learning from code:

Directly improve programming skills

Empower complex reasoning (CoT → PoT)

Capture structured knowledge

structural reasoning
markup language understanding

Code improves tool using

notion image

Type of tools

Code improves self-correction

notion image

How to learn from feedback?

Selection based method

majority voting
re-ranking

Prompted-based method

Fine-tuning

Challenges

Causality between code pretraining and LLM’s performance improvement

Enhancing reasoning beyond code

Improving multi-turn interactions

Author:sicer
URL:https://blog.sicer.top/article/code-pretrain
Copyright:All articles in this blog, except for special statements, adopt BY-NC-SA agreement. Please indicate the source!

Relate Posts :

Tags:

LLM

Paper

Survey

Alibaba Qwen: Generalizing an LLM from 8k to 1M Context using Qwen-Agent Long Context Challenge in LLMs

Loading...

Catalog

0%