Document Details

2401.01335.pdf

Download View Text Delete

Clip: Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Zixiang Chen ∗† Yihe Deng ∗‡ Huizhuo Yuan ∗§ Kaixuan Ji ¶ Quanquan Gu ‖ Abstract Harnessing the power of human-annotated data through Supervised Fine-Tuning (SFT) is pivotal for advancing Large Language Models (LLMs). In this paper, we delve into the prospect

Filename: 2401.01335.pdf

Filetype: application/pdf

Size: 1719766 bytes

Uploaded On: 2024-01-27

Abstract:

Summary:

Tags:

Notes:

Visible: 1

Status: Parsed

Author:

CreationDate: 2024-01-03T01:33:22+00:00

Creator: LaTeX with hyperref

Keywords:

ModDate: 2024-01-03T01:33:22+00:00

PTEX.Fullbanner: This is pdfTeX, Version 3.141592653-2.6-1.40.25 (TeX Live 2023) kpathsea version 6.3.5

Producer: pdfTeX-1.40.25

Subject:

Title:

Trapped: False

Pages: 28

Return to Document Library