Document Details
Clip:
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Zixiang Chen ∗† Yihe Deng ∗‡ Huizhuo Yuan ∗§ Kaixuan Ji ¶ Quanquan Gu ‖ Abstract Harnessing the power of human-annotated data through Supervised Fine-Tuning (SFT) is pivotal for advancing Large Language Models (LLMs). In this paper, we delve into the prospect
Filename:
2401.01335.pdf
Filetype:
application/pdf
Size:
1719766 bytes
Uploaded On:
2024-01-27
Abstract:
Summary:
Tags:
Notes:
Visible:
1
Status:
Parsed
Author:
CreationDate:
2024-01-03T01:33:22+00:00
Creator:
LaTeX with hyperref
Keywords:
ModDate:
2024-01-03T01:33:22+00:00
PTEX.Fullbanner:
This is pdfTeX, Version 3.141592653-2.6-1.40.25 (TeX Live 2023) kpathsea version 6.3.5
Producer:
pdfTeX-1.40.25
Subject:
Title:
Trapped:
False
Pages:
28
Return to Document Library