Optimizing Prompt Generation for Large Language Models with Evolutionary Algorithms


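This paper, published at ICLR 2024, combines large language models (LLMs) with evolutionary algorithms (EAs) to build a prompt optimizer called EVOPROMPT. The framework targets discrete prompt optimization and draws on the efficiency and fast convergence of EAs. Connecting LLMs with EAs makes it possible to handle discrete prompts, which must remain coherent, human-readable natural language. The language-processing ability of LLMs and the optimization ability of EAs together automate prompt generation and reduce manual effort. The research team is from Tsinghua University, Microsoft Research, and Northeastern University.

LLMs usually need carefully designed prompts to perform well on a given task, and writing those prompts takes considerable human effort. EAs are widely used on complex optimization problems because they perform well and converge quickly, but they cannot be applied directly to natural-language prompts: conventional evolutionary operators do not preserve coherence or readability. EVOPROMPT therefore has the LLM carry out the evolutionary operators, so that prompts are optimized while staying fluent and understandable. The system thus uses the semantic understanding and generation ability of LLMs together with the search and optimization ability of EAs.

The paper may detail how EVOPROMPT works, including how the population is initialized, how the fitness function that scores a prompt is defined, and how prompts are updated through genetic operations such as selection, crossover, and mutation. It may also discuss the experimental setup, including the specific language models used, the parameter choices for the evolutionary algorithms, and performance comparisons across tasks that demonstrate the effectiveness of EVOPROMPT.

Overall, the work addresses the dependence of LLMs on hand-crafted prompts: an automated method improves model performance across tasks and reduces reliance on human experts. Beyond its practical value, it also suggests a direction for designing future hybrid intelligent systems.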


Published as a conference paper at ICLR 2024

[Figure 2 content: Differential Evolution (DE) Algorithm Implemented by LLMs]

Query:

Please follow the instruction step-by-step to generate a better prompt.

1. Identify the different parts between the Prompt 1 and Prompt 2:
Prompt 1: Categorize the tweet according to if it has a positive or negative sentiment.
Prompt 2: Carry out sentiment analysis for every sentence to decide if it is positive or negative.
2. Randomly mutate the different parts
3. Combine the different parts with Prompt 3, selectively replace it with the different parts in Step 2 and generate a new prompt.
Prompt 3: In this task, you are given sentences from product reviews. The task is to classify a sentence as positive or as negative.
4. Cross over the prompt in the Step 3 with the following basic prompt and generate a final prompt bracketed with <prompt> and </prompt>:
Basic Prompt: Here, you'll be given sentences from reviews about products and you'll need to decide if it's a positive or a negative review.

Response:

1. Different parts ($b - c$):
"tweet" vs "sentence"
"Categorize" vs "Carry out sentiment analysis"
2. Mutated parts ($F(b - c)$):
"tweet" -> "review"
"Categorize" -> "Analyze"
"Sentiment analysis" -> "Sentiment identification"
3. New Prompt ($a + F(b - c)$): In this task, you are given reviews about products. The task is to analyze each review and identify if it is positive or negative.
4. Final Prompt (Crossover): <prompt>Here, you'll be given reviews about products and you'll need to analyze each review and identify if it is positive or negative.</prompt>

Figure 2: DE process implemented by LLMs (Evo(·) in Algorithm 1). In Step 1, LLMs find the different parts (highlighted words) between Prompt 1 and Prompt 2 ($b - c$ in typical DE). In Step 2, LLMs perform mutation (highlighted words) on them (imitation of $F(b - c)$). Next, LLMs incorporate the current best prompt as Prompt 3 with the mutated results in Step 2, to generate a new prompt (counterpart of $a + F(b - c)$ in DE). Finally, LLMs perform crossover upon the current basic prompt and the generated prompt in Step 3. See Figure 5 in Appendix B.2 for the complete response.
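For comparison, the numeric operation that the caption maps onto can be sketched as follows. This is a minimal illustration of one classic DE step on real-valued vectors, not code from the paper; the function name `de_candidate`, the binomial crossover variant, and the default values of `F` and `CR` are assumptions chosen for the example.

```python
import random

def de_candidate(a, b, c, base, F=0.5, CR=0.5, rng=random):
    """One classic DE step on real-valued vectors (lists of floats).

    a, b, c: three population members; base: the vector being improved.
    F (scale factor) and CR (crossover rate) are illustrative defaults.
    """
    # Mutation: add the scaled difference of b and c to a, i.e. a + F(b - c).
    mutant = [ai + F * (bi - ci) for ai, bi, ci in zip(a, b, c)]
    # Binomial crossover: each coordinate is taken from the mutant with
    # probability CR; one randomly chosen coordinate always is, so the
    # trial vector never equals the base vector by construction.
    forced = rng.randrange(len(base))
    return [m if (k == forced or rng.random() < CR) else x
            for k, (m, x) in enumerate(zip(mutant, base))]
```

In the figure's analogy, `a` plays the role of Prompt 3 (the current best prompt), `b` and `c` correspond to Prompt 1 and Prompt 2, and `base` corresponds to the basic prompt; the LLM performs each of these operations on text rather than on numbers.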

3.2 INSTANTIATION WITH GENETIC ALGORITHM

Selection. In GA, parent solutions are conventionally selected using the roulette wheel selection method, guided by their fitness values (Lipowski & Lipowska, 2012). Analogously, we employ the roulette wheel selection to choose two parent prompts from the current population, based on their performance scores obtained on the development sets. Let $s_i$ denote the performance score of the $i$-th prompt within a population containing $N$ prompts. The probability of selecting the $i$-th prompt as a parent can be expressed as $p_i = s_i / \sum_{j=1}^{N} s_j$.
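The selection rule above can be sketched in a few lines. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, scores are assumed to be non-negative, the uniform fallback for an all-zero population is an added assumption, and parents are drawn with replacement, so the same prompt may be picked twice.

```python
import random

def selection_probabilities(scores):
    """Roulette wheel probabilities: each score divided by the sum of all scores."""
    total = sum(scores)
    if total <= 0:
        # Assumption: if every score is zero, fall back to a uniform choice.
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]

def select_parents(prompts, scores, rng=random):
    """Draw two parent prompts, each with probability proportional to its score."""
    probs = selection_probabilities(scores)
    # random.choices samples with replacement according to the given weights.
    return rng.choices(prompts, weights=probs, k=2)
```

For example, `select_parents(population, dev_scores)` returns a two-element list of parents, where `population` and `dev_scores` are parallel lists of prompts and their development-set scores.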

Evolution. Conforming to the GA framework, we generate a new candidate prompt via two steps: 1) Crossover is performed between the parent prompts to produce a new offspring prompt that inherits characteristics from both parents; 2) Mutation is applied to the offspring prompt, introducing random alterations to certain elements. We formalize this two-stage operation into algorithmic instructions for guiding LLMs to implement Evo(·) in Algorithm 1. The entire process is illustrated in Figure 1.
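One way to wrap this two-stage operation in code is sketched below, assuming an `llm` callable that maps a query string to a response string. The template wording is illustrative only: its opening sentence and the `<prompt>` bracket convention are borrowed from the DE query in Figure 2, while the two numbered steps, `GA_TEMPLATE`, and `evolve` are hypothetical and are not the paper's actual instructions.

```python
import re

# Hypothetical instruction wording; the paper's actual GA template is not reproduced here.
GA_TEMPLATE = (
    "Please follow the instruction step-by-step to generate a better prompt.\n"
    "1. Cross over the following prompts and generate a new prompt:\n"
    "Prompt 1: {p1}\n"
    "Prompt 2: {p2}\n"
    "2. Mutate the prompt generated in Step 1 and generate a final prompt "
    "bracketed with <prompt> and </prompt>."
)

def evolve(parent1, parent2, llm):
    """Ask the LLM to cross over and mutate two parents; return the new prompt or None."""
    query = GA_TEMPLATE.format(p1=parent1, p2=parent2)
    response = llm(query)
    # Take the last bracketed span, in case the response repeats the tags earlier.
    matches = re.findall(r"<prompt>(.*?)</prompt>", response, flags=re.DOTALL)
    return matches[-1].strip() if matches else None
```

Returning `None` when no bracketed prompt is found leaves the retry or discard decision to the caller.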

Update. We employ a straightforward selection strategy for updating the population: at each iteration, EVOPROMPT produces $N$ new prompts, which are merged with the existing population of $N$ prompts. Subsequently, the top $N$ prompts, based on their scores, are retained to form the updated population. Accordingly, the overall quality of the population undergoes continuous enhancement, culminating in the selection of the best one within the final population as the optimal prompt.
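The update step can be sketched as a merge followed by a top-k cut. This is a minimal illustration under stated assumptions: prompts are stored as `(prompt, score)` pairs with scores already computed on the development set, and the names `update_population` and `best_prompt` are hypothetical.

```python
def update_population(population, offspring, n=None):
    """Merge old and new (prompt, score) pairs and keep the n highest-scoring ones.

    By default n is the size of the existing population, so the size stays fixed.
    """
    n = len(population) if n is None else n
    merged = population + offspring
    # sorted() is stable, so on ties existing prompts stay ahead of new ones.
    ranked = sorted(merged, key=lambda pair: pair[1], reverse=True)
    return ranked[:n]

def best_prompt(population):
    """Return the prompt with the highest score in the population."""
    return max(population, key=lambda pair: pair[1])[0]
```

Because the survivors are chosen from the merged pool, the best score in the population never decreases from one iteration to the next.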
