当前位置:高等教育资讯网  >  中国高校课件下载中心  >  大学文库  >  浏览文档

南京大学:《高级优化 Advanced Optimization》课程教学资源(讲稿)Lecture 06 Prediction with Expert Advice - Hedge, minimax bound, lower bound; mirror descent(motivation and preliminary)

资源类别:文库,文档格式:PDF,文档页数:42,文件大小:9.01MB,团购合买
• Problem Setup • Algorithms • Connection to Online Convex Optimization
点击下载完整版文档(PDF)

版像 NJUAT 南京大学 人工智能学院 SCHODL OF ARTIFICIAL INTELUGENCE,NANJING UNIVERSITY Lecture 6.Prediction with Expert Advice Advanced Optimization(Fall 2023) Peng Zhao zhaop@lamda.nju.edu.cn Nanjing University

Lecture 6. Prediction with Expert Advice Peng Zhao zhaop@lamda.nju.edu.cn Nanjing University Advanced Optimization (Fall 2023)

Outline Problem Setup ·Algorithms Connection to Online Convex Optimization Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 2

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 2 Outline • Problem Setup • Algorithms • Connection to Online Convex Optimization

Motivation Consider that we are making predictions based on external experts. OPPENHEIMER 兹西磁 TITANIC 5 A Chinese Odyssey Part Two- Oppenheimer Titanic Cinderella IMDb 豆 IMDb 豆 IMDb 9.2/10 87% 7.8/10 8.8/10 93% 8.5/10 9.5/10 88% 7.9/10 Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 3

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 3 Motivation • Consider that we are making predictions based on external experts. A Chinese Odyssey Part Two - Cinderella Titanic 9.2/10 87% 7.8/10 8.8/10 93% 8.5/10 9.5/10 88% 7.9/10 Oppenheimer

Prediction with Expert Advice Other examples include Weather report: 的1 的2 3 的4 Stock prediction: 81 82 品3 84 Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 4

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 4 Prediction with Expert Advice • Other examples include 1 2 3 4 ? 1 2 3 4 ? Weather report: Stock prediction:

PEA Problem Setup Question 1 Question 2 Question 3 Advice1 Advice,2 Advice3 Advice2 1 Advice22 Advice23 Experts Advices 1 Advices2 Advices3 Advice,1 Advice2 Advice3 Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 5

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 5 PEA Problem Setup Question 1 Question 2 Question 3 Advice1 1 Advice1 2 Advice1 3 Advice2 1 Advice2 2 Advice2 3 Advice3 1 Advice3 2 Advice3 3 Advice4 1 Advice4 2 Advice4 3 Experts

PEA Problem Setup Question 1 Question 2 Question 3 00行00 ==mm==■m 。----=。== Advice,1 Advice,2 Advice,3 1 1 Advice2 1 Advice22 Advice2 3 Experts Advice,1 Advice32 Advices3 Advice,1 Advice,2 Advice3 Learner Answer 1 Answer 2 Answer 3 Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 6

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 6 PEA Problem Setup Question 1 Question 2 Question 3 Advice1 1 Advice1 2 Advice1 3 Advice2 1 Advice2 2 Advice2 3 Advice3 1 Advice3 2 Advice3 3 Advice4 1 Advice4 2 Advice4 3 Experts Learner Answer 1 Answer 2 Answer 3

PEA:Formulization The online learner(player)aims to make the prediction based by combining N experts'advice. At each round t=1,2,... (1)the player first picks a weight p from a simplex AN; (2)and simultaneously environments pick a loss vector eERN; (3)the player suffers loss fi(p:)(p,e),observes e and updates the model. The feasible domain is the(W-l)-dim simplex△v={p∈Rv|p,≥0,∑1pi=l} We typically assume that0≤lt,i≤1holds for all t∈[T]andi∈[W]. Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 7

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 7 PEA: Formulization • The online learner (player) aims to make the prediction based by combining N experts’ advice

PEA:Formulization The online learner(player)aims to make the prediction based by combining N experts'advice. At each round t=1,2,... (1)the player first picks a weight p from a simplex AN; (2)and simultaneously environments pick a loss vector eERN; (3)the player suffers loss fi(p)(p,e),observes e:and updates the model. The goal is to minimize the regret with respect to the best expert: T T t iE(N] Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 8

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 8 PEA: Formulization • The online learner (player) aims to make the prediction based by combining N experts’ advice. • The goal is to minimize the regret with respect to the best expert:

A Natural Solution Follow the Leader (FTL) Select the expert that performs best so far,specifically, pL=arg min (p,Ln-)=argmin L-1, pE△N ie[N] where L-l∈Nis the cumulative loss vector withL-l,i≌∑ls,i T 61,1=0.49 → 21=1 83,1=0 Regr=∑p,L,)-a∑i t=1 N]1 T 2 =O(T) 61,2=0.51 2,2=0 3,2=1 FTL achieves linear regret in the worst case! Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 9

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 9 A Natural Solution • Follow the Leader (FTL) Select the expert that performs best so far, specifically, FTL achieves linear regret in the worst case!

A Natural Solution Follow the Leader (FTL) Select the expert that performs best so far,specifically, p=arg min (p,L-1)=argmin L-1 pE△N iE[N] whereRN is the cumulative loss vector with. Pitfall:decision is actually a one-hot vector,which can be very unstable. Replacing the 'max'operation in FTL by 'softmax'. Advanced Optimization(Fall 2023) Lecture 6.Prediction with Expert Advice 10

Advanced Optimization (Fall 2023) Lecture 6. Prediction with Expert Advice 10 A Natural Solution • Follow the Leader (FTL) Select the expert that performs best so far, specifically, Pitfall: decision is actually a one-hot vector, which can be very unstable. Replacing the ‘max’ operation in FTL by ‘softmax’

点击下载完整版文档(PDF)VIP每日下载上限内不扣除下载券和下载次数;
按次数下载不扣除下载券;
24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
共42页,可试读14页,点击继续阅读 ↓↓
相关文档

关于我们|帮助中心|下载说明|相关软件|意见反馈|联系我们

Copyright © 2008-现在 cucdc.com 高等教育资讯网 版权所有