ๆญฃๅœจๅŠ ่ฝฝๅ›พ็‰‡...
Sample x1,x2,...,xm}from Paata(x) m m o=arg mgx ฮ P6=arg mxlog(xy m arg max logP(xargmx ExalogPo()] ๅฐ (not related to arg max Paata(x)logPg(x)dx-Paata(x)logPaata(x)dx Pe(x) Difference between Pdata and Pe arg max Pata(x)P dx =arg min KL(PaatallPe) Maximum Likelihood Minimize KL DivergenceSample ๐‘ฅ 1 , ๐‘ฅ 2 , โ€ฆ , ๐‘ฅ ๐‘š from ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ฅ ๐œƒ โˆ— = ๐‘Ž๐‘Ÿ๐‘” max ๐œƒ เท‘ ๐‘–=1 ๐‘š ๐‘ƒ๐œƒ ๐‘ฅ ๐‘– = ๐‘Ž๐‘Ÿ๐‘” max ๐œƒ ๐‘™๐‘œ๐‘”เท‘ ๐‘–=1 ๐‘š ๐‘ƒ๐œƒ ๐‘ฅ ๐‘– = ๐‘Ž๐‘Ÿ๐‘” max ๐œƒ เท ๐‘–=1 ๐‘š ๐‘™๐‘œ๐‘”๐‘ƒ๐œƒ ๐‘ฅ ๐‘– โ‰ˆ ๐‘Ž๐‘Ÿ๐‘” max ๐œƒ ๐ธ๐‘ฅ~๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž[๐‘™๐‘œ๐‘”๐‘ƒ๐œƒ ๐‘ฅ ] = ๐‘Ž๐‘Ÿ๐‘” max ๐œƒ เถฑ ๐‘ฅ ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ฅ ๐‘™๐‘œ๐‘”๐‘ƒ๐œƒ ๐‘ฅ ๐‘‘๐‘ฅ โˆ’ เถฑ ๐‘ฅ ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ฅ ๐‘™๐‘œ๐‘”๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ฅ ๐‘‘๐‘ฅ = ๐‘Ž๐‘Ÿ๐‘” min ๐œƒ ๐พ๐ฟ ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž||๐‘ƒ๐œƒ = ๐‘Ž๐‘Ÿ๐‘” max ๐œƒ เถฑ ๐‘ฅ ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ฅ ๐‘™๐‘œ๐‘” ๐‘ƒ๐œƒ ๐‘ฅ ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž ๐‘ฅ ๐‘‘๐‘ฅ Maximum Likelihood = Minimize KL Divergence (not related to ๐œƒ) Difference between ๐‘ƒ๐‘‘๐‘Ž๐‘ก๐‘Ž and ๐‘ƒ๐œƒ
<<ๅ‘ไธŠ็ฟป้กตๅ‘ไธ‹็ฟป้กต>>
©2008-็Žฐๅœจ cucdc.com ้ซ˜็ญ‰ๆ•™่‚ฒ่ต„่ฎฏ็ฝ‘ ็‰ˆๆƒๆ‰€ๆœ‰