
calculated | content


Thoughts on Data Science, Machine Learning, and AI

K-L Divergence: The Kullback-Leibler Divergence is a measure of disparity between

July 2, 2017. Return to: What is Free Energy: Hinton, Helmholtz, and Legendre

P: the true model (the probability density function that the data are drawn from), and Q: the approximating model(s), i.e. the candidate models. The AIC is derived from the KL-divergence: it tries to select the model Q that minimizes the KL-divergence from P. Because P is fixed (it is the distribution that generated the actual data), its entropy term, the integral of p(x) log p(x), is the same for every candidate model, so the AIC is derived only from the component of the KL-divergence that involves q, i.e. the integral of -p(x) log q(x). Minimizing this term (equivalently, maximizing the expected log-likelihood, the integral of p(x) log q(x)) minimizes the KL-divergence, which can never be less than zero. Taken from the Wikipedia page ‘Kullback-Leibler divergence’.
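To make the model-selection argument concrete, here is a minimal sketch for discrete distributions. It assumes the identity KL(P||Q) = cross-entropy(P, Q) - entropy(P), and the distribution `p` and the three candidates are made-up numbers for illustration only. It does not compute the AIC itself; it only shows that ranking candidates by the q-dependent term gives the same winner as ranking by the full KL-divergence.

```python
import math

def entropy(p):
    # H(P) = -sum p(x) log p(x); does not depend on the candidate Q
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def cross_entropy(p, q):
    # -sum p(x) log q(x); the only part of the KL-divergence that involves Q
    # (assumes q(x) > 0 wherever p(x) > 0)
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q) if pi > 0)

def kl_divergence(p, q):
    # KL(P || Q) = cross_entropy(P, Q) - entropy(P)
    return cross_entropy(p, q) - entropy(p)

# Hypothetical "true" distribution and candidate models (illustrative values)
p = [0.5, 0.3, 0.2]
candidates = {
    "q_uniform": [1 / 3, 1 / 3, 1 / 3],
    "q_close": [0.45, 0.35, 0.20],
    "q_far": [0.1, 0.1, 0.8],
}

# Rank by the full KL-divergence and by the q-dependent term alone
best_by_kl = min(candidates, key=lambda name: kl_divergence(p, candidates[name]))
best_by_ce = min(candidates, key=lambda name: cross_entropy(p, candidates[name]))

for name, q in candidates.items():
    print(f"{name}: KL = {kl_divergence(p, q):.4f}, cross-entropy = {cross_entropy(p, q):.4f}")
print("best by KL:", best_by_kl, "| best by cross-entropy:", best_by_ce)
```

Since the entropy of P is subtracted from every candidate's score, it shifts all scores by the same constant and cannot change which candidate comes out lowest. In practice P is unknown, so the cross-entropy term has to be estimated from data, which is where the log-likelihood in the AIC comes in.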
