calculated | content

Thoughts on Data Science, Machine Learning, and AI

Tag: machine learning

Fine-Tuned Llama3.2: Bad Instructions ?

Recently, Meta released the Llama 3.2 1B and 3B Instruct fine-tuned LLMs, to mixed reviews. On the one hand, it’s ranking … More

AI, artificial-intelligence, llm, machine learning, technology

SVDSmoothing LLM Layers with WeightWatcher

Recently, Microsoft Research published the LASER method, “Layer-Selective Rank Reduction,” in the very popular paper The Truth is in There: … More

AI, artificial-intelligence, DATA SCIENCE, llm, machine learning

Evaluating LLMs with WeightWatcher Part III: The Magic of Mistral, a Story of Dragon Kings

Recently, the Mistral models have taken the LLM world by storm. The Mistral Mixture of Experts (MoE) 8x7B model outperforms other … More

AI, artificial-intelligence, llm, machine learning, python

Evaluating Fine-Tuned LLMs with WeightWatcher

If you are fine-tuning your own LLMs, you need a way to evaluate them. And while there are over a dozen … More

AI, artificial-intelligence, generative-ai, llm, machine learning

Why WeightWatcher Works

I am frequently asked: why does weightwatcher work? The weightwatcher tool uses power law fits to model the eigenvalue … More

Deep Learning, machine learning

WeightWatcher: Empirical Quality Metrics for Deep Neural Networks

We introduce weightwatcher (ww), a Python tool for computing quality metrics of trained, and … More

AI, DATA SCIENCE, Deep Learning, KERAS, machine learning, NLP, PYTORCH, TENSORFLOW

Heavy Tailed Self Regularization in Deep Neural Nets: 1 year of research

My talk at ICSI, the International Computer Science Institute at UC Berkeley. ICSI is a leading independent, nonprofit center for research … More

AI, Deep Learning, machine learning, UC Berkeley

Foundations: The Partition Function.

We are going to examine the partition function that arises in Deep Learning methods like Restricted Boltzmann Machines. We take … More

Deep Learning, machine learning

Music Recommendations and the Logistic Metric Embedding

In this post, we are going to see how to build our own music recommender, using the Logistic Metric Embedding … More

machine learning, Music Recommender
