<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Home on Abhinav Chavali</title><link>https://abhichavali.com/</link><description>Recent content in Home on Abhinav Chavali</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Wed, 03 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://abhichavali.com/index.xml" rel="self" type="application/rss+xml"/><item><title>Notes on deep learning</title><link>https://abhichavali.com/posts/deep-learning/</link><pubDate>Wed, 03 Jun 2026 00:00:00 +0000</pubDate><guid>https://abhichavali.com/posts/deep-learning/</guid><description>&lt;h2 id="notes-on-deep-learning">Notes on deep learning&lt;/h2>
&lt;p>A short dummy post about deep learning — placeholder content to exercise the Org → Hugo → deploy pipeline.&lt;/p>
&lt;h3 id="what-is-deep-learning">What is deep learning?&lt;/h3>
&lt;p>Deep learning is a branch of machine learning that uses neural networks with many layers to learn representations from data. Instead of hand-designed features, the model learns hierarchical features: early layers might detect edges, later layers objects or concepts.&lt;/p>
&lt;h3 id="core-ingredients">Core ingredients&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Data&lt;/strong> — labeled examples (images, text, audio) at sufficient scale&lt;/li>
&lt;li>&lt;strong>Architecture&lt;/strong> — CNNs for vision, transformers for language, and hybrids for multimodal tasks&lt;/li>
&lt;li>&lt;strong>Loss function&lt;/strong> — measures how wrong predictions are (cross-entropy, MSE, etc.)&lt;/li>
&lt;li>&lt;strong>Optimizer&lt;/strong> — SGD, Adam, and variants that update weights to reduce loss&lt;/li>
&lt;li>&lt;strong>Compute&lt;/strong> — GPUs or TPUs make training large models practical&lt;/li>
&lt;/ul>
&lt;h3 id="a-minimal-mental-model">A minimal mental model&lt;/h3>
&lt;ol>
&lt;li>Forward pass: input flows through the network to produce a prediction&lt;/li>
&lt;li>Loss: compare prediction to the target&lt;/li>
&lt;li>Backward pass: backpropagation computes gradients&lt;/li>
&lt;li>Update: optimizer adjusts weights; repeat for many epochs&lt;/li>
&lt;/ol>
&lt;h3 id="why-it-matters">Why it matters&lt;/h3>
&lt;p>Deep learning powers modern speech recognition, machine translation, recommendation systems, and generative models. The same training loop — predict, measure error, update — scales from small experiments to billion-parameter models, with engineering around data, stability, and deployment becoming as important as the math.&lt;/p></description></item></channel></rss>