<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Nlp on Hitesh Pattanayak</title><link>/tags/nlp/</link><description>Recent content in Nlp on Hitesh Pattanayak</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sun, 29 Mar 2026 21:14:51 -0700</lastBuildDate><atom:link href="/tags/nlp/index.xml" rel="self" type="application/rss+xml"/><item><title>Vector Embeddings &amp; Similarity: The Foundation of RAG</title><link>/posts/vector-embeddings-similarity-the-foundation-of-rag/</link><pubDate>Sun, 29 Mar 2026 21:14:51 -0700</pubDate><guid>/posts/vector-embeddings-similarity-the-foundation-of-rag/</guid><description>A practical deep-dive into vector embeddings and cosine similarity — the mathematical foundation that makes retrieval in RAG systems actually work.</description></item><item><title>Vector Databases, ANN, and Chunking: Storing Knowledge for Retrieval</title><link>/posts/vector-databases-ann-and-chunking-storing-knowledge-for-retrieval/</link><pubDate>Sun, 29 Mar 2026 21:11:47 -0700</pubDate><guid>/posts/vector-databases-ann-and-chunking-storing-knowledge-for-retrieval/</guid><description>A practical guide for software engineers covering how vector databases use Approximate Nearest Neighbor algorithms to search millions of embeddings efficiently, and how to chunk documents intelligently so your RAG pipeline actually retrieves useful, precise context.</description></item><item><title>Training for Greatness: Speed, BLEU Records, and the Multimodal Vision</title><link>/posts/training-for-greatness-speed-bleu-records-and-the-multimodal-vision/</link><pubDate>Sat, 21 Mar 2026 12:20:48 -0700</pubDate><guid>/posts/training-for-greatness-speed-bleu-records-and-the-multimodal-vision/</guid><description>A practical deep-dive into how the original Transformer model shattered translation benchmarks, slashed training costs, and laid the architectural foundation for every major LLM that followed.</description></item><item><title>Inside the Machine: Encoders, Decoders, and Masking</title><link>/posts/inside-the-machine-encoders-decoders-and-masking/</link><pubDate>Sat, 21 Mar 2026 12:19:38 -0700</pubDate><guid>/posts/inside-the-machine-encoders-decoders-and-masking/</guid><description>A practical deep-dive into how the Transformer&amp;rsquo;s encoder and decoder stacks work, covering residual connections, positional encoding, masked self-attention, and cross-attention with code examples throughout.</description></item><item><title>The End of the RNN Era &amp; The Query, Key, Value Revolution</title><link>/posts/the-end-of-the-rnn-era-the-query-key-value-revolution/</link><pubDate>Sat, 21 Mar 2026 12:06:03 -0700</pubDate><guid>/posts/the-end-of-the-rnn-era-the-query-key-value-revolution/</guid><description>A practical walkthrough of why RNNs hit a fundamental wall with sequential processing and long-range dependencies, and how the Query-Key-Value attention mechanism solves both problems in one elegant step.</description></item><item><title>Sub-Word Tokenization: Breaking Words Like a Pro</title><link>/posts/tokenization/</link><pubDate>Wed, 02 Oct 2024 00:00:00 +0000</pubDate><guid>/posts/tokenization/</guid><description>Take a detour before diving into transformers and explore sub-word tokenization techniques like Byte-Pair Encoding, WordPiece, and Unigram models. Learn how they handle rare words, reduce vocabulary size, and make models more efficient!</description></item></channel></rss>