<?xml version="1.0" encoding="utf-8" ?><rss version="2.0"><channel><title>Bing: Quantization Example Scale and Scale Back</title><link>http://www.bing.com:80/search?q=Quantization+Example+Scale+and+Scale+Back</link><description>Search results</description><image><url>http://www.bing.com:80/s/a/rsslogo.gif</url><title>Quantization Example Scale and Scale Back</title><link>http://www.bing.com:80/search?q=Quantization+Example+Scale+and+Scale+Back</link></image><copyright>Copyright © 2026 Microsoft. All rights reserved. These XML results may not be used, reproduced or transmitted in any manner or for any purpose other than rendering Bing results within an RSS aggregator for your personal, non-commercial use. Any other use of these results requires express written permission from Microsoft Corporation. By accessing this web page or using these results in any manner whatsoever, you agree to be bound by the foregoing restrictions.</copyright><item><title>Quantization (signal processing) - Wikipedia</title><link>https://en.wikipedia.org/wiki/Quantization_(signal_processing)</link><description>In mathematics and digital signal processing, quantization is the process of mapping input values from a large set (often a continuous set) to output values in a (countable) smaller set, often with a finite number of elements. Rounding and truncation are typical examples of quantization processes.</description><pubDate>Tue, 23 Jun 2026 15:36:00 GMT</pubDate></item><item><title>What is Quantization - GeeksforGeeks</title><link>https://www.geeksforgeeks.org/deep-learning/quantization-in-deep-learning/</link><description>Quantization is a model optimization technique that reduces the precision of numerical values such as weights and activations in models to make them faster and more efficient. It helps lower memory usage, model size, and computational cost while maintaining almost the same level of accuracy. Quantization Need of Quantization In Large Language Models which contain billions of parameters ...</description><pubDate>Sun, 21 Jun 2026 23:16:00 GMT</pubDate></item><item><title>Model Quantization: Concepts, Methods, and Why It Matters</title><link>https://developer.nvidia.com/blog/model-quantization-concepts-methods-and-why-it-matters/</link><description>Quantization reduces the precision of model parameters and activations (for example, from FP32/FP16 to FP8) to shrink memory footprint, improve inference speed, and lower energy consumption, while carefully trading off some accuracy; for transformers, this applies to three main elements: weights, activations, and the KV cache in decoder-only LLMs. The post explains how different floating-point ...</description><pubDate>Tue, 23 Jun 2026 16:33:00 GMT</pubDate></item><item><title>Quantization · Hugging Face</title><link>https://huggingface.co/docs/optimum/concept_guides/quantization</link><description>We’re on a journey to advance and democratize artificial intelligence through open source and open science.</description><pubDate>Wed, 10 Jun 2026 03:19:00 GMT</pubDate></item><item><title>What Is Quantization? | How It Works &amp; Applications</title><link>https://www.mathworks.com/discovery/quantization.html</link><description>Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the context of simulation and embedded computing, it is about approximating real-world values with a digital representation that introduces limits on the precision and range of a value.</description><pubDate>Mon, 22 Jun 2026 09:10:00 GMT</pubDate></item><item><title>What is quantization? - IBM</title><link>https://www.ibm.com/think/topics/quantization</link><description>Quantization is the process of reducing the precision of a digital signal, typically from a higher-precision format to a lower-precision format. This technique is widely used in various fields, including signal processing, data compression and machine learning.</description><pubDate>Fri, 19 Jun 2026 07:34:00 GMT</pubDate></item><item><title>Quantization - Wikipedia</title><link>https://en.wikipedia.org/wiki/Quantization</link><description>Quantization is the process of constraining an input from a continuous or otherwise large set of values (such as the real numbers) to a discrete set (such as the integers).</description><pubDate>Mon, 22 Jun 2026 05:07:00 GMT</pubDate></item><item><title>A Visual Guide to Quantization - by Maarten Grootendorst</title><link>https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-quantization</link><description>In this post, I will introduce the field of quantization in the context of language modeling and explore concepts one by one to develop an intuition about the field. We will explore various methodologies, use cases, and the principles behind quantization. In this visual guide, there are more than 50 custom visuals to help you develop an intuition about quantization!</description><pubDate>Tue, 23 Jun 2026 06:39:00 GMT</pubDate></item><item><title>Digital Communication - Quantization - Online Tutorials Library</title><link>https://www.tutorialspoint.com/digital_communication/digital_communication_quantization.htm</link><description>Quantization Noise It is a type of quantization error, which usually occurs in analog audio signal, while quantizing it to digital. For example, in music, the signals keep changing continuously, where a regularity is not found in errors. Such errors create a wideband noise called as Quantization Noise.</description><pubDate>Tue, 23 Jun 2026 00:19:00 GMT</pubDate></item><item><title>What Is Quantization? Optimizing Data Compression - Coursera</title><link>https://www.coursera.org/articles/quantization</link><description>What is quantization? Quantization is a technique for reducing the size of machine learning models without sacrificing accuracy or function. Large language models (LLMs), or signal processing models, often use a high volume of high-precision data.</description><pubDate>Sat, 20 Jun 2026 21:44:00 GMT</pubDate></item></channel></rss>