LLM.int8() and Emergent Features
When I attended NAACL, I wanted to run a little test. I had two pitches for my LLM.int8() paper. One pitch is about how I use advanced quantization methods to achieve transformer inference at scale with no performance degradation, which makes large models more accessible. The other pitch talks about emergent outliers in transformers and how …