A bag of tricks to reduce training or inference latency, or the memory and storage requirements, of large language models.
A Hacker's Guide to LLM Optimization