A bag of tricks to reduce training or inference latency, as well as memory and storage requirements, for large language models.
A Hacker's Guide to LLM Optimization