SW/HW Co-optimization Strategy for Large Language Models (LLMs)

How to stretch every bit out of your system to run LLMs faster? — best practice

Liz Li
Towards Data Science
5 min readDec 16, 2023

--

Leading Large Language Models (LLMs) like ChatGPT, Llama, etc. are revolutionizing the tech industry and impacting everyone’s lives. However, their cost poses a significant hurdle. Applications utilizing OpenAI APIs incur substantial expenses for continuous operation ($0.03 per 1,000 prompt tokens and $0.06 per 1,000 sampled tokens).

--

--

Sr. AI architect. Talks about artificial intelligence, deep learning, computer architecture, disruptive technology, start up, personal growth, and more