Description

SHARK is an open-source toolkit for high-performance serving of popular generative AI and large language models.