LLMShare: Optimizing LLM Inference Serving with Hardware Architecture Exploration

Publication
ACM/IEEE Design Automation Conference (DAC)

Add the full text or supplementary notes for the publication here using Markdown formatting.