All Events

  • Speaker: Prof. Hongyang Zhang, University of Waterloo
    Preview of The EAGLE Series: Lossless Inference Acceleration for LLMs
    This talk presents the EAGLE series, a groundbreaking approach to accelerating large language model inference without compromising output quality. Instead of traditional token-level processing, EAGLE operates at the more structured feature level and incorporates sampling results to reduce uncertainty. The technology has gained significant industry adoption, with integration into major frameworks including vLLM, SGLang, and TensorRT-LLM, as well as frameworks from AWS and Intel.
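    The draft-and-verify pattern that underlies this style of lossless acceleration can be illustrated with a minimal sketch. The sketch below is not EAGLE itself (which drafts at the feature level rather than with a separate token-level model); it only shows the generic speculative loop, with hypothetical `draft_next` and `target_argmax` callables standing in for the draft and target models and greedy acceptance used for clarity.

    ```python
    # Illustrative draft-and-verify loop behind speculative decoding, the family
    # of techniques the EAGLE series builds on. `draft_next` and `target_argmax`
    # are hypothetical stand-ins; EAGLE itself drafts at the feature level, which
    # this token-level sketch does not reproduce.
    from typing import Callable, List

    def speculative_decode(
        draft_next: Callable[[List[int]], int],     # cheap draft model: next token id
        target_argmax: Callable[[List[int]], int],  # expensive target model: next token id
        prompt: List[int],
        max_new_tokens: int = 64,
        draft_len: int = 4,
    ) -> List[int]:
        tokens = list(prompt)
        while len(tokens) - len(prompt) < max_new_tokens:
            # 1. Draft a short continuation cheaply.
            draft = []
            for _ in range(draft_len):
                draft.append(draft_next(tokens + draft))
            # 2. Verify the draft against the target model and keep the longest
            #    agreeing prefix (greedy acceptance, for clarity).
            accepted = 0
            for i, tok in enumerate(draft):
                if target_argmax(tokens + draft[:i]) == tok:
                    accepted += 1
                else:
                    break
            tokens.extend(draft[:accepted])
            # 3. The target always contributes the next token itself, so the
            #    output is identical to plain greedy decoding with the target.
            tokens.append(target_argmax(tokens))
        return tokens[: len(prompt) + max_new_tokens]
    ```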
  • Speaker: Dr. Zhengzhong (Hector) Liu, MBZUAI
    Preview of LLM360: From 360° Open Source to 360° Collaboration in AI
    The LLM360 project advances AI through open-source foundation models and datasets. This talk explores key initiatives, including K2, the most capable fully open-source language model, and TxT360; examines what open source truly means; and proposes new approaches to academic and industry collaboration in open-source AI.
  • Speaker: Prof. Tianqi Chen, CMU
    Preview of Enable Large Language Model Deployment Across Cloud and Edge with ML Compilation
    In this talk, we will discuss the lessons learned in building an efficient large language model deployment system for both server and edge settings. We will cover general techniques in machine learning compilation and system support for efficient structured generation. We will also discuss future opportunities in system co-design for cloud-edge model deployment.
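    To make the structured-generation idea concrete, here is a minimal, illustrative sketch of grammar-constrained decoding: at each step, tokens that would violate the required structure are masked out of the logits before sampling. The `next_logits` and `allowed_token_ids` callables are hypothetical stand-ins, not the API of any system covered in the talk, and the grammar is assumed to always permit at least one token.

    ```python
    # Minimal sketch of structured (grammar-constrained) generation: at each step,
    # tokens that would break the required structure are masked out of the logits
    # before sampling. Names here are illustrative, not a real system's API.
    import math
    import random
    from typing import Callable, List, Set

    def constrained_generate(
        next_logits: Callable[[List[int]], List[float]],     # model forward pass
        allowed_token_ids: Callable[[List[int]], Set[int]],  # grammar/automaton check
        prompt: List[int],
        eos_id: int,
        max_new_tokens: int = 32,
    ) -> List[int]:
        tokens = list(prompt)
        for _ in range(max_new_tokens):
            logits = next_logits(tokens)
            allowed = allowed_token_ids(tokens)  # assumed non-empty at every step
            # Mask disallowed tokens, then sample from the renormalized distribution.
            masked = [l if i in allowed else -math.inf for i, l in enumerate(logits)]
            m = max(masked)
            probs = [math.exp(l - m) for l in masked]
            total = sum(probs)
            r, acc = random.random() * total, 0.0
            choice = max(allowed, key=lambda i: logits[i])  # fallback: best allowed token
            for i, p in enumerate(probs):
                acc += p
                if p > 0.0 and r <= acc:
                    choice = i
                    break
            tokens.append(choice)
            if choice == eos_id:
                break
        return tokens
    ```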