UC San Diego ML Systems Group

We are a group of faculty, researchers, and students working at the intersection of machine learning and systems. Our current members span the Computer Science and Engineering Department (CSE) and the Halıcıoğlu Data Science Institute (HDSI) at the University of California, San Diego. Our research covers a broad range of topics, from building next-generation systems for machine learning to developing new algorithms.

Research Areas

Systems for ML/AI

ML/AI for building systems

ML compilers and runtimes

Distributed ML/AI

Hardware-software co-design for ML/AI

ML/AI system benchmarks and datasets

AIGC and agents

AI systems for science

News & Events

Published on
March 6, 2025
MLSys-Seminar
The EAGLE Series: Lossless Inference Acceleration for LLMs
Speaker: Prof. Hongyang Zhang, University of Waterloo
This talk presents the EAGLE series, a groundbreaking approach to accelerating large language model inference without compromising output quality. Instead of traditional token-level processing, EAGLE operates at the structured feature level and incorporates sampling results to reduce uncertainty. The technology has gained significant industry adoption, with integration into major frameworks including vLLM, SGLang, TensorRT-LLM, and several others from AWS and Intel.
Published on
February 20, 2025
MLSys-Seminar
LLM360: From 360° Open Source to 360° Collaboration in AI
Speaker: Dr. Zhengzhong (Hector) Liu, MBZUAI
The LLM360 project advances AI through open-source foundation models and datasets. This talk explores key initiatives including K2, the most capable fully open-source language model, and TxT360, examining the true meaning of open source while proposing new approaches to academic and industry collaboration in open-source AI.
Published on
February 6, 2025
MLSys-Seminar
Enable Large Language Model Deployment Across Cloud and Edge with ML Compilation
Speaker: Prof. Tianqi Chen, CMU
In this talk, we will discuss lessons learned in building an efficient large language model deployment system for both server and edge settings. We will cover general techniques in machine learning compilation and system support for efficient structure generation. We will also discuss future opportunities in system co-design for cloud-edge model deployments.
Published on
September 12, 2024
news
Our paper received an ACM SIGSOFT Distinguished Paper Award
Our ISSTA'24 paper "Multi-modal Learning for WebAssembly Reverse Engineering" received an ACM SIGSOFT Distinguished Paper Award.
Published on
July 1, 2024
news
Hanxian Huang selected as a 2024 MLCommons Rising Star
Congratulations to MLSys group student Hanxian Huang on being selected as a 2024 MLCommons Rising Star. She was one of 41 junior researchers selected from over 170 applicants worldwide. MLCommons Rising Stars are chosen for excellence in machine learning (ML) and systems research and for their current contributions and future potential.

Sponsors