Vikranth Srivatsa

WukLab. Sysnet. Sky/RiseLab

prof_pic.jpeg

2nd year PhD Student at UCSD

I’m a 2nd-year PhD student at WukLab, focusing on ML systems and LLM inference, under the guidance of Professor Yiying Zhang. My research covers load balancing, memory management, and efficient scheduling for large language models.

I completed my undergraduate and master’s degrees in Electrical Engineering and Computer Science at UC Berkeley, where I worked at RISE Labs with Professor Joseph Gonzalez. My research there included serverless execution across multi-cloud, 5G, and edge computing with Moustafa Adelbaky, and machine learning distribution shift explainability with Yaoqing Yang and Yaodong Yu.

news

Oct 04, 2024 Preprint update of Preble: Efficient Distributed Prompt Scheduling for LLM Serving comparison with latest SGLang
May 01, 2024 🎉 InferCept was accepted to ICML 2024!

latest posts

selected publications

  1. aug-llm-inference-xl.gif
    INFERCEPT: Efficient Intercept Support for Augmented Large Language Model Inference
    Reyna Abhyankar, Zijian He, Vikranth Srivatsa, and 2 more authors
    In Forty-first International Conference on Machine Learning, Jul 2024
  2. preble.gif
    Preble: Efficient Distributed Prompt Scheduling for LLM Serving
    Vikranth Srivatsa, Zijian He, Reyna Abhyankar, and 2 more authors
    arXiv preprint arXiv: 2407.00023, Jul 2024
  3. waterbirds.png
    The Effect of Model Size on Worst-Group Generalization
    Alan Le Pham, Eunice Chan, Vikranth Srivatsa, and 6 more authors
    In NeurIPS 2021 Workshop on Distribution Shifts: Connecting Methods and Applications, Jul 2021
  4. cloudless_architecture.jpg
    Cloudless and Mixclaves
    Vikranth Srivatsa
    EECS Department, University of California, Berkeley, May 2023